Sciweavers

498 search results - page 58 / 100
» Robust web content extraction
CIKM
2009
Springer
14 years 2 months ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison
ICASSP
2010
IEEE
13 years 6 months ago
CBCD based on color features and landmark MDS-assisted distance estimation
Content-Based Copy Detection (CBCD) of digital videos is an important research field that aims at the identification of modified copies of an original clip, e.g., on the Intern...
Marzia Corvaglia, Fabrizio Guerrini, Riccardo Leon...
IUI
2004
ACM
14 years 25 days ago
Evaluating adaptive user profiles for news classification
Never before have so many information sources been available. Most are accessible on-line and some exist on the Internet alone. However, this large information quantity makes inte...
Ricardo Carreira, Jaime M. Crato
DC
2001
13 years 8 months ago
Metadata Interoperability and Meta-search on the Web
Several initiatives for establishing standards for metadata models are being carried out at the moment, but everyone focuses on their own requirements when defining metadata attri...
Enric Peig, Jaime Delgado, Ismael Pérez
AGENTS
1997
Springer
13 years 11 months ago
A Scalable Comparison-Shopping Agent for the World-Wide Web
The World-Wide-Web is less agent-friendly than we might hope. Most information on the Web is presented in loosely structured natural language text with no agent-readable semantics...
Robert B. Doorenbos, Oren Etzioni, Daniel S. Weld