Sciweavers

2189 search results - page 41 / 438
» Webbed documents
CIKM
2005
Springer
14 years 1 month ago
Document quality models for web ad hoc retrieval
The quality of document content, which is an issue that is usually ignored for the traditional ad hoc retrieval task, is a critical issue for Web search. Web pages have a huge var...
Yun Zhou, W. Bruce Croft
MAICS
2004
13 years 9 months ago
Intelligent Content Based Title and Author Name Extraction from Formatted Documents
This paper describes the development of algorithms for extracting the title and the names of the authors from documents available on the World Wide Web. In this paper we describe ...
Eric G. Berkowitz, Mohamed Reda Elkhadiri, Tim Sah...
WWW
2008
ACM
14 years 8 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs), groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
SEMWEB
2010
Springer
13 years 2 months ago
Semantic search on the Web
Web search is a key technology of the Web, since it is the primary way to access content on the Web. Current standard Web search is essentially based on a combination of textual ke...
Bettina Fazzinga, Thomas Lukasiewicz
CIKM
2011
Springer
12 years 7 months ago
Relative effect of spam and irrelevant documents on user interaction with search engines
Meaningful evaluation of web search must take account of spam. Here we conduct a user experiment to investigate whether satisfaction with search engine result pages as a whole is ...
Timothy Jones, David Hawking, Paul Thomas, Ramesh ...