Sciweavers

» Unweaving a web of documents
ECIR
2008
Springer
13 years 11 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have quite similar source code and DOM tree structure. G...
Thomas Gottron
LAWEB
2006
IEEE
14 years 4 months ago
Analysis of Web Search Engine Clicked Documents
In this paper we process and analyze web search engine query and click data from the perspective of the documents (URLs) selected. We initially define possible document categor...
David F. Nettleton, Liliana Calderón-Benavi...
RIAO
2007
13 years 11 months ago
From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations
Many documents on the Web are formatted in a weakly structured format. Because of their weak semantics and because of the heterogeneity of their formats, the information conveyed by...
Guillaume Wisniewski, Patrick Gallinari
ACL
2009
13 years 8 months ago
Exploiting Bilingual Information to Improve Web Search
Web search quality can vary widely across languages, even for the same information need. We propose to exploit this variation in quality by learning a ranking function on bilingua...
Wei Gao, John Blitzer, Ming Zhou, Kam-Fai Wong
IADIS
2003
13 years 11 months ago
Significance of HTML Tags for Document Indexing and Retrieval
Indexing quality has an overwhelming effect on retrieval effectiveness of search engines. In the past few years it has become one of the major challenges in the search engines are...
Byurhan Hyusein, Ahmed Patel