Sciweavers

1380 search results - page 107 / 276
» Combination of Document Priors in Web Information Retrieval
Sort
View
WSDM
2010
ACM
210views Data Mining» more  WSDM 2010»
14 years 5 months ago
Leveraging Temporal Dynamics of Document Content in Relevance Ranking
Many web documents are dynamic, with content changing in varying amounts at varying frequencies. However, current document search algorithms have a static view of the document con...
Jonathan L. Elsas, Susan T. Dumais
IJCAI
2001
13 years 9 months ago
Mining Soft-Matching Rules from Textual Data
Text mining concerns the discovery of knowledge from unstructured textual data. One important task is the discovery of rules that relate specific words and phrases. Although exist...
Un Yong Nahm, Raymond J. Mooney
WWW
2005
ACM
14 years 9 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
ITCC
2003
IEEE
14 years 1 months ago
The Effects of Search Engines and Query Operators on Top Ranked Results
We examine whether the use of query operators changes the documents retrieved by three popular Web search engines. One hundred queries containing query operators were selected fro...
Bernard J. Jansen, Caroline M. Eastman
ICDE
2010
IEEE
200views Database» more  ICDE 2010»
13 years 8 months ago
Towards better entity resolution techniques for Web document collections
— As person names are non-unique, the same name on different Web pages might or might not refer to the same real-world person. This entity identification problem is one of the m...
Surender Reddy Yerva, Zoltán Miklós,...