Sciweavers

186 search results - page 5 / 38
» Experiments in Term Weighting and Keyword Extraction in Docu...
Sort
View
IJCAI
2001
13 years 8 months ago
Keyword Spices: A New Method for Building Domain-Specific Web Search Engines
This paper presents a new method for building domain-specific web search engines. Previous methods eliminate irrelevant documents from the pages accessed using heuristics based on...
Satoshi Oyama, Takashi Kokubo, Toru Ishida, Teruhi...
WWW
2009
ACM
13 years 11 months ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...
SIGIR
2008
ACM
13 years 6 months ago
Pagerank based clustering of hypertext document collections
Clustering hypertext document collection is an important task in Information Retrieval. Most clustering methods are based on document content and do not take into account the hype...
Konstantin Avrachenkov, Vladimir Dobrynin, Danil N...
CIS
2005
Springer
14 years 13 days ago
Concept Chain Based Text Clustering
Different from familiar clustering objects, text documents have sparse data spaces. A common way of representing a document is as a bag of its component words, but the semantic re...
Shaoxu Song, Jian Zhang, Chunping Li
SIGIR
2012
ACM
11 years 9 months ago
Automatic term mismatch diagnosis for selective query expansion
People are seldom aware that their search queries frequently mismatch a majority of the relevant documents. This may not be a big problem for topics with a large and diverse set o...
Le Zhao, Jamie Callan