Sciweavers

416 search results - page 50 / 84
» Utilizing Passage-Based Language Models for Document Retriev...
Sort
View
WWW
2011
ACM
13 years 2 months ago
Identifying primary content from web pages and its application to web search ranking
Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In ot...
Srinivas Vadrevu, Emre Velipasaoglu
KDD
2009
ACM
169views Data Mining» more  KDD 2009»
14 years 3 months ago
On burstiness-aware search for document sequences
As the number and size of large timestamped collections (e.g. sequences of digitized newspapers, periodicals, blogs) increase, the problem of efficiently indexing and searching su...
Theodoros Lappas, Benjamin Arai, Manolis Platakis,...
CIKM
2005
Springer
14 years 2 months ago
Predicting accuracy of extracting information from unstructured text collections
Exploiting lexical and semantic relationships in large unstructured text collections can significantly enhance managing, integrating, and querying information locked in unstructur...
Eugene Agichtein, Silviu Cucerzan
SIGIR
2012
ACM
11 years 11 months ago
Clarity re-visited
We present a novel interpretation of Clarity [5], a widely used query performance predictor. While Clarity is commonly described as a measure of the “distance” between the lan...
Shay Hummel, Anna Shtok, Fiana Raiber, Oren Kurlan...
ICML
2005
IEEE
14 years 9 months ago
Learn to weight terms in information retrieval using category information
How to assign appropriate weights to terms is one of the critical issues in information retrieval. Many term weighting schemes are unsupervised. They are either based on the empir...
Rong Jin, Joyce Y. Chai, Luo Si