Sciweavers

1018 search results - page 41 / 204
» Document Representation in Natural Language Text Retrieval
Sort
View
SIGIR
2009
ACM
14 years 3 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
DCC
2008
IEEE
14 years 8 months ago
Word-Based Statistical Compressors as Natural Language Compression Boosters
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to compress natural language texts. With compression ratios around 30%, they allow di...
Antonio Fariña, Gonzalo Navarro, José...
SIGIR
2004
ACM
14 years 1 months ago
Focused named entity recognition using machine learning
In this paper we study the problem of finding most topical named entities among all entities in a document, which we refer to as focused named entity recognition. We show that th...
Li Zhang, Yue Pan, Tong Zhang
SOCIALCOM
2010
13 years 6 months ago
Opinion Summarization in Bengali: A Theme Network Model
Theme network is a semantic network of document specific themes. So far Natural Language Processing (NLP) research patronized much of topic based summarizer system, unable to captu...
Amitava Das, Sivaji Bandyopadhyay
RIAO
1994
13 years 9 months ago
An Association Thesaurus for Information Retrieval
Although commonly used in both commercial and experimental information retrieval systems, thesauri have not demonstrated consistent bene ts for retrieval performance, and it is di...
Bruce Croft, Jing Yufeng