Sciweavers

1018 search results - page 41 / 204
» Document Representation in Natural Language Text Retrieval
SIGIR
2009
ACM
15 years 10 months ago
Identifying the original contribution of a document via language modeling
One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
DCC
2008
IEEE
16 years 3 months ago
Word-Based Statistical Compressors as Natural Language Compression Boosters
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to compress natural language texts. With compression ratios around 30%, they allow di...
Antonio Fariña, Gonzalo Navarro, José...
SIGIR
2004
ACM
15 years 9 months ago
Focused named entity recognition using machine learning
In this paper we study the problem of finding most topical named entities among all entities in a document, which we refer to as focused named entity recognition. We show that th...
Li Zhang, Yue Pan, Tong Zhang
SOCIALCOM
2010
15 years 1 months ago
Opinion Summarization in Bengali: A Theme Network Model
Theme network is a semantic network of document specific themes. So far Natural Language Processing (NLP) research patronized much of topic based summarizer system, unable to captu...
Amitava Das, Sivaji Bandyopadhyay
RIAO
1994
15 years 5 months ago
An Association Thesaurus for Information Retrieval
Although commonly used in both commercial and experimental information retrieval systems, thesauri have not demonstrated consistent benefits for retrieval performance, and it is di...
Bruce Croft, Jing Yufeng