large text collections

EMNLP 2009

Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...

David M. Mimno, Hanna M. Wallach, Jason Naradowsky...

APWEB 2006, Springer

The Case of the Duplicate Documents: Measurement, Search, and Science

Many of the documents in large text collections are duplicates and versions of each other. In recent research, we developed new methods for finding such duplicates; however, as the...

Justin Zobel, Yaniv Bernstein

CIKM 2000, Springer

Collection Selection and Results Merging with Topically Organized U.S. Patents and TREC Data

We investigate three issues in distributed information retrieval, considering both TREC data and U.S. Patents: (1) topical organization of large text collections, (2) collection r...

Leah S. Larkey, Margaret E. Connell, James P. Call...

SAC 2005, ACM

Mining concept associations for knowledge discovery in large textual databases

In this paper, we describe a new approach for mining concept associations from large text collections. The concepts are short sequences of words that occur frequently together acr...

Xiaowei Xu, Mutlu Mete, Nurcan Yuruk
