document collections

170

UAI
2008

216views Artificial Intelligence» more UAI 2008»

15 years 8 months ago

Latent topic models have been successfully applied as an unsupervised topic discovery technique in large document collections. With the proliferation of hypertext document collect...

Amit Gruber, Michal Rosen-Zvi, Yair Weiss

claim paper

Read More »

184

click to vote

INFOSCALE
2007
ACM

104views Information Technology» more INFOSCALE 2007»

Query-driven indexing for scalable peer-to-peer text retrieval

15 years 8 months ago

Download lsirpeople.epfl.ch

We present a query-driven algorithm for the distributed indexing of large document collections within structured P2P networks. To cope with bandwidth consumption that has been ide...

Gleb Skobeltsyn, Toan Luu, Ivana Podnar Zarko, Mar...

claim paper

Read More »

188

click to vote

DAS
2008
Springer

127views Document Analysis» more DAS 2008»

HistoSketch: A Semi-Automatic Annotation Tool for Archival Documents

15 years 8 months ago

Download www.karatzas.co.uk

This article describes a sketch-based framework for semi-automatic annotation of historical document collections. It is motivated by the fact that fully automatic methods, while h...

Joan Mas, José A. Rodríguez, Dimosth...

claim paper

Read More »

189

click to vote

CIKM
2008
Springer

162views Information Technology» more CIKM 2008»

15 years 8 months ago

Peer-to-peer similarity search over widely distributed document collections

Download www.idi.ntnu.no

This paper addresses the challenging problem of similarity search over widely distributed ultra-high dimensional data. Such an application is retrieval of the top-k most similar d...

Christos Doulkeridis, Kjetil Nørvåg, ...

claim paper

Read More »

186

Voted

DEXA
2006
Springer

193views Database» more DEXA 2006»

Understanding and Enhancing the Folding-In Method in Latent Semantic Indexing

15 years 10 months ago

Download bayou.cs.ucdavis.edu

Abstract. Latent Semantic Indexing(LSI) has been proved to be effective to capture the semantic structure of document collections. It is widely used in content-based text retrieval...

Xiang Wang 0002, Xiaoming Jin

claim paper

Read More »

182

click to vote

VLDB
1994
ACM

148views Database» more VLDB 1994»

Fast Incremental Indexing for Full-Text Information Retrieval

15 years 10 months ago

Download reference.kfupm.edu.sa

Full-text information retrieval systems have traditionally been designed for archival environments. They often provide little or no support for adding new documents to an existing...

Eric W. Brown, James P. Callan, W. Bruce Croft

claim paper

Read More »

158

click to vote

CIKM
1997
Springer

133views Information Technology» more CIKM 1997»

The Need for Metrics in Visual Information Analysis

15 years 10 months ago

Download infoviz.pnl.gov

CT This paper explores several methods for visualizing the thematic content of large document collections. As opposed to traditional query-driven document retrieval, these methods ...

Nancy Miller, Elizabeth G. Hetzler, Grant Nakamura...

claim paper

Read More »

160

click to vote

SIGMOD
2000
ACM

85views Database» more SIGMOD 2000»

Finding Replicated Web Collections

15 years 11 months ago

Download ilpubs.stanford.edu

Many web documents (such as JAVA FAQs) are being replicated on the Internet. Often entire document collections (such as hyperlinked Linux manuals) are being replicated many times....

Junghoo Cho, Narayanan Shivakumar, Hector Garcia-M...

claim paper

Read More »

171

click to vote

ERCIMDL
2001
Springer

132views Education» more ERCIMDL 2001»

A Combined Phrase and Thesaurus Browser for Large Document Collections

15 years 11 months ago

Download comminfo.rutgers.edu

A hierarchical browsing interface to a document collection can be constructed by identifying the phrases that recur in the full text of the documents and structuring them into a h...

Gordon W. Paynter, Ian H. Witten

claim paper

Read More »

162

click to vote

VLDB
2005
ACM

126views Database» more VLDB 2005»

Hubble: An Advanced Dynamic Folder Technology for XML

16 years 3 days ago

Download www.vldb2005.org

A significant amount of information is stored in computer systems today, but people are struggling to manage their documents such that the information is easily found. XML is a de...

Ning Li, Joshua Hui, Hui-I Hsiao, Kevin S. Beyer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers