Search Sciweavers | Sciweavers

142 search results - page 4 / 29

» Entropy-Based Authorship Search in Large Document Collection...

265

click to vote

ICDE
2004
IEEE

151views Database» more ICDE 2004»

Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks

16 years 7 months ago

Download cis.poly.edu

We study the problem of maintaining large replicated collections of files or documents in a distributed environment with limited bandwidth. This problem arises in a number of impo...

Torsten Suel, Patrick Noel, Dimitre Trendafilov

claim paper

Read More »

173

click to vote

CIKM
2010
Springer

175views Information Technology» more CIKM 2010»

Improved index compression techniques for versioned document collections

15 years 4 months ago

Download cis.poly.edu

Current Information Retrieval systems use inverted index structures for eﬃcient query processing. Due to the extremely large size of many data sets, these index structures are u...

Jinru He, Junyuan Zeng, Torsten Suel

claim paper

Read More »

157

click to vote

SAC
2005
ACM

195views Applied Computing» more SAC 2005»

A hierarchical naive Bayes mixture model for name disambiguation in author citations

15 years 11 months ago

Download clgiles.ist.psu.edu

Because of name variations, an author may have multiple names and multiple authors may share the same name. Such name ambiguity affects the performance of document retrieval, web ...

Hui Han, Wei Xu, Hongyuan Zha, C. Lee Giles

claim paper

Read More »

151

click to vote

EDBT
2004
ACM

133views Database» more EDBT 2004»

HOPI: An Efficient Connection Index for Complex XML Document Collections

16 years 5 months ago

Download wwwcs.upb.de

In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2?hop cover of a directed graph introduced by Cohen et al. In contrast to most o...

Ralf Schenkel, Anja Theobald, Gerhard Weikum

claim paper

Read More »

138

click to vote

CIKM
2010
Springer

143views Information Technology» more CIKM 2010»

Document allocation policies for selective searching of distributed indexes

15 years 4 months ago

Download www.cs.cmu.edu

Indexes for large collections are often divided into shards that are distributed across multiple computers and searched in parallel to provide rapid interactive search. Typically,...

Anagha Kulkarni, Jamie Callan

claim paper

Read More »

« Prev « First page 4 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers