Search Sciweavers | Sciweavers

142 search results - page 5 / 29

» Entropy-Based Authorship Search in Large Document Collection...

198

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 5 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

163

click to vote

KDD
2009
ACM

169views Data Mining» more KDD 2009»

On burstiness-aware search for document sequences

16 years 19 days ago

Download www-ai.cs.uni-dortmund.de

As the number and size of large timestamped collections (e.g. sequences of digitized newspapers, periodicals, blogs) increase, the problem of eﬃciently indexing and searching su...

Theodoros Lappas, Benjamin Arai, Manolis Platakis,...

claim paper

Read More »

211

click to vote

SIGIR
2012
ACM

242views Information Technology» more SIGIR 2012»

Optimizing positional index structures for versioned document collections

13 years 8 months ago

Download cis.poly.edu

Versioned document collections are collections that contain multiple versions of each document. Important examples are Web archives, Wikipedia and other wikis, or source code and ...

Jinru He, Torsten Suel

claim paper

Read More »

255

click to vote

IR
2008

96views Natural Language Processing» more IR 2008»

Output-sensitive autocompletion search

15 years 5 months ago

Download www.mpi-inf.mpg.de

We consider the following autocompletion search scenario: imagine a user of a search engine typing a query; then with every keystroke display those completions of the last query wo...

Holger Bast, Christian Worm Mortensen, Ingmar Webe...

claim paper

Read More »

204

click to vote

SIGIR
2011
ACM

220views Information Technology» more SIGIR 2011»

Pseudo test collections for learning web search ranking functions

14 years 8 months ago

Download www.cs.umd.edu

Test collections are the primary drivers of progress in information retrieval. They provide a yardstick for assessing the eﬀectiveness of ranking functions in an automatic, rapi...

Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy L...

claim paper

Read More »

« Prev « First page 5 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers