SIGIR
2006
ACM

Finding near-duplicate web pages: a large-scale evaluation of algorithms

15 years 12 months ago

Broder et al.’s [3] shingling algorithm and Charikar’s [4] random projection based approach are considered “state-of-the-art” algorithms for finding near-duplicate web pag...

Monika Rauch Henzinger

SIGIR
2006
ACM

Building a test collection for complex document information processing

15 years 12 months ago

Download www.ir.iit.edu

Research and development of information access technology for scanned paper documents has been hampered by the lack of public test collections of realistic scope and complexity. A...

David D. Lewis, Gady Agam, Shlomo Argamon, Ophir F...

SIGIR
2006
ACM

Measuring similarity of semi-structured documents with context weights

15 years 12 months ago

Download www.ischool.drexel.edu

In this work, we study similarity measures for text-centric XML documents based on an extended vector space model, which considers both document content and structure. Experimenta...

Christopher C. Yang, Nan Liu

SIGIR
2006
ACM

Learning a ranking from pairwise preferences

15 years 12 months ago

Download ciir.cs.umass.edu

We introduce a novel approach to combining rankings from multiple retrieval systems. We use a logistic regression model or an SVM to learn a ranking from pairwise document prefere...

Ben Carterette, Desislava Petkova

SIGIR
2006
ACM

User expectations from XML element retrieval

15 years 12 months ago

Download www.dcs.gla.ac.uk

The primary aim of XML element retrieval is to return to users XML elements, rather than whole documents. This poster describes a small study, in which we elicited users’ expect...

Stamatina Betsi, Mounia Lalmas, Anastasios Tombros...
