Search Sciweavers | Sciweavers

112

SIGIR
2008
ACM

104views Information Technology» more SIGIR 2008»

Compressed collections for simulated crawling

15 years 2 months ago

Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...

Alessio Orlandi, Sebastiano Vigna

claim paper

Read More »

114

Voted

SIGIR
2010
ACM

205views Information Technology» more SIGIR 2010»

Adaptive near-duplicate detection via similarity learning

15 years 6 months ago

Download research.microsoft.com

In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...

Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz

claim paper

Read More »

116

click to vote

ESWS
2008
Springer

101views Internet Technology» more ESWS 2008»

Improving Interoperability Using Query Interpretation in Semantic Vector Spaces

15 years 4 months ago

Download www.eswc2008.org

Abstract. In semantic web applications where query initiators and information providers do not necessarily share the same ontology, semantic interoperability generally relies on on...

Anthony Ventresque, Sylvie Cazalens, Philippe Lama...

claim paper

Read More »

109

click to vote

EMNLP
2008

182views Natural Language Processing» more EMNLP 2008»

HTM: A Topic Model for Hypertexts

15 years 4 months ago

Download www.aclweb.org

Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...

Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li

claim paper

Read More »

139

Voted

TREC
2007

156views Information Technology» more TREC 2007»

Overview of the TREC 2007 Question Answering Track

15 years 3 months ago

Download trec.nist.gov

The TREC 2007 question answering (QA) track contained two tasks: the main task consisting of series of factoid, list, and “Other” questions organized around a set of targets, ...

Hoa Trang Dang, Diane Kelly, Jimmy J. Lin

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers