Sciweavers

» Webbed documents
SIGIR
2008
ACM
15 years 2 months ago
Compressed collections for simulated crawling
Collections are a fundamental tool for reproducible evaluation of information retrieval techniques. We describe a new method for distributing the document lengths and term counts ...
Alessio Orlandi, Sebastiano Vigna
SIGIR
2010
ACM
15 years 6 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
ESWS
2008
Springer
15 years 4 months ago
Improving Interoperability Using Query Interpretation in Semantic Vector Spaces
In semantic web applications where query initiators and information providers do not necessarily share the same ontology, semantic interoperability generally relies on on...
Anthony Ventresque, Sylvie Cazalens, Philippe Lama...
EMNLP
2008
15 years 4 months ago
HTM: A Topic Model for Hypertexts
Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li
TREC
2007
15 years 3 months ago
Overview of the TREC 2007 Question Answering Track
The TREC 2007 question answering (QA) track contained two tasks: the main task consisting of a series of factoid, list, and “Other” questions organized around a set of targets, ...
Hoa Trang Dang, Diane Kelly, Jimmy J. Lin