Sciweavers

472 search results - page 40 / 95
» Crawling the Hidden Web
Sort
View
EACL
2006
ACL Anthology
13 years 9 months ago
Large Linguistically-Processed Web Corpora for Multiple Languages
The Web contains vast amounts of linguistic data. One key issue for linguists and language technologists is how to access it. Commercial search engines give highly compromised acc...
Marco Baroni, Adam Kilgarriff
SIGIR
2012
ACM
11 years 10 months ago
Creating temporally dynamic web search snippets
Content on the Internet is always changing. We explore the value of biasing search result snippets towards new webpage content. We present results from a user study comparing trad...
Krysta Marie Svore, Jaime Teevan, Susan T. Dumais,...
WWW
2003
ACM
14 years 27 days ago
AnswerBus News Engine
AnswerBus News Engine1 is a question answering system using the contents of CNN Web site2 as its knowledge base. Comparing to other question answering systems including its previo...
Zhiping Zheng
CLEF
2010
Springer
13 years 8 months ago
MapReduce for Information Retrieval Evaluation: "Let's Quickly Test This on 12 TB of Data"
We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use ...
Djoerd Hiemstra, Claudia Hauff
CORR
2010
Springer
102views Education» more  CORR 2010»
13 years 7 months ago
MIREX: MapReduce Information Retrieval Experiments
We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use...
Djoerd Hiemstra, Claudia Hauff