Search Sciweavers | Sciweavers

281 search results - page 24 / 57

» Introducing the Enron Corpus

146

click to vote

ACL
1997

115views Computational Linguistics» more ACL 1997»

String Transformation Learning

15 years 7 months ago

Download www.aclweb.org

String transformation systems have been introduced in (Brill, 1995) and have several applications in natural language processing. In this work we consider the computational proble...

Giorgio Satta, John C. Henderson

claim paper

Read More »

166

click to vote

WWW
2006
ACM

107views Internet Technology» more WWW 2006»

Random sampling from a search engine's index

16 years 6 months ago

Download webee.technion.ac.il

We revisit a problem introduced by Bharat and Broder almost a decade ago: how to sample random pages from the corpus of documents indexed by a search engine, using only the search...

Ziv Bar-Yossef, Maxim Gurevich

claim paper

Read More »

181

click to vote

CLEF
2006
Springer

152views Information Technology» more CLEF 2006»

MSRA Columbus at GeoCLEF 2006

15 years 9 months ago

Download www.clef-campaign.org

This paper describes the participation of Columbus Project of Microsoft Research Asia (MSRA) in the GeoCLEF 2006 (a cross-language geographical retrieval track which is part of Cr...

Zhisheng Li, Chong Wang 0002, Xing Xie, Xufa Wang,...

claim paper

Read More »

136

click to vote

WWW
2007
ACM

108views Internet Technology» more WWW 2007»

Efficient search engine measurements

16 years 6 months ago

Download www2007.org

We address the problem of measuring global quality metrics of search engines, like corpus size, index freshness, and density of duplicates in the corpus. The recently proposed est...

Ziv Bar-Yossef, Maxim Gurevich

claim paper

Read More »

129

click to vote

TSD
2007
Springer

88views Signal Processing» more TSD 2007»

On the Relative Hardness of Clustering Corpora

16 years 3 days ago

Download users.dsic.upv.es

Abstract. Clustering is often considered the most important unsupervised learning problem and several clustering algorithms have been proposed over the years. Many of these algorit...

David Pinto, Paolo Rosso

claim paper

Read More »

« Prev « First page 24 / 57 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers