Sciweavers

76 search results - page 8 / 16
» Harvesting for Full-Text Retrieval
Sort
View
ERCIMDL
2003
Springer
109views Education» more  ERCIMDL 2003»
14 years 21 days ago
Distributed IR for Digital Libraries
Abstract. This paper examines technology developed to support largescale distributed digital libraries. We describe the method used for harvesting collection information using stan...
Ray R. Larson
WWW
2006
ACM
14 years 8 months ago
Estimating required recall for successful knowledge acquisition from the web
Information on the Web is not only abundant but also redundant. This redundancy of information has an important consequence on the relation between the recall of an information ga...
Wolfgang Gatterbauer
WWW
2009
ACM
14 years 8 months ago
A densitometric analysis of web template content
What makes template content in the Web so special that we need to remove it? In this paper I present a large-scale aggregate analysis of textual Web content, corroborating statist...
Christian Kohlschütter
KDD
2006
ACM
118views Data Mining» more  KDD 2006»
14 years 8 months ago
Mining for proposal reviewers: lessons learned at the national science foundation
In this paper, we discuss a prototype application deployed at the U.S. National Science Foundation for assisting program directors in identifying reviewers for proposals. The appl...
Seth Hettich, Michael J. Pazzani
JCDL
2003
ACM
145views Education» more  JCDL 2003»
14 years 22 days ago
Automatic Disambiguation of Latin Abbreviations in Early Modern Texts for Humanities Digital Libraries
Early modern books written in Latin contain many abbreviations of common words that are derived from earlier manuscript practice. While these abbreviations are usually easily deci...
Jeffrey A. Rydberg-Cox