Sciweavers

1372 search results - page 82 / 275
» Information retrieval on Turkish texts
Sort
View
CIKM
2005
Springer
14 years 3 months ago
Similarity measures for tracking information flow
Text similarity spans a spectrum, with broad topical similarity near one extreme and document identity at the other. Intermediate levels of similarity – resulting from summariza...
Donald Metzler, Yaniv Bernstein, W. Bruce Croft, A...
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 7 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
ERCIMDL
2005
Springer
114views Education» more  ERCIMDL 2005»
14 years 3 months ago
Compressing Dynamic Text Collections via Phrase-Based Coding
We present a new statistical compression method, which we call Phrase Based Dense Code (PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection to ...
Nieves R. Brisaboa, Antonio Fariña, Gonzalo...
NLDB
2005
Springer
14 years 3 months ago
The Role of Word Sense Disambiguation in Automated Text Categorization
Abstract. Automated Text Categorization has reached the levels of accuracy of human experts. Provided that enough training data is available, it is possible to learn accurate autom...
José María Gómez Hidalgo, Man...
WWW
2008
ACM
14 years 10 months ago
Folksoviz: a subsumption-based folksonomy visualization using wikipedia texts
In this paper, targeting del.icio.us tag data, we propose a method, FolksoViz, for deriving subsumption relationships between tags by using Wikipedia texts, and visualizing a folk...
Kangpyo Lee, Hyunwoo Kim, Chungsu Jang, Hyoung-Joo...