Sciweavers

317 search results - page 9 / 64
» Style-independent document labeling: design and performance ...
Sort
View
WEBI
2009
Springer
14 years 2 months ago
Full-Subtopic Retrieval with Keyphrase-Based Search Results Clustering
We consider the problem of retrieving multiple documents relevant to the single subtopics of a given web query, termed “full-subtopic retrieval”. To solve this problem we pres...
Andrea Bernardini, Claudio Carpineto, Massimiliano...
DGO
2006
148views Education» more  DGO 2006»
13 years 9 months ago
Automatically labeling hierarchical clusters
Government agencies must often quickly organize and analyze large amounts of textual information, for example comments received as part of notice and comment rulemaking. Hierarchi...
Pucktada Treeratpituk, Jamie Callan
ICDAR
2009
IEEE
14 years 2 months ago
Evaluating Retraining Rules for Semi-Supervised Learning in Neural Network Based Cursive Word Recognition
Training a system to recognize handwritten words is a task that requires a large amount of data with their correct transcription. However, the creation of such a training set, inc...
Volkmar Frinken, Horst Bunke
CIKM
2009
Springer
14 years 2 months ago
Improving web page classification by label-propagation over click graphs
In this paper, we present a semi-supervised learning method for web page classification, leveraging click logs to augment training data by propagating class labels to unlabeled si...
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffn...
ICDAR
1999
IEEE
13 years 12 months ago
WISDOM++: An Interactive and Adaptive Document Analysis System
WISDOM++ is a document analysis system whose main design requirements are real-time user interaction and adaptivity. This paper presents the two-phased skew estimation algorithm a...
Oronzo Altamura, Floriana Esposito, Donato Malerba