Sciweavers

264 search results - page 14 / 53
» Clustering Documents with Active Learning Using Wikipedia
EMNLP
2008
13 years 9 months ago
An Analysis of Active Learning Strategies for Sequence Labeling Tasks
Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed ...
Burr Settles, Mark Craven
ICDAR
2003
IEEE
14 years 27 days ago
Unsupervised Feature Selection Using Multi-Objective Genetic Algorithms for Handwritten Word Recognition
In this paper a methodology for feature selection in unsupervised learning is proposed. It makes use of a multiobjective genetic algorithm where the minimization of the number of ...
Marisa E. Morita, Robert Sabourin, Flávio B...
MLDM
2005
Springer
14 years 1 month ago
CorePhrase: Keyphrase Extraction for Document Clustering
The ability to discover the topic of a large set of text documents using relevant keyphrases is usually regarded as a very tedious task if done by hand. Automatic keyphra...
Khaled M. Hammouda, Diego N. Matute, Mohamed S. Ka...
HIS
2003
13 years 9 months ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
ICIP
2001
IEEE
14 years 9 months ago
Image data mining from financial documents based on wavelet features
In this paper, we present a framework for clustering and classifying cheque images according to their payee-line content. The features used in the clustering and classification pro...
Ossama El Badawy, Mahmoud R. El-Sakka, Khaled Hass...