Sciweavers

523 search results - page 15 / 105
» Metric Learning for Text Documents
ICDM
2008
IEEE
15 years 10 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
SSPR
2010
Springer
15 years 1 months ago
Impact of Visual Information on Text and Content Based Image Retrieval
Nowadays, multimedia documents composed of text and images are increasingly used, thanks to the Internet and the increasing capacity of data storage. It is more and more ...
Christophe Moulin, Christine Largeron, Mathias G&e...
SIGIR
2010
ACM
15 years 7 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
IJCAI
2003
15 years 5 months ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
AND
2009
15 years 1 months ago
A comprehensive evaluation methodology for noisy historical document recognition techniques
In this paper, we propose a new comprehensive methodology in order to evaluate the performance of noisy historical document recognition techniques. We aim to evaluate not only the...
Nikolaos Stamatopoulos, Georgios Louloudis, Basili...