Sciweavers

359 search results - page 13 / 72
» Document clustering using word clusters via the information ...
Sort
View
ECAI
2000
Springer
13 years 11 months ago
Enriching very large ontologies using the WWW
This paper explores the possibility to exploit text on the world wide web in order to enrich the concepts in existing ontologies. First, a method to retrieve documents from the WWW...
Eneko Agirre, Olatz Ansa, Eduard H. Hovy, David Ma...
IRCDL
2007
13 years 9 months ago
An Hybrid Approach for Improving Word Sense Disambiguation and Text Clustering
Abstract— In this paper we suggest a new approach to represent text document collections, integrating background knowledge to improve clustering effectiveness. Background knowled...
Paolo Casoto, Carlo Tasso
ICDAR
2003
IEEE
14 years 23 days ago
Indexing and retrieval of words in old documents
This paper describes a system for efficient indexing and retrieval of words in collections of document images. The proposed method is based on two main principles: unsupervised pr...
Simone Marinai, Emanuele Marino, Giovanni Soda
ICDM
2007
IEEE
129views Data Mining» more  ICDM 2007»
14 years 1 months ago
Semi-supervised Clustering Using Bayesian Regularization
Text clustering is most commonly treated as a fully automated task without user supervision. However, we can improve clustering performance using supervision in the form of pairwi...
Zuobing Xu, Ram Akella, Mike Ching, Renjie Tang
ICTIR
2009
Springer
14 years 2 months ago
A New Measure of the Cluster Hypothesis
Abstract. We have found that the nearest neighbor (NN) test is an insufficient measure of the cluster hypothesis. The NN test is a local measure of the cluster hypothesis. Designer...
Mark D. Smucker, James Allan