Sciweavers

328 search results - page 23 / 66
» A Multi-level Approach for Document Clustering
Sort
View
COLING
2010
13 years 2 months ago
Streaming Cross Document Entity Coreference Resolution
Previous research in cross-document entity coreference has generally been restricted to the offline scenario where the set of documents is provided in advance. As a consequence, t...
Delip Rao, Paul McNamee, Mark Dredze
SIGIR
2008
ACM
13 years 7 months ago
Knowledge transformation from word space to document space
In most IR clustering problems, we directly cluster the documents, working in the document space, using cosine similarity between documents as the similarity measure. In many real...
Tao Li, Chris H. Q. Ding, Yi Zhang 0005, Bo Shao
SIGIR
2009
ACM
14 years 2 months ago
A comparison of retrieval-based hierarchical clustering approaches to person name disambiguation
This paper describes a simple clustering approach to person name disambiguation of retrieved documents. The methods are based on standard IR concepts and do not require any task-s...
Christof Monz, Wouter Weerkamp
KES
2005
Springer
14 years 1 months ago
OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation
This paper describes a theoretical approach on data mining, information classifying and a global overview of our OntoExtractor application, concerning the analysis of incoming data...
Zhan Cui, Ernesto Damiani, Marcello Leida, Marco V...
ICDAR
2009
IEEE
14 years 2 months ago
Unsupervised HMM Adaptation Using Page Style Clustering
In this paper we present an innovative two-stage adaptation approach for handwriting recognition that is based on clustering of similar pages in the training data. In our approach...
Huaigu Cao, Rohit Prasad, Shirin Saleem, Premkumar...