Sciweavers

1125 search results - page 34 / 225
» A flocking based algorithm for document clustering analysis
Sort
View
ICDAR
2007
IEEE
14 years 4 months ago
Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents
Logical entity recognition in heterogeneous collections of document page images remains a challenging problem since the performance of traditional supervised methods degrade drama...
S. Chen, S. Mao, G. Thoma
ACL
1997
13 years 11 months ago
Document Classification Using a Finite Mixture Model
We propose a new method of classifying documents into categories. We define for each category a finite mixture model based on soft clustering of words. We treat the problem of cla...
Hang Li, Kenji Yamanishi
ICDAR
2009
IEEE
14 years 5 months ago
Unsupervised HMM Adaptation Using Page Style Clustering
In this paper we present an innovative two-stage adaptation approach for handwriting recognition that is based on clustering of similar pages in the training data. In our approach...
Huaigu Cao, Rohit Prasad, Shirin Saleem, Premkumar...
INEX
2005
Springer
14 years 3 months ago
A Flexible Structured-Based Representation for XML Document Mining
This paper reports on the INRIA group’s approach to XML mining while participating in the INEX XML Mining track 2005. We use a flexible representation of XML documents that allo...
Anne-Marie Vercoustre, Mounir Fegas, Saba Gul, Yve...
ESORICS
2005
Springer
14 years 3 months ago
Privacy Preserving Clustering
The freedom and transparency of information flow on the Internet has heightened concerns of privacy. Given a set of data items, clustering algorithms group similar items together...
Somesh Jha, Louis Kruger, Patrick McDaniel