Sciweavers

1125 search results - page 11 / 225
» A flocking based algorithm for document clustering analysis
Sort
View
ICDAR
2009
IEEE
14 years 2 months ago
Enhanced Text Extraction from Arabic Degraded Document Images Using EM Algorithm
This paper presents a new enhanced text extraction algorithm from degraded document images on the basis of the probabilistic models. The observed document image is considered as a...
Wafa Boussellaa, Aymen Bougacha, Abderrazak Zahour...
DAS
2010
Springer
13 years 8 months ago
Context-aware and content-based dynamic Voronoi page segmentation
This paper presents a dynamic approach to document page segmentation based on inter-component relationships and their local features. State-of-the art page segmentation algorithms...
Mudit Agrawal, David S. Doermann
MMM
2011
Springer
368views Multimedia» more  MMM 2011»
12 years 11 months ago
Correlated PLSA for Image Clustering
Probabilistic Latent Semantic Analysis (PLSA) has become a popular topic model for image clustering. However, the traditional PLSA method considers each image (document) independen...
Peng Li, Jian Cheng, Zechao Li, Hanqing Lu
KDD
2002
ACM
166views Data Mining» more  KDD 2002»
14 years 8 months ago
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu
GRC
2005
IEEE
14 years 1 months ago
Semantic based clustering of Web documents
Abstract. A new methodology that structures the semantics of a collection of documents into the geometry of a simplicial complex is developed. A simplicial complex is topologically...
Tsau Young Lin, I-Jen Chiang