Sciweavers

832 search results - page 37 / 167
» Document clustering with committees
Sort
View
EEE
2005
IEEE
14 years 2 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
ICPR
2004
IEEE
14 years 10 months ago
Coordinate Systems Reconstruction for Graphical Documents by Hough-feature Clustering and Geometric Analysis
Two-dimensional and three-dimensional coordinate systems are the basic graphics symbols in many graphical documents. A robust coordinate system detection scheme is needed in order...
Chew Lim Tan, Yan Ping Zhou
IJCAI
2001
13 years 10 months ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...
EMNLP
2009
13 years 6 months ago
Unsupervised morphological segmentation and clustering with document boundaries
Many approaches to unsupervised morphology acquisition incorporate the frequency of character sequences with respect to each other to identify word stems and affixes. This typical...
Taesun Moon, Katrin Erk, Jason Baldridge
CIKM
2004
Springer
14 years 2 months ago
Stemming and lemmatization in the clustering of finnish text documents
Under construction… Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval – clustering. General Terms Algorithms, Expe...
Tuomo Korenius, Jorma Laurikkala, Kalervo Jär...