Sciweavers

808 search results - page 36 / 162
» Keyword-based document clustering
Sort
View
WEBI
2005
Springer
14 years 1 months ago
Integrating Element and Term Semantics for Similarity-Based XML Document Clustering
Structured link vector model (SLVM) is a recently proposed document representation that takes into account both structural and semantic information for measuring XML document simi...
Jianwu Yang, William K. Cheung, Xiaoou Chen
KDD
2007
ACM
231views Data Mining» more  KDD 2007»
14 years 8 months ago
Xproj: a framework for projected structural clustering of xml documents
XML has become a popular method of data representation both on the web and in databases in recent years. One of the reasons for the popularity of XML has been its ability to encod...
Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua F...
EEE
2005
IEEE
14 years 1 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
ICPR
2004
IEEE
14 years 8 months ago
Coordinate Systems Reconstruction for Graphical Documents by Hough-feature Clustering and Geometric Analysis
Two-dimensional and three-dimensional coordinate systems are the basic graphics symbols in many graphical documents. A robust coordinate system detection scheme is needed in order...
Chew Lim Tan, Yan Ping Zhou
IJCAI
2001
13 years 9 months ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...