Clustering XML Documents by Structure

15 years 12 months ago

Download www.dbnet.ece.ntua.gr

This work explores the application of clustering methods for grouping structurally similar XML documents. Modeling the XML documents as rooted ordered labeled trees, we apply clustering algorithms using distances that estimate the similarity between those trees in terms of the hierarchical relationships of their nodes. We suggest the usage of tree structural summaries to improve the performance of the distance calculation and at the same time to maintain or even improve its quality. Experimental results are provided using a prototype testbed.

Theodore Dalamagas, Tao Cheng, Klaas-Jan Winkel, T

Real-time Traffic

Artificial Intelligence | SETN 2004 | Similar Xml Documents | Tree Structural Summaries | XML Documents |

claim paper

» A methodology for clustering XML documents by structure

» Clustering XML Documents Using Structural Summaries

» Clustering XML Documents Using Selforganizing Maps for Structures

» Combining Structure and Content Similarities for XML Document Clustering

» FRACTURE mining Mining frequently and concurrently mutating structures from historical XML...

» XEdge clustering homogeneous and heterogeneous XML documents using edge summaries

» A Flexible StructuredBased Representation for XML Document Mining

» A SelfOrganising Map Approach for Clustering of XML Documents

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	SETN
Authors	Theodore Dalamagas, Tao Cheng, Klaas-Jan Winkel, Timos K. Sellis

Comments (0)

Sciweavers

Clustering XML Documents by Structure

Artificial Intelligence | SETN 2004 | Similar Xml Documents | Tree Structural Summaries | XML Documents |

Explore & Download

Productivity Tools

Sciweavers