Efficiently mining frequent trees in a forest

15 years 1 months ago

Download www.lans.ece.utexas.edu

Mining frequent trees is very useful in domains like bioinformatics, web mining, mining semi-structured data, and so on. We formulate the problem of mining (embedded) subtrees in a forest of rooted, labeled, and ordered trees. We present TreeMiner, a novel algorithm to discover all frequent subtrees in a forest, using a new data structure called scope-list. We contrast TreeMiner with a pattern matching tree mining algorithm (PatternMatcher). We conduct detailed experiments to test the performance and scalability of these methods. We find that TreeMiner outperforms the pattern matching approach by a factor of 4 to 20, and has good scaleup properties. We also present an application of tree mining to analyze real web logs for usage patterns.

Mohammed Javeed Zaki

Real-time Traffic

Data Mining | KDD 2002 | Mining Frequent Trees | Mining Semi-structured Data | Tree Mining Algorithm |

claim paper

Post Info
More Details (n/a)

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2002
Where	KDD
Authors	Mohammed Javeed Zaki

Comments (0)

Sciweavers

Efficiently mining frequent trees in a forest

Data Mining | KDD 2002 | Mining Frequent Trees | Mining Semi-structured Data | Tree Mining Algorithm |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers