Online Algorithms for Mining Semi-structured Data Stream

15 years 11 hour ago

Download www-ikn.ist.hokudai.ac.jp

In this paper, we study an online data mining problem from streams of semi-structured data such as XML data. Modeling semi-structured data and patterns as labeled ordered trees, we present an online algorithm StreamT that receives fragments of an unseen possibly inﬁnite semistructured data in the document order through a data stream, and can return the current set of frequent patterns immediately on request at any time. A crucial part of our algorithm is the incremental maintenance of the occurrences of possibly frequent patterns using a tree sweeping technique. We give modiﬁcations of the algorithm to other online mining model. We present theoretical and empirical analyses to evaluate the performance of the algorithm.

Tatsuya Asai, Hiroki Arimura, Kenji Abe, Shinji Ka

Real-time Traffic

Algorithm | Data Mining | Frequent Patterns | ICDM 2002 | Semi-structured Data |

claim paper

Related Content

» Kernels for SemiStructured Data

» Combining the web content and usage mining to understand the visitor behavior in a web sit...

» Designing an inductive data stream management system the stream mill experience

» Online mining of frequent query trees over XML data streams

» Finding recent frequent itemsets adaptively over online data streams

» LOCUST An Online Analytical Processing Framework for High Dimensional Classification of Da...

» MAIDS Mining Alarming Incidents from Data Streams

» Online clustering of parallel data streams

» CostEfficient Mining Techniques for Data Streams

Post Info
More Details (n/a)

Added	14 Jul 2010
Updated	14 Jul 2010
Type	Conference
Year	2002
Where	ICDM
Authors	Tatsuya Asai, Hiroki Arimura, Kenji Abe, Shinji Kawasoe, Setsuo Arikawa

Comments (0)

Sciweavers

Online Algorithms for Mining Semi-structured Data Stream

Algorithm | Data Mining | Frequent Patterns | ICDM 2002 | Semi-structured Data |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers