Sciweavers

DEXA
2004
Springer

On Efficient and Effective Association Rule Mining from XML Data

14 years 4 months ago
On Efficient and Effective Association Rule Mining from XML Data
: In this paper, we propose a framework, called XAR-Miner, for mining ARs from XML documents efficiently and effectively. In XAR-Miner, raw XML data are first transformed to either an Indexed Content Tree (IX-tree) or Multi-relational databases (Multi-DB), depending on the size of XML document and memory constraint of the system, for efficient data selection in the AR mining. Concepts that are relevant to the AR mining task are generalized to produce generalized metapatterns. A suitable metric is devised for measuring the degree of concept generalization in order to prevent under-generalization or over-generalization. Resultant generalized meta-patterns are used to generate large ARs that meet the support and confidence levels. An efficient AR mining algorithm is also presented based on candidate AR generation in the hierarchy of generalized meta-patterns. The experiments show that XAR-Miner is more efficient in performing a large number of AR mining tasks from XML documents than the s...
Ji Zhang, Tok Wang Ling, Robert M. Bruckner, A. Mi
Added 01 Jul 2010
Updated 01 Jul 2010
Type Conference
Year 2004
Where DEXA
Authors Ji Zhang, Tok Wang Ling, Robert M. Bruckner, A. Min Tjoa, Han Liu
Comments (0)