Sciweavers

ICDE
2004
IEEE

A Frequency-based Approach for Mining Coverage Statistics in Data Integration

15 years 27 days ago
A Frequency-based Approach for Mining Coverage Statistics in Data Integration
Query optimization in data integration requires source coverage and overlap statistics. Gathering and storing the required statistics presents many challenges, not the least of which is controlling the amount of statistics learned. In this paper we introduce StatMiner, a novel statistics mining approach which automatically generates attribute value hierarchies, efficiently discovers frequently accessed query classes based on the learned attribute value hierarchies, and learns statistics only with respect to these classes. We describe the details of our method, and present experimental results demonstrating the efficiency and effectiveness of our approach. Our experiments are done in the context of BibFinder, a publicly fielded bibliography mediator.
Zaiqing Nie, Subbarao Kambhampati
Added 01 Nov 2009
Updated 01 Nov 2009
Type Conference
Year 2004
Where ICDE
Authors Zaiqing Nie, Subbarao Kambhampati
Comments (0)