Sciweavers

VLDB
2005
ACM

Statistical Learning Techniques for Costing XML Queries

14 years 5 months ago
Statistical Learning Techniques for Costing XML Queries
Developing cost models for query optimization is significantly harder for XML queries than for traditional relational queries. The reason is that XML query operators are much more complex than relational operators such as table scans and joins. In this paper, we propose a new approach, called Comet, to modeling the cost of XML operators; to our knowledge, Comet is the first method ever proposed for addressing the XML query costing problem. As in relational cost estimation, Comet exploits a set of system catalog statistics that summarizes the XML data; the set of “simple path” statistics that we propose is new, and is well suited to the XML setting. Unlike the traditional approach, Comet uses a new statistical learning technique called “transform regression” instead of detailed analytical models to predict the overall cost. Besides rendering the cost estimation problem tractable for XML queries, Comet has the further advantage of enabling the query optimizer to be self-tuning...
Ning Zhang 0002, Peter J. Haas, Vanja Josifovski,
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where VLDB
Authors Ning Zhang 0002, Peter J. Haas, Vanja Josifovski, Guy M. Lohman, Chun Zhang
Comments (0)