Fast and effective histogram construction

15 years 9 months ago

Download www.comp.nus.edu.sg

Histogram construction or sequence segmentation is a basic task with applications in database systems, information retrieval, and knowledge management. Its aim is to approximate a sequence by line segments. Unfortunately, the quadratic algorithm that derives an optimal histogram for Euclidean error lacks the desired scalability. Therefore, sophisticated approximation algorithms have been recently proposed, while several simple heuristics are used in practice. Still, these solutions fail to resolve the eﬃciency-quality tradeoﬀ in a satisfactory manner. In this paper we take a fresh view on the problem. We propose conceptually clear and scalable algorithms that eﬃciently derive high-quality histograms. We experimentally demonstrate that existing approximation schemes fail to deliver the desired eﬃciency and conventional heuristics do not fare well on the side of quality. On the other hand, our schemes match or exceed the quality of the former and the eﬃciency of the latter. Ca...

Felix Halim, Panagiotis Karras, Roland H. C. Yap

Real-time Traffic