Queries issued by casual users or specialists exploring a data set often point us to important subsets of the data, be it clusters, outliers or other features of particular importance. Capturing and caching such queries (henceforth called nuggets) has many potential benefits, including the optimization of both the performance of the underlying system as well as the search experience of users. Unfortunately, current visual exploration systems, while facilitating data exploration by providing graphical depictions of the data, have not yet tapped into this potential resource of identifying and sharing important queries. In this paper, we introduce a query consolidation strategy aimed at solving the general problem of isolating important queries from the potentially huge amount of queries submitted. Particularly, our solution clusters redundant queries caused by exploration-style query specification, which is prevalent in data exploration systems. Then, it generates a representative for...
Di Yang, Elke A. Rundensteiner, Matthew O. Ward