There is growing public concern about personal data collected by both the private and public sectors. People have very little control over what kinds of data are stored and how such da...
Abstract. We analyse data from the Edinburgh Mouse Atlas Gene Expression Database (EMAGE), which is a high-quality data source for spatio-temporal gene expression patterns. Using a n...
Estimating the result size of a join is an important query optimization problem as it determines the choice of a good query evaluation strategy. Yet, there are few efficient techni...
Data mining algorithms use various trie- and bitmap-based representations to optimize support (i.e., frequency) counting performance. In this paper, we compare the memory requi...
Data mining is most commonly used in attempts to induce association rules from transaction data. Most previous studies focused on binary-valued transaction data. Transaction data i...