Mining High Utility Itemsets from a transaction database is to find itemsests that have utility above a user-specified threshold. This problem is an extension of Frequent Itemset Mining, which discovers itemsets that occur frequently (i.e. with occurrence count larger than a user given value). The problem of finding High Utility Itemsets is challenging, because the anti-monotone property so useful for pruning the search space in conventional Frequent Itemset Mining does not apply to it. In this paper we propose a new algorithm called CTU-PRO that mines high utility itemsets by bottom up traversal of a compressed utility pattern (CUP) tree. We have tested our algorithm on several sparse and dense data sets, comparing it with the recent algorithms for High Utility Itemset Mining and the results show that our algorithm works more efficiently. 1
Alva Erwin, Raj P. Gopalan, N. R. Achuthan