Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses



Closed frequent itemset discovery is a fundamental problem of data mining, with applications in numerous domains. It is thus very important to have efficient parallel algorithms to solve this problem, capable of efficiently harnessing the power of the multicore processors found in our computers (notebooks as well as desktops). In this paper we present PLCMQS, a parallel algorithm based on the LCM algorithm, recognized as the most efficient algorithm for sequential discovery of closed frequent itemsets. We also present a simple yet powerful parallelism interface based on the concept of Tuple Space, which allows efficient dynamic sharing of the work. Through a detailed experimental study, we show that PLCMQS is the only algorithm generic enough to efficiently compute closed frequent itemsets on both sparse and dense databases, thus improving the state of the art.
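
For readers unfamiliar with the term, the sketch below illustrates what a closed frequent itemset is: an itemset that occurs in at least `min_support` transactions and equals the intersection of all transactions that contain it, so no larger itemset has the same support. This is a brute-force illustration only, not LCM or PLCMQS; the function name `closed_frequent_itemsets`, its parameters, and the toy database are assumptions introduced here for the example.

```python
from itertools import combinations


def closed_frequent_itemsets(transactions, min_support):
    """Brute-force enumeration of closed frequent itemsets.

    Illustrative only: exponential in the number of items, and
    unrelated to how LCM or PLCMQS explore the search space.
    """
    if min_support < 1:
        raise ValueError("min_support must be at least 1")
    transactions = [frozenset(t) for t in transactions]
    items = sorted(set().union(*transactions))
    closed = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            candidate = frozenset(combo)
            # Transactions that contain every item of the candidate.
            covering = [t for t in transactions if candidate <= t]
            if len(covering) < min_support:
                continue  # not frequent
            # The closure is the intersection of all covering transactions.
            closure = frozenset.intersection(*covering)
            if closure == candidate:
                closed[candidate] = len(covering)
    return closed


if __name__ == "__main__":
    # Hypothetical toy database of four transactions.
    db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b"}]
    for itemset, support in closed_frequent_itemsets(db, 2).items():
        print(sorted(itemset), support)
```

On this toy database, `{c}` is frequent but not closed, because every transaction containing `c` also contains `a`; the closed itemset `{a, c}` carries the same support information.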

Benjamin Négrevergne, Alexandre Termier, Je


Algorithm | Applied Computing | Frequent Itemset Discovery | Frequent Itemsets | IEEEHPCS 2010


Post Info

Added	26 Jan 2011
Updated	26 Jan 2011
Type	Journal
Year	2010
Where	IEEEHPCS
Authors	Benjamin Négrevergne, Alexandre Termier, Jean-François Méhaut, Takeaki Uno
