
VLDB
1999
ACM

SPIRIT: Sequential Pattern Mining with Regular Expression Constraints

Discovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventional mining systems provide users with only a very restricted mechanism (based on minimum support) for specifying patterns of interest. In this paper, we propose the use of Regular Expressions (REs) as a flexible constraint specification tool that enables user-controlled focus to be incorporated into the pattern mining process. We develop a family of novel algorithms (termed SPIRIT – Sequential Pattern mIning with Regular expressIon consTraints) for mining frequent sequential patterns that also satisfy user-specified RE constraints. The main distinguishing factor among the proposed schemes is the degree to which the RE constraints are enforced to prune the search space of patterns during computation. Our solutions provide valuable insights into the tradeoffs that arise when constraints that do not subscri...
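To make the problem statement concrete, here is a minimal sketch of mining frequent sequential patterns that also satisfy an RE constraint. It is not one of the SPIRIT algorithms: it sits at the naive end of the spectrum the abstract describes, using the constraint only as a final filter and pruning by minimum support alone. The function names, the single-character item encoding, and the toy data are assumptions made for illustration.

```python
import re


def is_subsequence(pattern, sequence):
    """Return True if the items of `pattern` occur in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    # `item in remaining` consumes the iterator up to the first match,
    # so later items must appear after earlier ones.
    return all(item in remaining for item in pattern)


def mine_with_re_constraint(sequences, min_support, constraint):
    """Level-wise frequent-pattern mining followed by an RE post-filter.

    Assumptions (not from the paper): each item is a single character, so a
    pattern can be joined into a string and matched against `constraint`.
    """
    if min_support < 1:
        raise ValueError("min_support must be at least 1")

    alphabet = sorted({item for seq in sequences for item in seq})
    frequent = []
    level = [(item,) for item in alphabet]  # candidate patterns of length 1

    while level:
        survivors = []
        for candidate in level:
            support = sum(is_subsequence(candidate, seq) for seq in sequences)
            if support >= min_support:
                survivors.append(candidate)
                frequent.append((candidate, support))
        # Extend only frequent patterns; an infrequent pattern cannot
        # become frequent by adding items.
        level = [pattern + (item,) for pattern in survivors for item in alphabet]

    # The constraint is applied only here, after all counting is done.
    regex = re.compile(constraint)
    return [(p, s) for p, s in frequent if regex.fullmatch("".join(p))]


if __name__ == "__main__":
    data = ["abcb", "acb", "abb", "cab"]
    for pattern, support in mine_with_re_constraint(data, 3, "a(b|c)*b"):
        print("".join(pattern), support)
```

Because the constraint is checked only at the end, this sketch counts support for many patterns that can never match the RE. Reducing that wasted work by enforcing the constraint earlier, to varying degrees, is the tradeoff the abstract refers to.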
Minos N. Garofalakis, Rajeev Rastogi, Kyuseok Shim
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1999
Where VLDB