gSpan: Graph-Based Substructure Pattern Mining



We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based Substructure pattern mining), which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs, and maps each graph to a unique minimum DFS code as its canonical label. Based on this lexicographic order, gSpan adopts the depth-first search strategy to mine frequent connected subgraphs efficiently. Our performance study shows that gSpan substantially outperforms previous algorithms, sometimes by an order of magnitude.
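To make the idea of a minimum DFS code as a canonical label concrete, here is a minimal brute-force sketch in Python. It is not the authors' implementation: it enumerates every DFS code of one small connected graph and keeps the smallest, whereas gSpan avoids this enumeration during mining. The names `dfs_codes`, `edge_key`, and `min_dfs_code`, the 5-tuple layout `(i, j, label_i, edge_label, label_j)`, and the exact tie-breaking in `edge_key` are assumptions made for illustration, based on one reading of the DFS lexicographic order.

```python
def dfs_codes(vlabels, edges):
    """Yield every DFS code of a connected, labeled, undirected graph.

    vlabels: {vertex: label}; edges: [(u, v, edge_label), ...].
    A code is a tuple of entries (i, j, label_i, edge_label, label_j),
    where i and j are DFS discovery indices (hypothetical layout).
    """
    adj = {v: {} for v in vlabels}
    for u, v, label in edges:
        adj[u][v] = label
        adj[v][u] = label

    def grow(code, index, path):
        # path is the rightmost path: root ... most recently discovered vertex.
        if len(code) == len(edges):
            yield tuple(code)
            return
        # Forward extension from any vertex on the rightmost path.
        for depth in range(len(path) - 1, -1, -1):
            src = path[depth]
            for dst, label in adj[src].items():
                if dst in index:
                    continue
                new_index = {**index, dst: len(index)}
                new_path = path[:depth + 1] + [dst]
                step = [(index[src], new_index[dst],
                         vlabels[src], label, vlabels[dst])]
                # Backward edges from the new vertex to its ancestors
                # (parent excluded), closest to the root first.
                for anc in new_path[:-2]:
                    if anc in adj[dst]:
                        step.append((new_index[dst], index[anc],
                                     vlabels[dst], adj[dst][anc], vlabels[anc]))
                yield from grow(code + step, new_index, new_path)
        # Branches that cannot cover every edge end here and yield nothing.

    for root in vlabels:
        yield from grow([], {root: 0}, [root])


def edge_key(entry):
    """Sort key for one code entry (assumed reading of the DFS order):
    backward edges first, then forward edges from the deepest vertex."""
    i, j, li, lij, lj = entry
    if i > j:                        # backward edge
        return (0, j, li, lij, lj)
    return (1, -i, li, lij, lj)      # forward edge


def min_dfs_code(vlabels, edges):
    """Return the smallest DFS code, used here as the canonical label."""
    return min(dfs_codes(vlabels, edges),
               key=lambda code: [edge_key(e) for e in code])


# Two drawings of the same labeled triangle, with different vertex ids.
g1 = ({1: "C", 2: "C", 3: "O"}, [(1, 2, "s"), (2, 3, "d"), (3, 1, "s")])
g2 = ({"x": "O", "y": "C", "z": "C"},
      [("x", "y", "s"), ("y", "z", "s"), ("z", "x", "d")])
print(min_dfs_code(*g1) == min_dfs_code(*g2))
```

Because isomorphic graphs produce the same set of DFS codes, the minimum of that set does not depend on how the vertices are named, which is what lets it serve as a canonical label. The enumeration is exponential in the worst case, so this sketch is only suitable for very small graphs.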

Xifeng Yan, Jiawei Han
