KDD 2007 | Sciweavers

161

Voted

KDD
2007
ACM

148views Data Mining» more KDD 2007»

Detecting research topics via the correlation between graphs and texts

16 years 7 months ago

In this paper we address the problem of detecting topics in large-scale linked document collections. Recently, topic detection has become a very active area of research due to its...

Yookyung Jo, Carl Lagoze, C. Lee Giles

claim paper

Read More »

209

click to vote

KDD
2007
ACM

184views Data Mining» more KDD 2007»

Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis

16 years 7 months ago

Download www1bpt.bridgeport.edu

To unravel the concept structure and dynamics of the bioinformatics field, we analyze a set of 7401 publications from the Web of Science and MEDLINE databases, publication years 1...

Bart De Moor, Frizo A. L. Janssens, Wolfgang Gl&au...

claim paper

Read More »

206

Voted

KDD
2007
ACM

182views Data Mining» more KDD 2007»

Cleaning disguised missing data: a heuristic approach

16 years 7 months ago

Download www.cs.sfu.ca

In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...

Ming Hua, Jian Pei

claim paper

Read More »

215

click to vote

KDD
2007
ACM

165views Data Mining» more KDD 2007»

Finding low-entropy sets and trees from binary data

16 years 7 months ago

Download eprints.pascal-network.org

The discovery of subsets with special properties from binary data has been one of the key themes in pattern discovery. Pattern classes such as frequent itemsets stress the co-occu...

Eino Hinkkanen, Hannes Heikinheimo, Heikki Mannila...

claim paper

Read More »

144

click to vote

KDD
2007
ACM

138views Data Mining» more KDD 2007»

Trajectory pattern mining

16 years 7 months ago

Download velblod.videolectures.net

Fosca Giannotti, Mirco Nanni, Fabio Pinelli, Dino ...

claim paper

Read More »

168

Voted

KDD
2007
ACM

159views Data Mining» more KDD 2007»

Constraint-driven clustering

16 years 7 months ago

Download www.cs.sfu.ca

Clustering methods can be either data-driven or need-driven. Data-driven methods intend to discover the true structure of the underlying data while need-driven methods aims at org...

Rong Ge, Martin Ester, Wen Jin, Ian Davidson

claim paper

Read More »

187

Voted

KDD
2007
ACM

168views Data Mining» more KDD 2007»

Finding tribes: identifying close-knit individuals from employment patterns

16 years 7 months ago

Download kdl.cs.umass.edu

We present a family of algorithms to uncover tribes--groups of individuals who share unusual sequences of affiliations. While much work inferring community structure describes lar...

Lisa Friedland, David Jensen

claim paper

Read More »

196

click to vote

KDD
2007
ACM

152views Data Mining» more KDD 2007»

Relational data pre-processing techniques for improved securities fraud detection

16 years 7 months ago

Download kdl.cs.umass.edu

Commercial datasets are often large, relational, and dynamic. They contain many records of people, places, things, events and their interactions over time. Such datasets are rarel...

Andrew Fast, Lisa Friedland, Marc Maier, Brian Tay...

claim paper

Read More »

125

click to vote

KDD
2007
ACM

132views Data Mining» more KDD 2007»

Semi-supervised classification with hybrid generative/discriminative methods

16 years 7 months ago

Download www.cs.umass.edu

Gregory Druck, Chris Pal, Andrew McCallum, Xiaojin...

claim paper