Data Mining | Sciweavers

221

KDD
2007
ACM

151views Data Mining» more KDD 2007»

Efficient mining of iterative patterns for software specification discovery

16 years 8 months ago

Studies have shown that program comprehension takes up to 45% of software development costs. Such high costs are caused by the lack-of documented specification and further aggrava...

Chao Liu 0001, David Lo, Siau-Cheng Khoo

claim paper

Read More »

186

click to vote

KDD
2007
ACM

181views Data Mining» more KDD 2007»

BoostCluster: boosting clustering by pairwise constraints

16 years 8 months ago

Download dataclustering.cse.msu.edu

Data clustering is an important task in many disciplines. A large number of studies have attempted to improve clustering by using the side information that is often encoded as pai...

Yi Liu, Rong Jin, Anil K. Jain

claim paper

Read More »

225

click to vote

KDD
2007
ACM

191views Data Mining» more KDD 2007»

Cost-effective outbreak detection in networks

16 years 8 months ago

Download www.cs.cmu.edu

Given a water distribution network, where should we place sensors to quickly detect contaminants? Or, which blogs should we read to avoid missing important stories? These seemingl...

Andreas Krause, Carlos Guestrin, Christos Faloutso...

claim paper

Read More »

235

click to vote

KDD
2007
ACM

182views Data Mining» more KDD 2007»

A fast algorithm for finding frequent episodes in event streams

16 years 8 months ago

Download research.microsoft.com

Frequent episode discovery is a popular framework for mining data available as a long sequence of events. An episode is essentially a short ordered sequence of event types and the...

Srivatsan Laxman, P. S. Sastry, K. P. Unnikrishnan

claim paper

Read More »

190

click to vote

KDD
2007
ACM

139views Data Mining» more KDD 2007»

Raising the baseline for high-precision text classifiers

16 years 8 months ago

Download ir.iit.edu

Many important application areas of text classifiers demand high precision and it is common to compare prospective solutions to the performance of Naive Bayes. This baseline is us...

Aleksander Kolcz, Wen-tau Yih

claim paper

Read More »

190

click to vote

KDD
2007
ACM

159views Data Mining» more KDD 2007»

Practical guide to controlled experiments on the web: listen to your customers not to the hippo

16 years 8 months ago

Download exp-platform.com

The web provides an unprecedented opportunity to evaluate ideas quickly using controlled experiments, also called randomized experiments (single-factor or factorial designs), A/B ...

Ron Kohavi, Randal M. Henne, Dan Sommerfield

claim paper

Read More »

220

click to vote

KDD
2007
ACM

184views Data Mining» more KDD 2007»

Correlation search in graph databases

16 years 8 months ago

Download www.se.cuhk.edu.hk

Correlation mining has gained great success in many application domains for its ability to capture the underlying dependency between objects. However, the research of correlation ...

Yiping Ke, James Cheng, Wilfred Ng

claim paper

Read More »

195

click to vote

KDD
2007
ACM

148views Data Mining» more KDD 2007»

Detecting research topics via the correlation between graphs and texts

16 years 8 months ago

Download www.cs.cornell.edu

In this paper we address the problem of detecting topics in large-scale linked document collections. Recently, topic detection has become a very active area of research due to its...

Yookyung Jo, Carl Lagoze, C. Lee Giles

claim paper

Read More »

248

click to vote

KDD
2007
ACM

184views Data Mining» more KDD 2007»

Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis

16 years 8 months ago

Download www1bpt.bridgeport.edu

To unravel the concept structure and dynamics of the bioinformatics field, we analyze a set of 7401 publications from the Web of Science and MEDLINE databases, publication years 1...

Bart De Moor, Frizo A. L. Janssens, Wolfgang Gl&au...

claim paper

Read More »

231

click to vote

KDD
2007
ACM

182views Data Mining» more KDD 2007»

Cleaning disguised missing data: a heuristic approach

16 years 8 months ago

Download www.cs.sfu.ca

In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...

Ming Hua, Jian Pei

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers