Search Sciweavers | Sciweavers

1552 search results - page 253 / 311

» Mining for Patterns in Contradictory Data

137

click to vote

KDD
2008
ACM

183views Data Mining» more KDD 2008»

De-duping URLs via rewrite rules

16 years 4 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

claim paper

Read More »

143

click to vote

ICDM
2009
IEEE

145views Data Mining» more ICDM 2009»

Significance of Episodes Based on Minimal Windows

15 years 1 months ago

Download win.ua.ac.be

Discovering episodes, frequent sets of events from a sequence has been an active field in pattern mining. Traditionally, a level-wise approach is used to discover all frequent epis...

Nikolaj Tatti

claim paper

Read More »

122

Voted

KDD
2006
ACM

118views Data Mining» more KDD 2006»

Reducing the human overhead in text categorization

16 years 4 months ago

Download research.microsoft.com

Many applications in text processing require significant human effort for either labeling large document collections (when learning statistical models) or extrapolating rules from...

Arnd Christian König, Eric Brill

claim paper

Read More »

114

Voted

KDD
2006
ACM

163views Data Mining» more KDD 2006»

New EM derived from Kullback-Leibler divergence

16 years 4 months ago

Download www.cis.temple.edu

We introduce a new EM framework in which it is possible not only to optimize the model parameters but also the number of model components. A key feature of our approach is that we...

Longin Jan Latecki, Marc Sobel, Rolf Lakämper

claim paper

Read More »

133

click to vote

KDD
2004
ACM

190views Data Mining» more KDD 2004»

Kernel k-means: spectral clustering and normalized cuts

16 years 4 months ago

Download www.cs.utexas.edu

Kernel k-means and spectral clustering have both been used to identify clusters that are non-linearly separable in input space. Despite significant research, these methods have re...

Inderjit S. Dhillon, Yuqiang Guan, Brian Kulis

claim paper

Read More »

« Prev « First page 253 / 311 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers