KDD 2003 | Sciweavers

152

KDD
2003
ACM

124views Data Mining» more KDD 2003»

16 years 7 months ago

Two-dimensional contingency or co-occurrence tables arise frequently in important applications such as text, web-log and market-basket data analysis. A basic problem in contingenc...

Inderjit S. Dhillon, Subramanyam Mallela, Dharmend...

claim paper

Read More »

190

Voted

KDD
2003
ACM

194views Data Mining» more KDD 2003»

Finding recent frequent itemsets adaptively over online data streams

16 years 7 months ago

Download magna.cs.ucla.edu

A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is more likely to be c...

Joong Hyuk Chang, Won Suk Lee

claim paper

Read More »

128

click to vote

KDD
2003
ACM

111views Data Mining» more KDD 2003»

Translation-invariant mixture models for curve clustering

16 years 7 months ago

Download www.datalab.uci.edu

Darya Chudova, Scott Gaffney, Eric Mjolsness, Padh...

claim paper

Read More »

181

Voted

KDD
2003
ACM

146views Data Mining» more KDD 2003»

Probabilistic discovery of time series motifs

16 years 7 months ago

Download www.cs.ucr.edu

Several important time series data mining problems reduce to the core task of finding approximately repeated subsequences in a longer time series. In an earlier work, we formalize...

Bill Yuan-chi Chiu, Eamonn J. Keogh, Stefano Lonar...

claim paper

Read More »

186

Voted

KDD
2003
ACM

122views Data Mining» more KDD 2003»

Understanding captions in biomedical publications

16 years 7 months ago

Download murphylab.web.cmu.edu

From the standpoint of the automated extraction of scientific knowledge, an important but little-studied part of scientific publications are the figures and accompanying captions....

William W. Cohen, Richard C. Wang, Robert F. Murph...

claim paper

Read More »

163

click to vote

KDD
2003
ACM

146views Data Mining» more KDD 2003»

Style mining of electronic messages for multiple authorship discrimination: first results

16 years 7 months ago

Download lingcog.iit.edu

This paper considers the use of computational stylistics for performing authorship attribution of electronic messages, addressing categorization problems with as many as 20 differ...

Shlomo Argamon, Marin Saric, Sterling Stuart Stein

claim paper

Read More »

157

click to vote

KDD
2003
ACM

152views Data Mining» more KDD 2003»

An adaptive nearest neighbor search for a parts acquisition ePortal

16 years 7 months ago

Download videotechresearch.com

Rafael Alonso, Jeffrey A. Bloom, Hua Li, Chumki Ba...

claim paper

Read More »

156

click to vote

KDD
2003
ACM

116views Data Mining» more KDD 2003»

Golden Path Analyzer: using divide-and-conquer to cluster Web clickstreams

16 years 7 months ago

Download lyonesse.stanford.edu

Kamal Ali, Steven P. Ketchpel

claim paper

Read More »

170

click to vote

KDD
2003
ACM

130views Data Mining» more KDD 2003»

Towards systematic design of distance functions for data mining applications

16 years 7 months ago

Download charuaggarwal.net

Distance function computation is a key subtask in many data mining algorithms and applications. The most effective form of the distance function can only be expressed in the conte...

Charu C. Aggarwal

claim paper

Read More »

157

click to vote

KDD
2003
ACM

156views Data Mining» more KDD 2003»

Mining distance-based outliers in near linear time with randomization and a simple pruning rule

16 years 7 months ago

Download www.isle.org

Defining outliers by their distance to neighboring examples is a popular approach to finding unusual examples in a data set. Recently, much work has been conducted with the goal o...

Stephen D. Bay, Mark Schwabacher

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers