Data Mining | Sciweavers

KDD
2004
ACM

A cross-collection mixture model for comparative text mining

16 years 8 months ago

In this paper, we define and study a novel text mining problem, which we refer to as Comparative Text Mining (CTM). Given a set of comparable text collections, the task of compara...

ChengXiang Zhai, Atulya Velivelli, Bei Yu

KDD
2004
ACM

Privacy-preserving Bayesian network structure computation on distributed heterogeneous data

16 years 8 months ago

Download www.gwu.edu

Rebecca N. Wright, Zhiqiang Yang

KDD
2004
ACM

Scalable mining of large disk-based graph databases

16 years 8 months ago

Download www.cs.sfu.ca

Mining frequent structural patterns from graph databases is an interesting problem with broad applications. Most of the previous studies focus on pruning unfruitful search subspac...

Chen Wang, Wei Wang 0009, Jian Pei, Yongtai Zhu, B...

KDD
2004
ACM

1-dimensional splines as building blocks for improving accuracy of risk outcomes models

16 years 8 months ago

Download www.dataminingsolutions.net

Transformation of both the response variable and the predictors is commonly used in fitting regression models. However, these transformation methods do not always provide the maxi...

David S. Vogel, Morgan C. Wang

KDD
2004
ACM

Rotation invariant distance measures for trajectories

16 years 8 months ago

Download www.cs.ucr.edu

For the discovery of similar patterns in 1D time-series, it is very typical to perform a normalization of the data (for example a transformation so that the data follow a zero mea...

Michail Vlachos, Dimitrios Gunopulos, Gautam Das

KDD
2004
ACM

Learning a complex metabolomic dataset using random forests and support vector machines

16 years 8 months ago

Download math.uc.edu

Metabolomics is the omics science of biochemistry. The associated data include the quantitative measurements of all small molecule metabolites in a biological sample. These datase...

Young Truong, Xiaodong Lin, Chris Beecher

KDD
2004
ACM

A generative probabilistic approach to visualizing sets of symbolic sequences

16 years 8 months ago

Download www.cs.bham.ac.uk

There is notable interest in extending probabilistic generative modeling principles to accommodate more complex structured data types. In this paper we develop a generative ...

Peter Tiño, Ata Kabán, Yi Sun

KDD
2004
ACM

Ordering patterns by combining opinions from multiple sources

16 years 8 months ago

Download www.cse.msu.edu

Pattern ordering is an important task in data mining because the number of patterns extracted by standard data mining algorithms often exceeds our capacity to manually analyze the...

Pang-Ning Tan, Rong Jin

KDD
2004
ACM

Probabilistic author-topic models for information discovery

16 years 8 months ago

Download psiexp.ss.uci.edu

We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...

Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...

KDD
2004
ACM

Generalizing the notion of support

16 years 8 months ago

Download www-users.cs.umn.edu

The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data....

Michael Steinbach, Pang-Ning Tan, Hui Xiong, Vipin...
