Data Mining | Sciweavers

PAKDD
2005
ACM

On Multiple Query Optimization in Data Mining

16 years 1 months ago

Traditional multiple query optimization methods focus on identifying common subexpressions in sets of relational queries and on constructing their global execution plans. In this p...

Marek Wojciechowski, Maciej Zakrzewicz

PAKDD
2005
ACM

Feature Selection for High Dimensional Face Image Using Self-organizing Maps

16 years 1 months ago

Download parnec.nuaa.edu.cn

While feature selection is very difficult for high dimensional, unstructured data such as face image, it may be much easier to do if the data can be faithfully transformed into l...

Xiaoyang Tan, Songcan Chen, Zhi-Hua Zhou, Fuyan Zh...

PAKDD
2005
ACM

Automatic Occupation Coding with Combination of Machine Learning and Hand-Crafted Rules

16 years 1 months ago

Download www.lr.pi.titech.ac.jp

Abstract. We apply a machine learning method to the occupation coding, which is a task to categorize the answers to open-ended questions regarding the respondent’s occupation. Sp...

Kazuko Takahashi, Hiroya Takamura, Manabu Okumura

PAKDD
2005
ACM

Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information

16 years 1 months ago

Download ss.hnu.cn

Abstract. Bayesian spam filters, in general, compute probability estimations for tokens either without considering the email areas of occurrences except the body or treating the s...

Raju Shrestha, Yaping Lin

PAKDD
2005
ACM

Speeding-Up Hierarchical Agglomerative Clustering in Presence of Expensive Metrics

16 years 1 months ago

Download ercolino.isti.cnr.it

In several contexts and domains, hierarchical agglomerative clustering (HAC) offers best-quality results, but at the price of a high complexity which reduces the size of datasets ...

Mirco Nanni

PAKDD
2005
ACM

Increasing Classification Accuracy by Combining Adaptive Sampling and Convex Pseudo-Data

16 years 1 months ago

Download personal.gscit.monash.edu.au

The availability of microarray data has enabled several studies on the application of aggregated classifiers for molecular classification. We present a combination of classifier ag...

Chia Huey Ooi, Madhu Chetty

PAKDD
2005
ACM

Adjusting Mixture Weights of Gaussian Mixture Model via Regularized Probabilistic Latent Semantic Analysis

16 years 1 months ago

Download www.cs.cmu.edu

Mixture models, such as Gaussian Mixture Model, have been widely used in many applications for modeling data. Gaussian mixture model (GMM) assumes that data points are generated fr...

Luo Si, Rong Jin

PAKDD
2005
ACM

Conditional Random Fields for Transmembrane Helix Prediction

16 years 1 months ago

Download pmg.it.usyd.edu.au

Abstract. It is estimated that 20% of genes in the human genome encode for integral membrane proteins (IMPs) and some estimates are much higher. IMPs control a broad range of event...

Lior Lukov, Sanjay Chawla, W. Bret Church

PAKDD
2005
ACM

Covariance and PCA for Categorical Variables

16 years 1 months ago

Download hp.vector.co.jp

Covariances from categorical variables are defined using a regular simplex expression for categories. The method follows the variance definition by Gini, and it gives the covaria...

Hirotaka Niitsuma, Takashi Okada

PAKDD
2005
ACM

Online Algorithms for Mining Inter-stream Associations from Large Sensor Networks

16 years 1 months ago

Download www.cs.hku.hk

We study the problem of mining frequent value sets from a large sensor network. We discuss how sensor stream data could be represented that facilitates efficient online mining and ...

K. K. Loo, Ivy Tong, Ben Kao
