Data Mining | Sciweavers

SDM
2010
SIAM

Scalable Tensor Factorizations with Missing Data

15 years 7 months ago

The problem of missing data is ubiquitous in domains such as biomedical signal processing, network traffic analysis, bibliometrics, social network analysis, chemometrics, computer...

Evrim Acar, Daniel M. Dunlavy, Tamara G. Kolda, Mo...

Predictive Modeling with Heterogeneous Sources

15 years 7 months ago

Lack of labeled training examples is a common problem for many applications. At the same time, there is usually an abundance of labeled data from related tasks. But they have diff...

Xiaoxiao Shi, Qi Liu, Wei Fan, Qiang Yang, Philip ...

A Permutation Approach to Validation

15 years 7 months ago

We give a permutation approach to validation (estimation of out-sample error). One typical use of validation is model selection. We establish the legitimacy of the proposed permut...

Malik Magdon-Ismail, Konstantin Mertsalov

The Generalized Dimensionality Reduction Problem

15 years 7 months ago

The dimensionality reduction problem has been widely studied in the database literature because of its application for concise data representation in a variety of database applica...

Charu C. Aggarwal

Direct Density Ratio Estimation with Dimensionality Reduction

15 years 7 months ago

Methods for directly estimating the ratio of two probability density functions without going through density estimation have been actively explored recently since they can be used...

Masashi Sugiyama, Satoshi Hara, Paul von Büna...

Confidence-Based Feature Acquisition to Minimize Training and Test Costs

15 years 7 months ago

We present Confidence-based Feature Acquisition (CFA), a novel supervised learning method for acquiring missing feature values when there is missing data at both training and test...

Marie desJardins, James MacGlashan, Kiri L. Wagsta...

Making k-means Even Faster

15 years 7 months ago

The k-means algorithm is widely used for clustering, compressing, and summarizing vector data. In this paper, we propose a new acceleration for exact k-means that gives the same a...

Greg Hamerly

Evaluating Query Result Significance in Databases via Randomizations

15 years 7 months ago

Many sorts of structured data are commonly stored in a multi-relational format of interrelated tables. Under this relational model, exploratory data analysis can be done by using ...

Markus Ojala, Gemma C. Garriga, Aristides Gionis, ...

Temporal Collaborative Filtering with Bayesian Probabilistic Tensor Factorization

15 years 7 months ago

Real-world relational data are seldom stationary, yet traditional collaborative filtering algorithms generally rely on this assumption. Motivated by our sales prediction problem, ...

Liang Xiong, Xi Chen, Tzu-Kuo Huang, Jeff Schneide...

Adaptive Informative Sampling for Active Learning

15 years 7 months ago

Many approaches to active learning involve periodically training one classifier and choosing data points with the lowest confidence. An alternative approach is to periodically cho...

Zhenyu Lu, Xindong Wu, Josh Bongard
