SDM 2008 | Sciweavers

176

SDM
2008
SIAM

133views Data Mining» more SDM 2008»

Semantic Smoothing for Bayesian Text Classification with Small Training Data

15 years 8 months ago

Bayesian text classifiers face a common issue which is referred to as data sparsity problem, especially when the size of training data is very small. The frequently used Laplacian...

Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu

claim paper

Read More »

199

Voted

SDM
2008
SIAM

119views Data Mining» more SDM 2008»

An Efficient Local Algorithm for Distributed Multivariate Regression in Peer-to-Peer Networks

15 years 8 months ago

Download www.cs.umbc.edu

This paper offers a local distributed algorithm for multivariate regression in large peer-to-peer environments. The algorithm is designed for distributed inferencing, data compact...

Kanishka Bhaduri, Hillol Kargupta

claim paper

Read More »

183

click to vote

SDM
2008
SIAM

138views Data Mining» more SDM 2008»

Learning Markov Network Structure using Few Independence Tests

15 years 8 months ago

Download www.cs.cmu.edu

In this paper we present the Dynamic Grow-Shrink Inference-based Markov network learning algorithm (abbreviated DGSIMN), which improves on GSIMN, the state-ofthe-art algorithm for...

Parichey Gandhi, Facundo Bromberg, Dimitris Margar...

claim paper

Read More »

201

click to vote

SDM
2008
SIAM

144views Data Mining» more SDM 2008»

Semi-supervised Multi-label Learning by Solving a Sylvester Equation

15 years 8 months ago

Download siam.org

Multi-label learning refers to the problems where an instance can be assigned to more than one category. In this paper, we present a novel Semi-supervised algorithm for Multi-labe...

Gang Chen, Yangqiu Song, Fei Wang, Changshui Zhang

claim paper

Read More »

191

click to vote

SDM
2008
SIAM

165views Data Mining» more SDM 2008»

On the Dangers of Cross-Validation. An Experimental Evaluation

15 years 8 months ago

Download people.csail.mit.edu

Cross validation allows models to be tested using the full training set by means of repeated resampling; thus, maximizing the total number of points used for testing and potential...

R. Bharat Rao, Glenn Fung

claim paper

Read More »

214

click to vote

SDM
2008
SIAM

144views Data Mining» more SDM 2008»

Active Learning with Model Selection in Linear Regression

15 years 8 months ago

Download hrstc.org

Optimally designing the location of training input points (active learning) and choosing the best model (model selection) are two important components of supervised learning and h...

Masashi Sugiyama, Neil Rubens

claim paper

Read More »

181

click to vote

SDM
2008
SIAM

139views Data Mining» more SDM 2008»

Proximity Tracking on Time-Evolving Bipartite Graphs

15 years 8 months ago

Download www.cs.cmu.edu

Given an author-conference network that evolves over time, which are the conferences that a given author is most closely related with, and how do they change over time? Large time...

Hanghang Tong, Spiros Papadimitriou, Philip S. Yu,...

claim paper

Read More »

196

click to vote

SDM
2008
SIAM

118views Data Mining» more SDM 2008»

Massive-Scale Kernel Discriminant Analysis: Mining for Quasars

15 years 8 months ago

Download www.cc.gatech.edu

We describe a fast algorithm for kernel discriminant analysis, empirically demonstrating asymptotic speed-up over the previous best approach. We achieve this with a new pattern of...

Ryan Riegel, Alexander Gray, Gordon Richards

claim paper

Read More »

175

click to vote

SDM
2008
SIAM

177views Data Mining» more SDM 2008»

Cluster Ensemble Selection

15 years 8 months ago

Download web.engr.oregonstate.edu

This paper studies the ensemble selection problem for unsupervised learning. Given a large library of different clustering solutions, our goal is to select a subset of solutions t...

Xiaoli Z. Fern, Wei Lin

claim paper

Read More »

179

click to vote

SDM
2008
SIAM

177views Data Mining» more SDM 2008»

Roughly Balanced Bagging for Imbalanced Data

15 years 8 months ago

Download siam.org

Imbalanced class problems appear in many real applications of classification learning. We propose a novel sampling method to improve bagging for data sets with skewed class distri...

Shohei Hido, Hisashi Kashima

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers