Sciweavers

ICDM
2006
IEEE
193views Data Mining» more  ICDM 2006»
14 years 2 months ago
Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features
Several spatio-temporal data collected in many applications, such as fMRI data in medical applications, can be represented as a Multivariate Time Series (MTS) matrix with m rows (...
Hyunjin Yoon, Cyrus Shahabi
ICDM
2006
IEEE
76views Data Mining» more  ICDM 2006»
14 years 2 months ago
A Probabilistic Ensemble Pruning Algorithm
An ensemble is a group of learners that work together as a committee to solve a problem. However, the existing ensemble training algorithms sometimes generate unnecessary large en...
Huanhuan Chen, Peter Tiño, Xin Yao
ICDM
2006
IEEE
139views Data Mining» more  ICDM 2006»
14 years 2 months ago
Detecting Link Spam Using Temporal Information
How to effectively protect against spam on search ranking results is an important issue for contemporary web search engines. This paper addresses the problem of combating one majo...
Guoyang Shen, Bin Gao, Tie-Yan Liu, Guang Feng, Sh...
ICDM
2006
IEEE
105views Data Mining» more  ICDM 2006»
14 years 2 months ago
Learning to Use a Learned Model: A Two-Stage Approach to Classification
Maria-Luiza Antonie, Osmar R. Zaïane, Robert ...
ICDM
2006
IEEE
119views Data Mining» more  ICDM 2006»
14 years 2 months ago
Fast On-line Kernel Learning for Trees
Kernel methods have been shown to be very effective for applications requiring the modeling of structured objects. However kernels for structures usually are too computational dem...
Fabio Aiolli, Giovanni Da San Martino, Alessandro ...
ICDM
2006
IEEE
122views Data Mining» more  ICDM 2006»
14 years 2 months ago
Latent Friend Mining from Blog Data
The rapid growth of blog (also known as “weblog”) data provides a rich resource for social community mining. In this paper, we put forward a novel research problem of mining t...
Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen
ICDM
2006
IEEE
145views Data Mining» more  ICDM 2006»
14 years 2 months ago
Stability Region Based Expectation Maximization for Model-based Clustering
In spite of the initialization problem, the ExpectationMaximization (EM) algorithm is widely used for estimating the parameters in several data mining related tasks. Most popular ...
Chandan K. Reddy, Hsiao-Dong Chiang, Bala Rajaratn...
ICDM
2006
IEEE
100views Data Mining» more  ICDM 2006»
14 years 2 months ago
Semantic Smoothing for Model-based Document Clustering
Xiaodan Zhang, Xiaohua Zhou, Xiaohua Hu
ICDM
2006
IEEE
108views Data Mining» more  ICDM 2006»
14 years 2 months ago
Integrating Features from Different Sources for Music Information Retrieval
Efficient and intelligent music information retrieval is a very important topic of the 21st century. With the ultimate goal of building personal music information retrieval syste...
Tao Li, Mitsunori Ogihara, Shenghuo Zhu
ICDM
2006
IEEE
107views Data Mining» more  ICDM 2006»
14 years 2 months ago
Improving Grouped-Entity Resolution Using Quasi-Cliques
The entity resolution (ER) problem, which identifies duplicate entities that refer to the same real world entity, is essential in many applications. In this paper, in particular,...
Byung-Won On, Ergin Elmacioglu, Dongwon Lee, Jaewo...