Sciweavers

161 search results - page 15 / 33
» A Multiple Resampling Method for Learning from Imbalanced Da...
Sort
View
SDM
2008
SIAM
139views Data Mining» more  SDM 2008»
13 years 9 months ago
Simultaneous Unsupervised Learning of Disparate Clusterings
Most clustering algorithms produce a single clustering for a given data set even when the data can be clustered naturally in multiple ways. In this paper, we address the difficult...
Prateek Jain, Raghu Meka, Inderjit S. Dhillon
ICDM
2009
IEEE
117views Data Mining» more  ICDM 2009»
14 years 2 months ago
Clustering with Multiple Graphs
—In graph-based learning models, entities are often represented as vertices in an undirected graph with weighted edges describing the relationships between entities. In many real...
Wei Tang, Zhengdong Lu, Inderjit S. Dhillon
KDD
2008
ACM
206views Data Mining» more  KDD 2008»
14 years 8 months ago
Identifying biologically relevant genes via multiple heterogeneous data sources
Selection of genes that are differentially expressed and critical to a particular biological process has been a major challenge in post-array analysis. Recent development in bioin...
Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Y...
IDA
2005
Springer
14 years 1 months ago
Removing Statistical Biases in Unsupervised Sequence Learning
Unsupervised sequence learning is important to many applications. A learner is presented with unlabeled sequential data, and must discover sequential patterns that characterize the...
Yoav Horman, Gal A. Kaminka
BMCBI
2006
165views more  BMCBI 2006»
13 years 7 months ago
Improved variance estimation of classification performance via reduction of bias caused by small sample size
Background: Supervised learning for classification of cancer employs a set of design examples to learn how to discriminate between tumors. In practice it is crucial to confirm tha...
Ulrika Wickenberg-Bolin, Hanna Göransson, M&a...