Sciweavers

1863 search results - page 44 / 373
» A supervised learning approach for imbalanced data sets
Sort
View
KDD
2004
ACM
132views Data Mining» more  KDD 2004»
14 years 9 months ago
A probabilistic framework for semi-supervised clustering
Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clu...
Sugato Basu, Mikhail Bilenko, Raymond J. Mooney
COLING
2010
13 years 4 months ago
A Vector Space Model for Subjectivity Classification in Urdu aided by Co-Training
The goal of this work is to produce a classifier that can distinguish subjective sentences from objective sentences for the Urdu language. The amount of labeled data required for ...
Smruthi Mukund, Rohini K. Srihari
IPM
2008
196views more  IPM 2008»
13 years 9 months ago
Author identification: Using text sampling to handle the class imbalance problem
Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorizatio...
Efstathios Stamatatos
SDM
2008
SIAM
168views Data Mining» more  SDM 2008»
13 years 10 months ago
Semi-Supervised Clustering via Matrix Factorization
The recent years have witnessed a surge of interests of semi-supervised clustering methods, which aim to cluster the data set under the guidance of some supervisory information. U...
Fei Wang, Tao Li, Changshui Zhang
BMCBI
2011
13 years 4 months ago
Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in i
Background: In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focuse...
Paolo G. V. Martini, Davide Risso, Gabriele Sales,...