Sciweavers

114 search results - page 21 / 23
» Feature Subset Selection and Ranking for Data Dimensionality...
KDD
2000
ACM
14 years 1 month ago
Efficient clustering of high-dimensional data sets with application to reference matching
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...
Andrew McCallum, Kamal Nigam, Lyle H. Ungar
ICML
2008
IEEE
14 years 10 months ago
Expectation-maximization for sparse and non-negative PCA
We study the problem of finding the dominant eigenvector of the sample covariance matrix, under additional constraints on the vector: a cardinality constraint limits the number of...
Christian D. Sigg, Joachim M. Buhmann
IDA
2010
Springer
13 years 8 months ago
Fuzzy-rough approaches for mammographic risk analysis
The accuracy of methods for the assessment of mammographic risk analysis is heavily related to breast tissue characteristics. Previous work has demonstrated considerable success i...
Neil MacParthalain, Richard Jensen, Qiang Shen, Re...
ICCPOL
2009
Springer
13 years 7 months ago
A Simple and Efficient Model Pruning Method for Conditional Random Fields
Conditional random fields (CRFs) have been quite successful in various machine learning tasks. However, as larger and larger data become acceptable for the current computational ma...
Hai Zhao, Chunyu Kit
BMCBI
2008
13 years 10 months ago
A method for analyzing censored survival phenotype with gene expression data
Background: Survival time is an important clinical trait for many disease studies. Previous work has shown certain relationships between patients' gene expression profiles a...
Tongtong Wu, Wei Sun, Shinsheng Yuan, Chun-Houh Ch...