Sciweavers

1950 search results - page 7 / 390
» Informative sampling for large unbalanced data sets
Sort
View
ISNN
2009
Springer
14 years 2 months ago
Nonlinear Component Analysis for Large-Scale Data Set Using Fixed-Point Algorithm
Abstract. Nonlinear component analysis is a popular nonlinear feature extraction method. It generally uses eigen-decomposition technique to extract the principal components. But th...
Weiya Shi, Yue-Fei Guo
NIPS
2004
13 years 9 months ago
Efficient Out-of-Sample Extension of Dominant-Set Clusters
Dominant sets are a new graph-theoretic concept that has proven to be relevant in pairwise data clustering problems, such as image segmentation. They generalize the notion of a ma...
Massimiliano Pavan, Marcello Pelillo
HIPC
2009
Springer
13 years 5 months ago
Detailed analysis of I/O traces for large scale applications
- In this paper, we present a tool to extract I/O traces from very large applications running at full scale during their production runs. We analyze these traces to gain informatio...
Nithin Nakka, Alok N. Choudhary, Wei-keng Liao, Le...
ICDM
2002
IEEE
159views Data Mining» more  ICDM 2002»
14 years 17 days ago
O-Cluster: Scalable Clustering of Large High Dimensional Data Sets
Clustering large data sets of high dimensionality has always been a serious challenge for clustering algorithms. Many recently developed clustering algorithms have attempted to ad...
Boriana L. Milenova, Marcos M. Campos
KDD
2003
ACM
180views Data Mining» more  KDD 2003»
14 years 8 months ago
Classifying large data sets using SVMs with hierarchical clusters
Support vector machines (SVMs) have been promising methods for classification and regression analysis because of their solid mathematical foundations which convey several salient ...
Hwanjo Yu, Jiong Yang, Jiawei Han