Sciweavers

» A Genetic Algorithm for Clustering on Very Large Data Sets
CVPR
2009
IEEE
15 years 4 months ago
Regularized Multi-Class Semi-Supervised Boosting
Many semi-supervised learning algorithms only deal with binary classification. Their extension to the multi-class problem is usually obtained by repeatedly solving a set of bina...
Amir Saffari, Christian Leistner, Horst Bischof
RSCTC
2000
Springer
14 years 8 days ago
On Efficient Construction of Decision Trees from Large Databases
The main task in decision tree construction algorithms is to find the "best partition" of the set of objects. In this paper, we investigate the problem of optimal binary ...
Hung Son Nguyen
FLAIRS
2006
13 years 10 months ago
Introducing GEMS - A Novel Technique for Ensemble Creation
The main contribution of this paper is to suggest a novel technique for automatic creation of accurate ensembles. The technique proposed, named GEMS, first trains a large number o...
Ulf Johansson, Tuve Löfström, Rikard K&o...
SIGCSE
2008
ACM
13 years 8 months ago
Cluster computing for web-scale data processing
In this paper we present the design of a modern course in cluster computing and large-scale data processing. The defining differences between this and previously published designs...
Aaron Kimball, Sierra Michels-Slettvet, Christophe...
PODS
2006
ACM
14 years 8 months ago
Finding global icebergs over distributed data sets
Finding icebergs (items whose frequency of occurrence is above a certain threshold) is an important problem with a wide range of applications. Most of the existing work focuses ...
Qi Zhao, Mitsunori Ogihara, Haixun Wang, Jun Xu