Sciweavers

1492 search results - page 60 / 299
» Geometric Clustering Using the Information Bottleneck Method
Sort
View
KDD
2000
ACM
149views Data Mining» more  KDD 2000»
15 years 7 months ago
Efficient clustering of high-dimensional data sets with application to reference matching
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...
Andrew McCallum, Kamal Nigam, Lyle H. Ungar
145
Voted
CIKM
2006
Springer
15 years 6 months ago
Efficiently clustering transactional data with weighted coverage density
In this paper, we propose a fast, memory-efficient, and scalable clustering algorithm for analyzing transactional data. Our approach has three unique features. First, we use the c...
Hua Yan, Keke Chen, Ling Liu
AIRS
2006
Springer
15 years 8 months ago
A Novel Ant-Based Clustering Approach for Document Clustering
Recently, much research has been proposed using nature inspired algorithms to perform complex machine learning tasks. Ant Colony Optimization (ACO) is one such algorithm based on s...
Yulan He, Siu Cheung Hui, Yongxiang Sim
CASCON
1997
79views Education» more  CASCON 1997»
15 years 5 months ago
File clustering using naming conventions for legacy systems
Decomposing complex software systems into conceptually independent subsystems represents a signi cant software engineering activity that receives considerable research attention. ...
Nicolas Anquetil, Timothy C. Lethbridge
136
Voted
PRL
2006
106views more  PRL 2006»
15 years 4 months ago
Invariances in kernel methods: From samples to objects
This paper presents a general method for incorporating prior knowledge into kernel methods such as Support Vector Machines. It applies when the prior knowledge can be formalized b...
Alexei Pozdnoukhov, Samy Bengio