Sciweavers

1403 search results - page 35 / 281
» Set cover algorithms for very large datasets
Sort
View
PRL
2008
135views more  PRL 2008»
13 years 8 months ago
A hierarchical clustering algorithm based on the Hungarian method
We propose a novel hierarchical clustering algorithm for data-sets in which only pairwise distances between the points are provided. The classical Hungarian method is an efficient...
Jacob Goldberger, Tamir Tassa
SGAI
2010
Springer
13 years 6 months ago
Induction of Modular Classification Rules: Using Jmax-pruning
The Prism family of algorithms induces modular classification rules which, in contrast to decision tree induction algorithms, do not necessarily fit together into a decision tree s...
Frederic T. Stahl, Max Bramer
DASFAA
2007
IEEE
220views Database» more  DASFAA 2007»
14 years 2 months ago
LAPIN: Effective Sequential Pattern Mining Algorithms by Last Position Induction for Dense Databases
Sequential pattern mining is very important because it is the basis of many applications. Although there has been a great deal of effort on sequential pattern mining in recent year...
Zhenglu Yang, Yitong Wang, Masaru Kitsuregawa
LREC
2010
141views Education» more  LREC 2010»
13 years 10 months ago
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
The paper presents the fourth, "Mondilex" edition of the MULTEXT-East language resources, a multilingual dataset for language engineering research and development, focus...
Tomaz Erjavec
KDD
2000
ACM
149views Data Mining» more  KDD 2000»
14 years 1 days ago
Efficient clustering of high-dimensional data sets with application to reference matching
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...
Andrew McCallum, Kamal Nigam, Lyle H. Ungar