Search Sciweavers | Sciweavers

1038 search results - page 14 / 208

» A Genetic Algorithm for Clustering on Very Large Data Sets

GECCO
2006
Springer

186 views, Optimization

Characterizing large text corpora using a maximum variation sampling genetic algorithm

15 years 9 months ago

Download aser.ornl.gov

An enormous amount of information available via the Internet exists. Much of this data is in the form of text-based documents. These documents cover a variety of topics that are v...

Robert M. Patton, Thomas E. Potok

KES
2008
Springer

123 views, Information Technology

An Algorithm to Assess the Reliability of Hierarchical Clusters in Gene Expression Data

15 years 5 months ago

Download eprints.pascal-network.org

The validation of clusters discovered in bio-molecular data is a central issue in bioinformatics. Recently, stability-based methods have been successfully applied to the analysis o...

Roberto Avogadri, Matteo Brioschi, Francesca Ruffi...

KDD
2000
ACM

149 views, Data Mining

Efficient clustering of high-dimensional data sets with application to reference matching

15 years 8 months ago

Download www.kamalnigam.com

Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...

Andrew McCallum, Kamal Nigam, Lyle H. Ungar

SIGMOD
2001
ACM

200 views, Database

Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering

16 years 5 months ago

Download www.cs.ualberta.ca

In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...

Markus M. Breunig, Hans-Peter Kriegel, Peer Krö...

VLDB
2005
ACM

118 views, Database

Selectivity Estimation for Fuzzy String Predicates in Large Data Sets

15 years 10 months ago

Download www.vldb2005.org

Many database applications have the emerging need to support fuzzy queries that ask for strings that are similar to a given string, such as “name similar to smith” and “tele...

Liang Jin, Chen Li
