Sciweavers

1038 search results - page 23 / 208
» A Genetic Algorithm for Clustering on Very Large Data Sets
Sort
View
GECCO
2008
Springer
184views Optimization» more  GECCO 2008»
13 years 9 months ago
Analysis of mammography reports using maximum variation sampling
A genetic algorithm (GA) was developed to implement a maximum variation sampling technique to derive a subset of data from a large dataset of unstructured mammography reports. It ...
Robert M. Patton, Barbara G. Beckerman, Thomas E. ...
RECOMB
2003
Springer
14 years 9 months ago
Large scale reconstruction of haplotypes from genotype data
Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymo...
Eleazar Eskin, Eran Halperin, Richard M. Karp
IPPS
2002
IEEE
14 years 1 months ago
Parallel EST Clustering
Expressed sequence tags, abbreviated ESTs, are DNA fragments experimentally derived from expressed portions of genes. Clustering of ESTs is essential for gene recognition and unde...
Anantharaman Kalyanaraman, Srinivas Aluru, Suresh ...
ICDM
2003
IEEE
154views Data Mining» more  ICDM 2003»
14 years 1 months ago
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However, pattern-...
Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wan...
WSDM
2012
ACM
252views Data Mining» more  WSDM 2012»
12 years 4 months ago
WebSets: extracting sets of entities from the web using unsupervised information extraction
We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...
Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...