Sciweavers

692 search results - page 76 / 139
» Hierarchical exploration of large multivariate data sets
Sort
View
SDM
2007
SIAM
86views Data Mining» more  SDM 2007»
13 years 10 months ago
Identifying Bundles of Product Options using Mutual Information Clustering
Mass-produced goods tend to be highly standardized in order to maximize manufacturing efficiencies. Some high-value goods with limited production quantities remain much less stand...
Claudia Perlich, Saharon Rosset
ICDE
2001
IEEE
128views Database» more  ICDE 2001»
14 years 10 months ago
Counting Twig Matches in a Tree
We describe efficient algorithms for accurately estimating the number of matches of a small node-labeled tree, i.e., a twig, in a large node-labeled tree, using a summary data str...
Zhiyuan Chen, H. V. Jagadish, Flip Korn, Nick Koud...
BMCBI
2006
120views more  BMCBI 2006»
13 years 8 months ago
An improved distance measure between the expression profiles linking co-expression and co-regulation in mouse
Background: Many statistical algorithms combine microarray expression data and genome sequence data to identify transcription factor binding motifs in the low eukaryotic genomes. ...
Ryung S. Kim, Hongkai Ji, Wing Hung Wong
ICDE
2012
IEEE
267views Database» more  ICDE 2012»
11 years 11 months ago
Scalable and Numerically Stable Descriptive Statistics in SystemML
—With the exponential growth in the amount of data that is being generated in recent years, there is a pressing need for applying machine learning algorithms to large data sets. ...
Yuanyuan Tian, Shirish Tatikonda, Berthold Reinwal...
SDM
2003
SIAM
129views Data Mining» more  SDM 2003»
13 years 10 months ago
Approximate Query Answering by Model Averaging
In earlier work we have introduced and explored a variety of different probabilistic models for the problem of answering selectivity queries posed to large sparse binary data set...
Dmitry Pavlov, Padhraic Smyth