Sciweavers

738 search results - page 82 / 148
» E-CAST: A Data Mining Algorithm for Gene Expression Data
Sort
View
SDM
2010
SIAM
184views Data Mining» more  SDM 2010»
13 years 9 months ago
A Robust Decision Tree Algorithm for Imbalanced Data Sets
We propose a new decision tree algorithm, Class Confidence Proportion Decision Tree (CCPDT), which is robust and insensitive to class distribution and generates rules which are st...
Wei Liu, Sanjay Chawla, David A. Cieslak, Nitesh V...
KDID
2003
93views Database» more  KDID 2003»
13 years 9 months ago
A Framework for Frequent Sequence Mining under Generalized Regular Expression Constraints
This paper provides a framework for the extraction of frequent sequences satisfying a given regular expression (RE) constraint. We take advantage of the information contained in th...
Hunor Albert-Lorincz, Jean-François Boulica...
VLDB
2001
ACM
139views Database» more  VLDB 2001»
14 years 8 days ago
NetCube: A Scalable Tool for Fast Data Mining and Compression
We propose an novel method of computing and storing DataCubes. Our idea is to use Bayesian Networks, which can generate approximate counts for any query combination of attribute v...
Dimitris Margaritis, Christos Faloutsos, Sebastian...
AIIA
2005
Springer
14 years 1 months ago
Towards Fault-Tolerant Formal Concept Analysis
Given Boolean data sets which record properties of objects, Formal Concept Analysis is a well-known approach for knowledge discovery. Recent application domains, e.g., for very lar...
Ruggero G. Pensa, Jean-François Boulicaut
BMCBI
2006
120views more  BMCBI 2006»
13 years 7 months ago
A factor analysis model for functional genomics
Background: Expression array data are used to predict biological functions of uncharacterized genes by comparing their expression profiles to those of characterized genes. While b...
Rafal Kustra, Romy Shioda, Mu Zhu