Sciweavers

2421 search results - page 6 / 485
» Measuring independence of datasets
BIOCOMP
2007
13 years 8 months ago
Biomarker Discovery Across Annotated and Unannotated Microarray Datasets Using Semi-Supervised Learning
The growing body of DNA microarray data has the potential to advance our understanding of the molecular basis of disease. However, annotating microarray datasets with clinically us...
Cole Harris, Noushin Ghaffari
SAC
2009
ACM
14 years 2 months ago
Capturing truthiness: mining truth tables in binary datasets
We introduce a new data mining problem: mining truth tables in binary datasets. Given a matrix of objects and the properties they satisfy, a truth table identifies a subset of pr...
Clifford Conley Owens III, T. M. Murali, Naren Ram...
ICDM
2006
IEEE
14 years 1 months ago
What is the Dimension of Your Binary Data?
Many 0/1 datasets have a very large number of variables; however, they are sparse and the dependency structure of the variables is simpler than the number of variables would sugge...
Nikolaj Tatti, Taneli Mielikäinen, Aristides ...
ICML
2008
IEEE
14 years 8 months ago
Fully distributed EM for very large datasets
In EM and related algorithms, E-step computations distribute easily, because data items are independent given parameters. For very large data sets, however, even storing all of th...
Jason Wolfe, Aria Haghighi, Dan Klein
CORR
2008
Springer
13 years 7 months ago
Determining the Unithood of Word Sequences using Mutual Information and Independence Measure
Most work related to unithood was conducted as part of a larger effort to determine termhood. Consequently, the number of independent studies that examine the notion ...
Wilson Wong, Wei Liu, Mohammed Bennamoun