Sciweavers

207 search results - page 39 / 42
» On Bootstrapping the ROC Curve
BMCBI
2008
13 years 8 months ago
Comparison study on k-word statistical measures for protein: From sequence to 'sequence space'
Background: Many proposed statistical measures can efficiently compare protein sequence to further infer protein structure, function and evolutionary information. They share the s...
Qi Dai, Tian-Ming Wang
DATAMINE
2008
13 years 8 months ago
Automatically countering imbalance and its empirical relationship to cost
Learning from imbalanced datasets presents a convoluted problem both from the modeling and cost standpoints. In particular, when a class is of great interest but occurs relatively...
Nitesh V. Chawla, David A. Cieslak, Lawrence O. Ha...
BMCBI
2006
13 years 8 months ago
Searching for interpretable rules for disease mutations: a simulated annealing bump hunting strategy
Background: Understanding how amino acid substitutions affect protein functions is critical for the study of proteins and their implications in diseases. Although methods have bee...
Rui Jiang, Hua Yang, Fengzhu Sun, Ting Chen
CSL
2006
Springer
13 years 8 months ago
A study in machine learning from imbalanced data for sentence boundary detection in speech
Enriching speech recognition output with sentence boundaries improves its human readability and enables further processing by downstream language processing modules. We have const...
Yang Liu, Nitesh V. Chawla, Mary P. Harper, Elizab...
BMCBI
2007
13 years 8 months ago
Combining classifiers to predict gene function in Arabidopsis thaliana using large-scale gene expression measurements
Background: Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these ...
Hui Lan, Rachel Carson, Nicholas J. Provart, Antho...