Training a good text detector requires a large amount of labeled data, which can be very expensive to obtain. Cotraining has been shown to be a powerful semi-supervised learning t...
In this paper we describe two related approaches to estimating the sample sizes required to statistically compare the performance of two classifiers: acceptable failure rates (AFR...
Weconsider tile automatedidentification of transmembrane domains in membrane protein sequences. 324 proteins (containing 1585 segrrmnts) werc examined, representing every protein ...
We present a tool that predicts whether the software under development inside an IDE has a bug. An IDE plugin performs this prediction, using the Change Classification technique t...
—We document methods for the quantitative evaluation of systems that produce a scalar summary of a biometric sample’s quality. We are motivated by a need to test claims that qu...