training set | Sciweavers

186

GECCO
2008
Springer

184views Optimization» more GECCO 2008»

Analysis of mammography reports using maximum variation sampling

15 years 7 months ago

A genetic algorithm (GA) was developed to implement a maximum variation sampling technique to derive a subset of data from a large dataset of unstructured mammography reports. It ...

Robert M. Patton, Barbara G. Beckerman, Thomas E. ...

claim paper

Read More »

190

click to vote

GECCO
2008
Springer

137views Optimization» more GECCO 2008»

Informative sampling for large unbalanced data sets

15 years 7 months ago

Download www.cs.uvm.edu

Selective sampling is a form of active learning which can reduce the cost of training by only drawing informative data points into the training set. This selected training set is ...

Zhenyu Lu, Anand I. Rughani, Bruce I. Tranmer, Jos...

claim paper

Read More »

173

click to vote

CLEF
2010
Springer

136views Information Technology» more CLEF 2010»

ZOT! to Wikipedia Vandalism - Lab Report for PAN at CLEF 2010

15 years 7 months ago

Download clef2010.org

Abstract This vandalism detector uses features primarily derived from a wordpreserving differencing of the text for each Wikipedia article from before and after the edit, along wit...

James White, Rebecca Maessen

claim paper

Read More »

211

click to vote

CLEF
2010
Springer

159views Information Technology» more CLEF 2010»

The Wroclaw University of Technology Participation at ImageCLEF 2010 Photo Annotation Track

15 years 7 months ago

Download clef2010.org

Abstract. In this paper we present three methods for image autoannotation used by the Wroclaw University of Technology group at ImageCLEF 2010 Photo Annotation track. All of our ex...

Michal Stanek, Oskar Maier, Halina Kwasnicka

claim paper

Read More »

179

click to vote

COLING
2000

108views Computational Linguistics» more COLING 2000»

Estimation of Stochastic Attribute-Value Grammars using an Informative Sample

15 years 8 months ago

Download acl.ldc.upenn.edu

We argue that some of the computational complexity associated with estimation of stochastic attributevalue grammars can be reduced by training upon an informative subset of the fu...

Miles Osborne

claim paper

Read More »

191

click to vote

PICS
2003

79views Image Processing» more PICS 2003»

Selection of Training Sets for the Characterisation of Multispectral Imaging Systems

15 years 8 months ago

Download www.ivl.disco.unimib.it

To establish a correlation between the system output and the corresponding reflectance, the system characterisation functionDeriving the actual multispectral data from the output o...

Paolo Pellegri, Gianluca Novati, Raimondo Schettin...

claim paper

Read More »

167

click to vote

NIPS
2004

152views Information Technology» more NIPS 2004»

Breaking SVM Complexity with Cross-Training

15 years 8 months ago

Download books.nips.cc

We propose to selectively remove examples from the training set using probabilistic estimates related to editing algorithms (Devijver and Kittler, 1982). This heuristic procedure ...

Gökhan H. Bakir, Léon Bottou, Jason We...

claim paper

Read More »

194

click to vote

NCI
2004

188views Neural Networks» more NCI 2004»

Training set optimization in 3D human face recognition by RBF neural networks

15 years 8 months ago

Download www.inf.ufsc.br

In the Neural Networks approach by Radial Basis Function - RBF, the property of interpolation between faces, their variation, and the diversity of faces helps to minimize the outp...

Antonio C. Zimmermann, L. S. Encinas, L. O. Marin,...

claim paper

Read More »

179

click to vote

LWA
2004

144views Software Engineering» more LWA 2004»

Modeling Rule Precision

15 years 8 months ago

Download www.ofai.at

This paper reports first results of an empirical study of the precision of classification rules on an independent test set. We generated a large number of rules using a general co...

Johannes Fürnkranz

claim paper

Read More »

186

click to vote

FLAIRS
2006

143views Artificial Intelligence» more FLAIRS 2006»

Using Validation Sets to Avoid Overfitting in AdaBoost

15 years 8 months ago

Download www.cs.utsa.edu

AdaBoost is a well known, effective technique for increasing the accuracy of learning algorithms. However, it has the potential to overfit the training set because its objective i...

Tom Bylander, Lisa Tate

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers