training sets | Sciweavers

208

TAL
2010
Springer

127views Natural Language Processing» more TAL 2010»

Summarization as Feature Selection for Document Categorization on Small Datasets

15 years 4 months ago

Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...

Emmanuel Anguiano-Hernández, Luis Villase&n...

claim paper

Read More »

174

click to vote

ML
2000
ACM

97views Machine Learning» more ML 2000»

Randomizing Outputs to Increase Prediction Accuracy

15 years 6 months ago

Download oz.berkeley.edu

Bagging and boosting reduce error by changing both the inputs and outputs to form perturbed training sets, grow predictors on these perturbed training sets and combine them. A que...

Leo Breiman

claim paper

Read More »

197

click to vote

COLING
2000

157views Computational Linguistics» more COLING 2000»

Word Sense Disambiguation of Adjectives Using Probabilistic Networks

15 years 8 months ago

Download acl.ldc.upenn.edu

In this paper, word sense dismnbiguation (WSD) accuracy achievable by a probabilistic classifier, using very milfimal training sets, is investigated. \Ve made the assuml)tiou that...

Gerald Chao, Michael G. Dyer

claim paper

Read More »

193

click to vote

BMVC
2001

183views Computer Vision» more BMVC 2001»

An Information Theoretic Approach to Statistical Shape Modelling

15 years 9 months ago

Download www.isbe.man.ac.uk

Statistical shape models have been used widely as a basis for segmenting and interpreting images. A major drawback of the approach is the need to establish a set of dense correspo...

Rhodri H. Davies, Timothy F. Cootes, Carole J. Twi...

claim paper

Read More »

180

click to vote

DRR
2010

132views Document Analysis» more DRR 2010»

Time and space optimization of document content classifiers

15 years 9 months ago

Download www.cse.lehigh.edu

Scaling up document-image classifiers to handle an unlimited variety of document and image types poses serious challenges to conventional trainable classifier technologies. Highly...

Dawei Yin, Henry S. Baird, Chang An

claim paper

Read More »

167

click to vote

CIKM
2006
Springer

110views Information Technology» more CIKM 2006»

Performance thresholding in practical text classification

15 years 10 months ago

Download www.ims.uni-stuttgart.de

In practical classification, there is often a mix of learnable and unlearnable classes and only a classifier above a minimum performance threshold can be deployed. This problem is...

Hinrich Schütze, Emre Velipasaoglu, Jan O. Pe...

claim paper

Read More »

182

click to vote

ICDAR
2005
IEEE

125views Document Analysis» more ICDAR 2005»

Enhancing Training Data for Handwriting Recognition of Whiteboard Notes with Samples from a Different Database

16 years 4 days ago

Download www.iam.unibe.ch

Recognition of unconstrained handwritten text is still a challenge. In this paper we consider a new problem, which is the recognition of notes written on a whiteboard. Our recogni...

Marcus Liwicki, Horst Bunke

claim paper

Read More »

170

click to vote

MICAI
2007
Springer

111views Artificial Intelligence» more MICAI 2007»

Taking Advantage of the Web for Text Classification with Imbalanced Classes

16 years 21 days ago

Download ccc.inaoep.mx

A problem of supervised approaches for text classification is that they commonly require high-quality training data to construct an accurate classifier. Unfortunately, in many real...

Rafael Guzmán-Cabrera, Manuel Montes-y-G&oa...

claim paper

Read More »

188

click to vote

PKDD
2009
Springer

118views Data Mining» more PKDD 2009»

Sparse Kernel SVMs via Cutting-Plane Training

16 years 1 months ago

Download www.cs.cornell.edu

We explore an algorithm for training SVMs with Kernels that can represent the learned rule using arbitrary basis vectors, not just the support vectors (SVs) from the training set. ...

Thorsten Joachims, Chun-Nam John Yu

claim paper

Read More »

154

click to vote

MLDM
2009
Springer

92views Machine Learning» more MLDM 2009»

Improved Comprehensibility and Reliability of Explanations via Restricted Halfspace Discretization

16 years 1 months ago

Download www.utdallas.edu

A number of two-class classiﬁcation methods ﬁrst discretize each attribute of two given training sets and then construct a propositional DNF formula that evaluates to True for ...

Klaus Truemper

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers