Sciweavers

801 search results - page 10 / 161
» The Inefficiency of Batch Training for Large Training Sets
NAACL
2004
13 years 9 months ago
Name Tagging with Word Clusters and Discriminative Training
We present a technique for augmenting annotated training data with hierarchical word clusters that are automatically derived from a large unannotated corpus. Cluster membership is...
Scott Miller, Jethran Guinness, Alex Zamanian
NIPS
2008
13 years 9 months ago
Generative versus discriminative training of RBMs for classification of fMRI images
Neuroimaging datasets often have a very large number of voxels and a very small number of training cases, which means that overfitting of models for this data can become a very se...
Tanya Schmah, Geoffrey E. Hinton, Richard S. Zemel...
NAACL
2001
13 years 9 months ago
Applying Co-Training Methods to Statistical Parsing
We propose a novel Co-Training method for statistical parsing. The algorithm takes as input a small corpus (9695 sentences) annotated with parse trees, a dictionary of possible le...
Anoop Sarkar
PKDD
2009
Springer
14 years 2 months ago
Sparse Kernel SVMs via Cutting-Plane Training
We explore an algorithm for training SVMs with Kernels that can represent the learned rule using arbitrary basis vectors, not just the support vectors (SVs) from the training set. ...
Thorsten Joachims, Chun-Nam John Yu
ICPR
2004
IEEE
14 years 8 months ago
Off-line Handwritten Textline Recognition Using a Mixture of Natural and Synthetic Training Data
In this paper the problem of off-line handwritten cursive text recognition is considered. A method for expanding the set of available training textlines by applying random perturb...
Tamás Varga, Horst Bunke