training data | Sciweavers


NIPS
1994

124views Information Technology» more NIPS 1994»

From Data Distributions to Regularization in Invariant Learning

15 years 3 months ago

Ideally, pattern recognition machines provide constant output when the inputs are transformed under a group G of desired invariances. These invariances can be achieved by enhancing...

Todd K. Leen


NAACL
1994

127views Computational Linguistics» more NAACL 1994»

Tree-Based State Tying for High Accuracy Modelling

15 years 3 months ago

Download acl.ldc.upenn.edu

The key problem to be faced when building a HMM-based continuous speech recogniser is maintaining the balance between model complexity and available training data. For large vocab...

S. J. Young, J. J. Odell, Philip C. Woodland


ISMB
1993

116views Computational Biology» more ISMB 1993»

Knowledge-Based Generation of Machine-Learning Experiments: Learning with DNA Crystallography Data

15 years 3 months ago

Download www.aaai.org

Though it has been possible in the past to learn to predict DNA hydration patterns from crystallographic data, there is ambiguity in the choice of training data (both in terms of th...

Dawn M. Cohen, Casimir A. Kulikowski, Helen Berman


AAAI
2000

125views Intelligent Agents» more AAAI 2000»

Self-Supervised Learning for Visual Tracking and Recognition of Human Hand

15 years 3 months ago

Download www.aaai.org

Due to the large variation and richness of visual inputs, statistical learning gets more and more concerned in the practice of visual processing such as visual tracking and recogn...

Ying Wu, Thomas S. Huang


NAACL
2003

113views Computational Linguistics» more NAACL 2003»

A Web-Trained Extraction Summarization System

15 years 3 months ago

Download acl.ldc.upenn.edu

A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...

Liang Zhou, Eduard H. Hovy


NAACL
2003

118views Computational Linguistics» more NAACL 2003»

Example Selection for Bootstrapping Statistical Parsers

15 years 3 months ago

Download www.cs.pitt.edu

This paper investigates bootstrapping for statistical parsers to reduce their reliance on manually annotated training data. We consider both a mostly-unsupervised approach, co-tra...

Mark Steedman, Rebecca Hwa, Stephen Clark, Miles O...


NAACL
2003

131views Computational Linguistics» more NAACL 2003»

Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures

15 years 3 months ago

Download crow.ee.washington.edu

Sources of training data suitable for language modeling of conversational speech are limited. In this paper, we show how training data can be supplemented with text from the web ...

Ivan Bulyko, Mari Ostendorf, Andreas Stolcke


FLAIRS
2004

178views Artificial Intelligence» more FLAIRS 2004»

Transductive LSI for Short Text Classification Problems

15 years 3 months ago

Download www.cs.csi.cuny.edu

This paper presents work that uses Transductive Latent Semantic Indexing (LSI) for text classification. In addition to relying on labeled training data, we improve classification ...

Sarah Zelikovitz


SDM
2007
SIAM

85views Data Mining» more SDM 2007»

Kernel Based Detection of Mislabeled Training Examples

15 years 3 months ago

Download www.cse.msu.edu

The problem of identifying mislabeled training examples has been examined in several studies, with a variety of approaches developed for editing the training data to obtain better...

Hamed Valizadegan, Pang-Ning Tan


NAACL
2007

103views Computational Linguistics» more NAACL 2007»

Detection of Non-Native Sentences Using Machine-Translated Training Data

15 years 3 months ago

Download www.aclweb.org

Training statistical models to detect non-native sentences requires a large corpus of non-native writing samples, which is often not readily available. This paper examines the exte...

John Lee, Ming Zhou, Xiaohua Liu
