Sciweavers

1313 search results - page 12 / 263
» Intelligent Selection of Language Model Training Data
Sort
View
123
Voted
CEC
2010
IEEE
15 years 3 months ago
An analysis of clustering objectives for feature selection applied to encrypted traffic identification
This work explores the use of clustering objectives in a Multi-Objective Genetic Algorithm (MOGA) for both, feature selection and cluster count optimization, under the application...
Carlos Bacquet, A. Nur Zincir-Heywood, Malcolm I. ...
AAAI
2007
15 years 4 months ago
Learning Language Semantics from Ambiguous Supervision
This paper presents a method for learning a semantic parser from ambiguous supervision. Training data consists of natural language sentences annotated with multiple potential mean...
Rohit J. Kate, Raymond J. Mooney
ICMCS
2007
IEEE
133views Multimedia» more  ICMCS 2007»
15 years 8 months ago
Data Modeling Strategies for Imbalanced Learning in Visual Search
In this paper we examine a novel approach to the difficult problem of querying video databases using visual topics with few examples. Typically with visual topics, the examples a...
Jelena Tesic, Apostol Natsev, Lexing Xie, John R. ...
ACL
1996
15 years 3 months ago
Minimizing Manual Annotation Cost in Supervised Training from Corpora
Corpus-based methods for natural language processing often use supervised training, requiring expensive manual annotation of training corpora. This paper investigates methods for ...
Sean P. Engelson, Ido Dagan
105
Voted
AAAI
2000
15 years 3 months ago
Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm
Selecting the right word translation among several options in the lexicon is a core problem for machine translation. We present a novel approach to this problem that can be traine...
Philipp Koehn, Kevin Knight