Sciweavers

501 search results - page 5 / 101
» Improving Language Models by Clustering Training Sentences
Sort
View
NAACL
2010
13 years 5 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
COLING
2010
13 years 2 months ago
A Vector Space Model for Subjectivity Classification in Urdu aided by Co-Training
The goal of this work is to produce a classifier that can distinguish subjective sentences from objective sentences for the Urdu language. The amount of labeled data required for ...
Smruthi Mukund, Rohini K. Srihari
NIPS
2000
13 years 9 months ago
A Neural Probabilistic Language Model
A goal of statistical language modeling is to learn the joint probability function of sequences of words in a language. This is intrinsically difficult because of the curse of dim...
Yoshua Bengio, Réjean Ducharme, Pascal Vinc...
EMNLP
2008
13 years 9 months ago
Refining Generative Language Models using Discriminative Learning
We propose a new approach to language modeling which utilizes discriminative learning methods. Our approach is an iterative one: starting with an initial language model, in each i...
Ben Sandbank
IALP
2010
13 years 2 months ago
Sentence Similarity-Based Source Context Modelling in PBSMT
Target phrase selection, a crucial component of the state-of-the-art phrase-based statistical machine translation (PBSMT) model, plays a key role in generating accurate translation...
Rejwanul Haque, Sudip Kumar Naskar, Andy Way, Mart...