Sciweavers

501 search results - page 19 / 101
» Improving Language Models by Clustering Training Sentences
ICMLC
2010
Springer
13 years 6 months ago
Approaches to improving corpus quality for statistical machine translation
The performance of a statistical machine translation (SMT) system heavily depends on the quantity and quality of the bilingual language resource. However, the previous work mainl...
Peng Liu, Yu Zhou, Chengqing Zong
ICASSP
2010
IEEE
13 years 6 months ago
Spoken language translation from parallel speech audio: Simultaneous interpretation as SLT training data
In recent work, we proposed an alternative to parallel text as translation model (TM) training data: audio recordings of parallel speech (pSp), as it occurs in any communication s...
Matthias Paulik, Alex Waibel
ACL
2011
12 years 11 months ago
Automated Whole Sentence Grammar Correction Using a Noisy Channel Model
Automated grammar correction techniques have seen improvement over the years, but there is still much room for increased performance. Current correction techniques mainly focus on...
Y. Albert Park, Roger Levy
LREC
2008
13 years 9 months ago
UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging
Based on simple methods such as observing word and part of speech tag co-occurrence and clustering, we generate syntactic parses of sentences in an entirely unsupervised and self-...
Christian Hänig, Stefan Bordag, Uwe Quasthoff
EMNLP
2009
13 years 5 months ago
Supervised Models for Coreference Resolution
Traditional learning-based coreference resolvers operate by training a mention-pair classifier for determining whether two mentions are coreferent or not. Two independent lines of ...
Md. Altaf ur Rahman, Vincent Ng