Sciweavers

22 search results - page 1 / 5
» Reduced n-gram Models for English and Chinese Corpora
Sort
View
ACL
2006
13 years 8 months ago
Reduced n-gram Models for English and Chinese Corpora
Statistical language models should improve as the size of the n-grams increases from 3 to 5 or higher. However, the number of parameters and calculations, and the storage requirem...
Le Quan Ha, Philip Hanna, Darryl Stewart, F. Jack ...
COLING
2002
13 years 7 months ago
Learning Chinese Bracketing Knowledge Based on a Bilingual Language Model
This paper proposes a new method for automatic acquisition of Chinese bracketing knowledge from English-Chinese sentencealigned bilingual corpora. Bilingual sentence pairs are fir...
Yajuan Lü, Sheng Li, Tiejun Zhao, Muyun Yang
ANLP
2000
163views more  ANLP 2000»
13 years 8 months ago
Automatic construction of parallel English-Chinese corpus for cross-language information retrieval
A major obstacle to the construction of a probabilistic translation model is the lack of large parallel corpora. In this paper we first describe a parallel text mining system that...
Jiang Chen, Jian-Yun Nie
COLING
2008
13 years 8 months ago
Grammar Comparison Study for Translational Equivalence Modeling and Statistical Machine Translation
This paper presents a general platform, namely synchronous tree sequence substitution grammar (STSSG), for the grammar comparison study in Translational Equivalence Modeling (TEM)...
Min Zhang, Hongfei Jiang, Haizhou Li, AiTi Aw, She...
ACL
2009
13 years 5 months ago
Unsupervised Multilingual Grammar Induction
We investigate the task of unsupervised constituency parsing from bilingual parallel corpora. Our goal is to use bilingual cues to learn improved parsing models for each language ...
Benjamin Snyder, Tahira Naseem, Regina Barzilay