Current system combination methods usually use confusion networks to find consensus translations among different systems. Requiring one-to-one mappings between the words in candid...
Yang Feng, Yang Liu, Haitao Mi, Qun Liu, Yajuan L&...
The pipeline of most Phrase-Based Statistical Machine Translation (PB-SMT) systems starts from automatically word aligned parallel corpus. But word appears to be too fine-grained ...
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...
We achieved a state of the art performance in statistical machine translation by using a large number of features with an online large-margin training algorithm. The millions of p...
Taro Watanabe, Jun Suzuki, Hajime Tsukada, Hideki ...
This paper proposes to use monolingual collocations to improve Statistical Machine Translation (SMT). We make use of the collocation probabilities, which are estimated from monoli...