Encoding Source Language with Convolutional Neural Network for Machine Translation

9 years 10 months ago

Download aclweb.org

The recently proposed neural network joint model (NNJM) (Devlin et al., 2014) augments the n-gram target language model with a heuristically chosen source context window, achieving state-of-the-art performance in SMT. In this paper, we give a more systematic treatment by summarizing the relevant source information through a convolutional architecture guided by the target information. With different guiding signals during decoding, our speciﬁcally designed convolution+gating architectures can pinpoint the parts of a source sentence that are relevant to predicting a target word, and fuse them with the context of entire source sentence to form a uniﬁed representation. This representation, together with target language words, are fed to a deep neural network (DNN) to form a stronger NNJM. Experiments on two NIST Chinese-English translation tasks show that the proposed model can achieve signiﬁcant improvements over the previous NNJM by up

Fandong Meng, Zhengdong Lu, Mingxuan Wang, Hang Li

Real-time Traffic

ACL 2015 | Computational Linguistics |

claim paper

Post Info
More Details (n/a)

Added	13 Apr 2016
Updated	13 Apr 2016
Type	Journal
Year	2015
Where	ACL
Authors	Fandong Meng, Zhengdong Lu, Mingxuan Wang, Hang Li, Wenbin Jiang, Qun Liu

Comments (0)

Sciweavers

Encoding Source Language with Convolutional Neural Network for Machine Translation

ACL 2015 | Computational Linguistics |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers