Sciweavers

ACL
2015

Learning Cross-lingual Word Embeddings via Matrix Co-factorization

8 years 7 months ago
Learning Cross-lingual Word Embeddings via Matrix Co-factorization
A joint-space model for cross-lingual distributed representations generalizes language-invariant semantic features. In this paper, we present a matrix cofactorization framework for learning cross-lingual word embeddings. We explicitly define monolingual training objectives in the form of matrix decomposition, and induce cross-lingual constraints for simultaneously factorizing monolingual matrices. The cross-lingual constraints can be derived from parallel corpora, with or without word alignments. Empirical results on a task of cross-lingual document classification show that our method is effective to encode cross-lingual knowledge as constraints for cross-lingual word embeddings.
Tianze Shi, Zhiyuan Liu, Yang Liu, Maosong Sun
Added 13 Apr 2016
Updated 13 Apr 2016
Type Journal
Year 2015
Where ACL
Authors Tianze Shi, Zhiyuan Liu, Yang Liu, Maosong Sun
Comments (0)