Parallel Corpora | Sciweavers

161

ACL
2012

218views Computational Linguistics» more ACL 2012»

ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora

13 years 9 months ago

The lack of parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated translation. A possible...

Marcis Pinnis, Radu Ion, Dan Stefanescu, Fangzhong...

claim paper

Read More »

150

click to vote

ACL
2012

158views Computational Linguistics» more ACL 2012»

A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining

13 years 9 months ago

Download www.ims.uni-stuttgart.de

We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efﬁcient, language pair independent and mines transliteration pairs i...

Hassan Sajjad, Alexander Fraser, Helmut Schmid

claim paper

Read More »

202

click to vote

EMNLP
2011

175views Natural Language Processing» more EMNLP 2011»

Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation

14 years 6 months ago

Download www.clsp.jhu.edu

Previous work has shown that high quality phrasal paraphrases can be extracted from bilingual parallel corpora. However, it is not clear whether bitexts are an appropriate resourc...

Juri Ganitkevitch, Chris Callison-Burch, Courtney ...

claim paper

Read More »

188

click to vote

EMNLP
2011

166views Natural Language Processing» more EMNLP 2011»

Multi-Source Transfer of Delexicalized Dependency Parsers

14 years 6 months ago

Download ryanmcd.com

We present a simple method for transferring dependency parsers from source languages with labeled training data to target languages without labeled training data. We ﬁrst demons...

Ryan T. McDonald, Slav Petrov, Keith Hall

claim paper

Read More »

214

click to vote

ACL
2011

202views Computational Linguistics» more ACL 2011»

An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment

14 years 10 months ago

Download www.ims.uni-stuttgart.de

We propose a language-independent method for the automatic extraction of transliteration pairs from parallel corpora. In contrast to previous work, our method uses no form of supe...

Hassan Sajjad, Alexander Fraser, Helmut Schmid

claim paper

Read More »

175

click to vote

COLING
2010

137views Computational Linguistics» more COLING 2010»

An Empirical Study on Web Mining of Parallel Data

15 years 1 months ago

Download www.aclweb.org

This paper1 presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...

Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim

claim paper

Read More »

160

click to vote

EMNLP
2009

123views Natural Language Processing» more EMNLP 2009»

Acquiring Translation Equivalences of Multiword Expressions by Normalized Correlation Frequencies

15 years 4 months ago

Download aclweb.org

In this paper, we present an algorithm for extracting translations of any given multiword expression from parallel corpora. Given a multiword expression to be translated, the meth...

Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason...

claim paper

Read More »

168

click to vote

COLING
2002

126views Computational Linguistics» more COLING 2002»

A Cheap and Fast Way to Build Useful Translation Lexicons

15 years 6 months ago

Download acl.ldc.upenn.edu

The paper presents a statistical approach to automatic building of translation lexicons from parallel corpora. We briefly describe the pre-processing steps, a baseline iterative m...

Dan Tufis

claim paper

Read More »

175

click to vote

COLING
2002

106views Computational Linguistics» more COLING 2002»

Extracting Word Sequence Correspondences with Support Vector Machines

15 years 6 months ago

Download acl.ldc.upenn.edu

This paper proposes a learning and extracting method of word sequence correspondences from non-aligned parallel corpora with Support Vector Machines, which have high ability of th...

Kengo Sato, Hiroaki Saito

claim paper

Read More »

208

click to vote

IPM
2006

171views more IPM 2006»

Automatic extraction of bilingual word pairs using inductive chain learning in various languages

15 years 6 months ago

Download sig.media.eng.hokudai.ac.jp

In this paper, we propose a new learning method for extracting bilingual word pairs from parallel corpora in various languages. In cross-language information retrieval, the system...

Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers