Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

150

ACL
2009

99views Computational Linguistics» more ACL 2009»

A Beam-Search Extraction Algorithm for Comparable Data

15 years 4 months ago

A Beam-Search Extraction Algorithm for Comparable Data

Download www.aclweb.org

This paper extends previous work on extracting parallel sentence pairs from comparable data (Munteanu and Marcu, 2005). For a given source sentence S, a maximum entropy (ME) classifier is applied to a large set of candidate target translations . A beam-search algorithm is used to abandon target sentences as non-parallel early on during classification if they fall outside the beam. This way, our novel algorithm avoids any document-level prefiltering step. The algorithm increases the number of extracted parallel sentence pairs significantly, which leads to a BLEU improvement of about 1 % on our SpanishEnglish data.

Christoph Tillmann

Real-time Traffic

ACL 2009 | Algorithm | Computational Linguistics | Parallel Sentence | Parallel Sentence Pairs |

claim paper

Related Content

» Algorithms for the Extraction of Synteny Blocks from Comparative Maps

» Comparing Two Recommender Algorithms with the Help of Recommendations by Peers

» A Direct Evolutionary Feature Extraction Algorithm for Classifying High Dimensional Data

» Comparing Intended and Real Usage in Web Portal Temporal Logic and Data Mining

» Semisupervised extractive speech summarization via cotraining algorithm

» COMPACT A Comparative Package for Clustering Assessment

» Comparative study of spine vertebra shape retrieval using learningbased feature selection

» Extraction of fuzzy rules from trained neural network using evolutionary algorithm

» A Bayesian Approach to Building Footprint Extraction from Aerial LIDAR Data

Post Info
More Details (n/a)

Added	16 Feb 2011
Updated	16 Feb 2011
Type	Journal
Year	2009
Where	ACL
Authors	Christoph Tillmann

Comments (0)