Sciweavers

ACL
2007
Fast Unsupervised Incremental Parsing
This paper describes an incremental parser and an unsupervised learning algorithm for inducing this parser from plain text. The parser uses a representation for syntactic structur...
Yoav Seginer
ACL
2007
Ordering Phrases with Function Words
This paper presents a Function Word centered, Syntax-based (FWS) solution to address phrase ordering in the context of statistical machine translation (SMT). Motivated by the obse...
Hendra Setiawan, Min-Yen Kan, Haizhou Li
ACL
2007
An Ensemble Method for Selection of High Quality Parses
While the average performance of statistical parsers gradually improves, they still attach to many sentences annotations of rather low quality. The number of such sentences grows ...
Roi Reichart, Ari Rappoport
ACL
2007
Semantic Transliteration of Personal Names
Words of foreign origin are referred to as borrowed words or loanwords. A loanword is usually imported to Chinese by phonetic transliteration if a translation is not easily availa...
Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, Minghui Do...
ACL
2007
Guiding Semi-Supervision with Constraint-Driven Learning
Over the last few years, two of the main research directions in machine learning of natural language processing have been the study of semi-supervised learning algorithms as a way...
Ming-Wei Chang, Lev-Arie Ratinov, Dan Roth
ACL
2007
Bilingual-LSA Based LM Adaptation for Spoken Language Translation
We propose a novel approach to crosslingual language model (LM) adaptation based on bilingual Latent Semantic Analysis (bLSA). A bLSA model is introduced which enables latent topi...
Yik-Cheung Tam, Ian R. Lane, Tanja Schultz
ACL
2007
A Hybrid Approach to Word Segmentation and POS Tagging
In this paper, we present a hybrid method for word segmentation and POS tagging. The target languages are those in which word boundaries are ambiguous, such as Chinese and Japanes...
Tetsuji Nakagawa, Kiyotaka Uchimoto
ACL
2007
Word Sense Disambiguation Improves Statistical Machine Translation
Recent research presents conflicting evidence on whether word sense disambiguation (WSD) systems can help to improve the performance of statistical machine translation (MT) syste...
Yee Seng Chan, Hwee Tou Ng, David Chiang
ACL
2007
Generating Usable Formats for Metadata and Annotations in a Large Meeting Corpus
The AMI Meeting Corpus is now publicly available, including manual annotation files generated in the NXT XML format, but lacking explicit metadata for the 171 meetings of the cor...
Andrei Popescu-Belis, Paula Estrella
ACL
2007
A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing
This paper presents a comparative study of five parameter estimation algorithms on four NLP tasks. Three of the five algorithms are well-known in the computational linguistics com...
Jianfeng Gao, Galen Andrew, Mark Johnson, Kristina...