Sciweavers

ACL
2008
13 years 9 months ago
Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations
We propose using large-scale clustering of dependency relations between verbs and multiword nouns (MNs) to construct a gazetteer for named entity recognition (NER). Since dependen...
Jun'ichi Kazama, Kentaro Torisawa
ACL
2008
13 years 9 months ago
Phrase Table Training for Precision and Recall: What Makes a Good Phrase and a Good Phrase Pair?
In this work, the problem of extracting phrase translation is formulated as an information retrieval process implemented with a log-linear model aiming for a balanced precision an...
Yonggang Deng, Jia Xu, Yuqing Gao
ACL
2008
13 years 9 months ago
Semi-Supervised Sequential Labeling and Segmentation Using Giga-Word Scale Unlabeled Data
This paper provides evidence that the use of more unlabeled data in semi-supervised learning can improve the performance of Natural Language Processing (NLP) tasks, such as part-o...
Jun Suzuki, Hideki Isozaki
ACL
2008
13 years 9 months ago
Learning Bilingual Lexicons from Monolingual Corpora
We present a method for learning bilingual translation lexicons from monolingual corpora. Word types in each language are characterized by purely monolingual features, such as con...
Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatric...
ACL
2008
13 years 9 months ago
A Critical Reassessment of Evaluation Baselines for Speech Summarization
We assess the current state of the art in speech summarization, by comparing a typical summarizer on two different domains: lecture data and the SWITCHBOARD corpus. Our results ca...
Gerald Penn, Xiaodan Zhu
ACL
2008
13 years 9 months ago
Cohesive Phrase-Based Decoding for Statistical Machine Translation
Phrase-based decoding produces state-of-the-art translations with no regard for syntax. We add syntax to this process with a cohesion constraint based on a dependency tree for the ...
Colin Cherry
ACL
2008
13 years 9 months ago
Parsing Noun Phrase Structure with CCG
Statistical parsing of noun phrase (NP) structure has been hampered by a lack of gold-standard data. This is a significant problem for CCGbank, where binary branching NP derivation...
David Vadas, James R. Curran
ACL
2008
13 years 9 months ago
A Deductive Approach to Dependency Parsing
We define a new formalism, based on Sikkel's parsing schemata for constituency parsers, that can be used to describe, analyze and compare dependency parsing algorithms. This ...
Carlos Gómez-Rodríguez, John A. Carr...
ACL
2008
13 years 9 months ago
An Entity-Mention Model for Coreference Resolution with Inductive Logic Programming
The traditional mention-pair model for coreference resolution cannot capture information beyond mention pairs for both learning and testing. To deal with this problem, we present ...
Xiaofeng Yang, Jian Su, Jun Lang, Chew Lim Tan, Ti...
ACL
2008
13 years 9 months ago
Joint Word Segmentation and POS Tagging Using a Single Perceptron
For Chinese POS tagging, word segmentation is a preliminary step. To avoid error propagation and improve segmentation by utilizing POS information, segmentation and tagging can be...
Yue Zhang, Stephen Clark