Sciweavers

EMNLP
2009
13 years 10 months ago
Multi-Class Confidence Weighted Algorithms
The recently introduced online confidence-weighted (CW) learning algorithm for binary classification performs well on many binary NLP tasks. However, for multi-class problems CW l...
Koby Crammer, Mark Dredze, Alex Kulesza
EMNLP
2009
13 years 10 months ago
Reading to Learn: Constructing Features from Semantic Abstracts
Jacob Eisenstein, James Clarke, Dan Goldwasser, Da...
EMNLP
2009
13 years 10 months ago
Re-Ranking Models Based-on Small Training Data for Spoken Language Understanding
The design of practical language applications by means of statistical approaches requires annotated data, which is one of the most critical constraint. This is particularly true f...
Marco Dinarelli, Alessandro Moschitti, Giuseppe Ri...
EMNLP
2009
13 years 10 months ago
Statistical Estimation of Word Acquisition with Application to Readability Prediction
Models of language learning play a central role in a wide range of applications: from psycholinguistic theories of how people acquire new word knowledge, to information systems th...
Paul Kidwell, Guy Lebanon, Kevyn Collins-Thompson
EMNLP
2009
13 years 10 months ago
Self-Training PCFG Grammars with Latent Annotations Across Languages
We investigate the effectiveness of selftraining PCFG grammars with latent annotations (PCFG-LA) for parsing languages with different amounts of labeled training data. Compared to...
Zhongqiang Huang, Mary P. Harper
EMNLP
2009
13 years 10 months ago
A Simple Unsupervised Learner for POS Disambiguation Rules Given Only a Minimal Lexicon
We propose a new model for unsupervised POS tagging based on linguistic distinctions between open and closed-class items. Exploiting notions from current linguistic theory, the sy...
Qiuye Zhao, Mitch Marcus
EMNLP
2009
13 years 10 months ago
On the Use of Virtual Evidence in Conditional Random Fields
Virtual evidence (VE), first introduced by (Pearl, 1988), provides a convenient way of incorporating prior knowledge into Bayesian networks. This work generalizes the use of VE to...
Xiao Li
EMNLP
2009
13 years 10 months ago
Bilingual dictionary generation for low-resourced language pairs
Bilingual dictionaries are vital resources in many areas of natural language processing. Numerous methods of machine translation require bilingual dictionaries with large coverage...
István Varga, Shoichi Yokoyama
EMNLP
2009
13 years 10 months ago
Semi-supervised Semantic Role Labeling Using the Latent Words Language Model
Semantic Role Labeling (SRL) has proved to be a valuable tool for performing automatic analysis of natural language texts. Currently however, most systems rely on a large training...
Koen Deschacht, Marie-Francine Moens