
13 years 11 months ago
A Supervised Algorithm for Verb Disambiguation into VerbNet Classes
VerbNet (VN) is a major large-scale English verb lexicon. Mapping verb instances to their VN classes has been proven useful for several NLP tasks. However, verbs are polysemous wi...
Omri Abend, Roi Reichart, Ari Rappoport
13 years 11 months ago
An Improved Hierarchical Bayesian Model of Language for Document Classification
This paper addresses the fundamental problem of document classification, and we focus attention on classification problems where the classes are mutually exclusive. In the course ...
Ben Allison
13 years 11 months ago
Parsing the SynTagRus Treebank of Russian
We present the first results on parsing the SYNTAGRUS treebank of Russian with a data-driven dependency parser, achieving a labeled attachment score of over 82% and an unlabeled a...
Joakim Nivre, Igor Boguslavsky, Leonid L. Iomdin
13 years 11 months ago
Normalizing SMS: are Two Metaphors Better than One ?
Electronic written texts used in computer-mediated interactions (e-mails, blogs, chats, etc.) present major deviations from the norm of the language. This paper presents a comparat...
Catherine Kobus, François Yvon, Géra...
13 years 11 months ago
Exploiting Constituent Dependencies for Tree Kernel-Based Semantic Relation Extraction
This paper proposes a new approach to dynamically determine the tree span for tree kernel-based semantic relation extraction. It exploits constituent dependencies to keep the node...
Longhua Qian, Guodong Zhou, Fang Kong, Qiaoming Zh...
13 years 11 months ago
Bayesian Semi-Supervised Chinese Word Segmentation for Statistical Machine Translation
Words in Chinese text are not naturally separated by delimiters, which poses a challenge to standard machine translation (MT) systems. In MT, the widely used approach is to apply ...
Jia Xu, Jianfeng Gao, Kristina Toutanova, Hermann ...
13 years 11 months ago
A Method for Automatic POS Guessing of Chinese Unknown Words
This paper proposes a method for automatic POS (part-of-speech) guessing of Chinese unknown words. It contains two models. The first model uses a machine-learning method to predict...
Likun Qiu, Changjian Hu, Kai Zhao
13 years 11 months ago
Authorship Attribution and Verification with Many Authors and Limited Data
Most studies in statistical or machine learning based authorship attribution focus on two or a few authors. This leads to an overestimation of the importance of the features extra...
Kim Luyckx, Walter Daelemans
13 years 11 months ago
On Robustness and Domain Adaptation using SVD for Word Sense Disambiguation
In this paper we explore robustness and domain adaptation issues for Word Sense Disambiguation (WSD) using Singular Value Decomposition (SVD) and unlabeled data. We focus on the s...
Eneko Agirre, Oier Lopez de Lacalle