
13 years 7 months ago
The Contribution of Stylistic Information to Content-based Mobile Spam Filtering
Content-based approaches to detecting mobile spam to date have focused mainly on analyzing the topical aspect of an SMS message (what it is about) but not on the stylistic aspect (...
Dae-Neung Sohn, Jung-Tae Lee, Hae-Chang Rim
13 years 7 months ago
Generalizing over Lexical Features: Selectional Preferences for Semantic Role Classification
This paper explores methods to alleviate the effect of lexical sparseness in the classification of verbal arguments. We show how automatically generated selectional preferences ar...
Beñat Zapirain, Eneko Agirre, Lluís ...
13 years 7 months ago
Comparing the Accuracy of CCG and Penn Treebank Parsers
We compare the CCG parser of Clark and Curran (2007) with a state-of-the-art Penn Treebank (PTB) parser. An accuracy comparison is performed by converting the CCG derivations into...
Stephen Clark, James R. Curran
13 years 7 months ago
Composite Kernels For Relation Extraction
The automatic extraction of relations between entities expressed in natural language text is an important problem for IR and text understanding. In this paper we show how differen...
Frank Reichartz, Hannes Korte, Gerhard Paass
13 years 7 months ago
A Note on the Implementation of Hierarchical Dirichlet Processes
The implementation of collapsed Gibbs samplers for non-parametric Bayesian models is non-trivial, requiring considerable book-keeping. Goldwater et al. (2006a) presented an approx...
Phil Blunsom, Trevor Cohn, Sharon Goldwater, Mark ...
13 years 7 months ago
A Beam-Search Extraction Algorithm for Comparable Data
This paper extends previous work on extracting parallel sentence pairs from comparable data (Munteanu and Marcu, 2005). For a given source sentence S, a maximum entropy (ME) class...
Christoph Tillmann
13 years 7 months ago
Improved Smoothing for N-gram Language Models Based on Ordinary Counts
Kneser-Ney (1995) smoothing and its variants are generally recognized as having the best perplexity of any known method for estimating N-gram language models. Kneser-Ney smoothing...
Robert C. Moore, Chris Quirk
13 years 7 months ago
A Framework for Entailed Relation Recognition
We define the problem of recognizing entailed relations
Dan Roth, Mark Sammons, V. G. Vinod Vydiswaran
13 years 7 months ago
Where's the Verb? Correcting Machine Translation During Question Answering
When a multi-lingual question-answering (QA) system provides an answer that has been incorrectly translated, it is very likely to be regarded as irrelevant. In this paper, we prop...
Wei-Yun Ma, Kathy McKeown
13 years 7 months ago
Realistic Grammar Error Simulation using Markov Logic
The development of Dialog-Based Computer-Assisted Language Learning (DB-CALL) systems requires research on the simulation of language learners. This paper presents a new method for...
Sungjin Lee, Gary Geunbae Lee