
13 years 11 months ago
Sentence Compression as a Step in Summarization or an Alternative Path in Text Shortening
The originality of this work lies in tackling text compression with an unsupervised method based on deep linguistic analysis, without resorting to a learning corpus. This...
Mehdi Yousfi Monod, Violaine Prince
13 years 11 months ago
A Scalable MMR Approach to Sentence Scoring for Multi-Document Update Summarization
We present SMMR, a scalable sentence scoring method for query-oriented update summarization. Sentences are scored using a criterion combining query relevance and dissimilarity...
Florian Boudin, Marc El-Bèze, Juan Manuel T...
13 years 11 months ago
Explaining Similarity of Terms
Computing the similarity between entities is a core component of many NLP tasks such as measuring the semantic similarity of terms for generating a distributional thesaurus. In th...
Vishnu Vyas, Patrick Pantel
13 years 11 months ago
Quantification and Implication in Semantic Calendar Expressions Represented with Finite-State Transducers
This paper elaborates a model for representing semantic calendar expressions (SCEs), which correspond to the intensional meanings of natural-language calendar phrases. The model u...
Jyrki Niemi, Kimmo Koskenniemi
13 years 11 months ago
Experiments in Base-NP Chunking and Its Role in Dependency Parsing for Thai
This paper studies the role of base-NP information in dependency parsing for Thai. The baseline performance reveals that the base-NP chunking task for Thai is much more difficult ...
Shisanu Tongchim, Virach Sornlertlamvanich, Hitosh...
13 years 11 months ago
On the Weak Generative Capacity of Weighted Context-free Grammars
It is shown how weighted context-free grammars can be used to recognize languages beyond their weak generative capacity by a one-step, constant-time extension of standard recogniti...
Anders Søgaard
13 years 11 months ago
Detecting Erroneous Uses of Complex Postpositions in an Agglutinative Language
This work presents the development of a system that detects incorrect uses of complex postpositions in Basque, an agglutinative language. Error detection in complex postpositions ...
Arantza Díaz de Ilarraza Sánchez, Ko...
13 years 11 months ago
Rank Distance as a Stylistic Similarity
In this paper we propose a new distance function (rank distance) designed to reflect stylistic similarity between texts. To assess the ability of this distance measure to capture ...
Marius Popescu, Liviu Petrisor Dinu
13 years 11 months ago
The Power of Negative Thinking: Exploiting Label Disagreement in the Min-cut Classification Framework
Treating classification as seeking minimum cuts in the appropriate graph has proven effective in a number of applications. The power of this approach lies in its ability to incorp...
Mohit Bansal, Claire Cardie, Lillian Lee
13 years 11 months ago
Scaling up Analogical Learning
Recent years have witnessed a growing interest in analogical learning for NLP applications. Although the principle of analogical learning is quite simple, it involves complex steps ...
Philippe Langlais, François Yvon