Sciweavers

CICLING
2007
Springer
14 years 1 months ago
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization
Abstract. Classification by genre and domain is a major field of research for Information Retrieval (scientific and technical watch, data mining, etc.) and the selection of app...
Guillaume Cleuziou, Céline Poudat
CICLING
2007
Springer
14 years 1 months ago
NEO-CORTEX: A Performant User-Oriented Multi-Document Summarization System
Abstract. This paper discusses an approach to topic-oriented multidocument summarization. It investigates the effectiveness of using additional information about the document set ...
Florian Boudin, Juan Manuel Torres Moreno
CICLING
2007
Springer
14 years 1 months ago
Enhancing Cross-Language Question Answering by Combining Multiple Question Translations
One major problem of state-of-the-art Cross-Language Question Answering systems is the translation of user questions. This paper proposes combining the potential of multiple transl...
Rita M. Aceves-Pérez, Manuel Montes-y-Gó...
CICLING
2007
Springer
14 years 1 months ago
Adapting the JIRS Passage Retrieval System to the Arabic Language
The need for a Passage Retrieval (PR) system for Arabic texts stems mainly from our aim to build an Arabic Question Answering (QA) system in our research team. We have ch...
Yassine Benajiba, Paolo Rosso, José Manuel ...
CICLING
2007
Springer
14 years 1 months ago
ANERsys: An Arabic Named Entity Recognition System Based on Maximum Entropy
Abstract. The task of Named Entity Recognition (NER) is to identify proper names as well as temporal and numeric expressions in open-domain text. NER systems proved to be v...
Yassine Benajiba, Paolo Rosso, José-Miguel ...
CICLING
2007
Springer
14 years 1 months ago
Morphological Disambiguation of Turkish Text with Perceptron Algorithm
Abstract. This paper describes the application of the perceptron algorithm to the morphological disambiguation of Turkish text. Turkish has a productive derivational morphology. Du...
Hasim Sak, Tunga Güngör, Murat Saraclar
CICLING
2007
Springer
14 years 1 months ago
Characterizing Humour: An Exploration of Features in Humorous Texts
This paper investigates the problem of automatic humour recognition, and provides an in-depth analysis of two of the most frequently observed features of humorous text: human-cent...
Rada Mihalcea, Stephen G. Pulman
CICLING
2007
Springer
14 years 1 months ago
A Mixed Trigrams Approach for Context Sensitive Spell Checking
This paper addresses the problem of real-word spell checking, i.e., the detection and correction of typos that result in real words of the target language. This paper proposes a me...
Davide Fossati, Barbara Di Eugenio
CICLING
2007
Springer
14 years 1 months ago
The Non-associativity of Polarized Tree-Based Grammars
Abstract. Polarities are used to sanction grammar fragment combination in high level tree-based formalisms such as eXtensible MetaGrammar (XMG) and polarized unification grammars...
Yael Cohen-Sygal, Shuly Wintner
CICLING
2007
Springer
14 years 1 months ago
Handling Conjunctions in Named Entities
Although the literature contains reports of very high accuracy figures for the recognition of named entities in text, there are still some named entity phenomena that remain probl...
Robert Dale, Pawel P. Mazur