Sciweavers

LREC 2008
Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors
This paper presents an algorithm for correcting language errors typical of second-language learners. We focus on preposition errors, which are very common among second-language le...
Matthieu Hermet, Alain Désilets, Stan Szpak...
LREC 2008
Low-Density Language Bootstrapping: the Case of Tajiki Persian
Low-density languages raise difficulties for standard approaches to natural language processing that depend on large online corpora. Using Persian as a case study, we propose a no...
Karine Megerdoomian, Dan Parvaz
LREC 2008
First Broadcast News Transcription System for Khmer Language
In this paper we present an overview on the development of a large vocabulary continuous speech recognition (LVCSR) system for Khmer, the official language of Cambodia, spoken by ...
Sopheap Seng, Sethserey Sam, Laurent Besacier, Bri...
LREC 2008
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
We report on the construction of a gold-standard dataset consisting of annotated clinical notes suitable for evaluating our biomedical named entity recognition system. The dataset...
Philip V. Ogren, Guergana K. Savova, Christopher G...
LREC 2008
Named Entity Recognition for Digitised Historical Texts
We describe and evaluate a prototype system for recognising person and place names in digitised records of British parliamentary proceedings from the late 17th and early 19th cent...
Claire Grover, Sharon Givon, Richard Tobin, Julian...
LREC 2008
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
Distributional, corpus-based descriptions have frequently been applied to model aspects of word meaning. However, distributional models that use corpus data as their basis have on...
Michael Roth, Sabine Schulte im Walde
LREC 2008
A Multi-Word Term Extraction Program for Arabic Language
Terminology extraction commonly includes two steps: identification of term-like units in the texts, mostly multi-word phrases, and the ranking of the extracted term-like units acc...
Siham Boulaknadel, Béatrice Daille, Driss A...
LREC 2008
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Developing resources which can be used for Natural Language Processing is an extremely difficult task for any language, but is even more so for less privileged (or less computeriz...
Anil Kumar Singh, Kiran Pala, Harshit Surana
LREC 2008
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection
The influence of English as a global language continues to grow to an extent that its words and expressions permeate the original forms of other languages. This paper evaluates a ...
Beatrice Alex
LREC 2008
Semiotic-based Ontology Evaluation Tool (S-OntoEval)
The objective of the Semiotic-based Ontology Evaluation Tool (S-OntoEval) is to evaluate and propose improvements to a given ontological model. The evaluation aims at assessing th...
Renata Dividino, Massimo Romanelli, Daniel Sonntag