LREC
2008
First Broadcast News Transcription System for Khmer Language
In this paper we present an overview on the development of a large vocabulary continuous speech recognition (LVCSR) system for Khmer, the official language of Cambodia, spoken by ...
Sopheap Seng, Sethserey Sam, Laurent Besacier, Bri...
LREC
2008
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
We report on the construction of a gold-standard dataset consisting of annotated clinical notes suitable for evaluating our biomedical named entity recognition system. The dataset...
Philip V. Ogren, Guergana K. Savova, Christopher G...
LREC
2008
Named Entity Recognition for Digitised Historical Texts
We describe and evaluate a prototype system for recognising person and place names in digitised records of British parliamentary proceedings from the late 17th and early 19th cent...
Claire Grover, Sharon Givon, Richard Tobin, Julian...
LREC
2008
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
Distributional, corpus-based descriptions have frequently been applied to model aspects of word meaning. However, distributional models that use corpus data as their basis have on...
Michael Roth, Sabine Schulte im Walde
LREC
2008
A Multi-Word Term Extraction Program for Arabic Language
Terminology extraction commonly includes two steps: identification of term-like units in the texts, mostly multi-word phrases, and the ranking of the extracted term-like units acc...
Siham Boulaknadel, Béatrice Daille, Driss A...
LREC
2008
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Developing resources which can be used for Natural Language Processing is an extremely difficult task for any language, but is even more so for less privileged (or less computeriz...
Anil Kumar Singh, Kiran Pala, Harshit Surana
LREC
2008
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection
The influence of English as a global language continues to grow to an extent that its words and expressions permeate the original forms of other languages. This paper evaluates a ...
Beatrice Alex
LREC
2008
Semiotic-based Ontology Evaluation Tool (S-OntoEval)
The objective of the Semiotic-based Ontology Evaluation Tool (S-OntoEval) is to evaluate and propose improvements to a given ontological model. The evaluation aims at assessing th...
Renata Dividino, Massimo Romanelli, Daniel Sonntag
LREC
2008
Spatiotemporal Annotation Using MiniSTEx: how to deal with Alternative, Foreign, Vague and/or Obsolete Names?
We are currently developing MiniSTEx, a spatiotemporal annotation system to handle temporal and/or geospatial information directly and indirectly expressed in texts. In the end, t...
Ineke Schuurman
LREC
2008
Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis
We present results from using Random Indexing for Latent Semantic Analysis to handle Singular Value Decomposition tractability issues. We compare Latent Semantic Analysis, Random ...
Linus Sellberg, Arne Jönsson