Sciweavers

20 search results - page 4 / 4
» An Automatically Built Named Entity Lexicon for Arabic
Sort
View
LREC
2010
189views Education» more  LREC 2010»
13 years 7 months ago
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
Availability of labeled language resources, such as annotated corpora and domain dependent labeled language resources is crucial for experiments in the field of Natural Language ...
Eric Charton, Juan Manuel Torres Moreno
LREC
2010
186views Education» more  LREC 2010»
13 years 10 months ago
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
This paper presents the EPAC corpus which is composed by a set of 100 hours of conversational speech manually transcribed and by the outputs of automatic tools (automatic segmenta...
Yannick Estève, Thierry Bazillon, Jean-Yves...
AAAI
2006
13 years 10 months ago
Learning Blocking Schemes for Record Linkage
Record linkage is the process of matching records across data sets that refer to the same entity. One issue within record linkage is determining which record pairs to consider, si...
Matthew Michelson, Craig A. Knoblock
KDD
2009
ACM
219views Data Mining» more  KDD 2009»
14 years 9 months ago
Structured correspondence topic models for mining captioned figures in biological literature
A major source of information (often the most crucial and informative part) in scholarly articles from scientific journals, proceedings and books are the figures that directly pro...
Amr Ahmed, Eric P. Xing, William W. Cohen, Robert ...
SIGIR
2009
ACM
14 years 3 months ago
Web derived pronunciations for spoken term detection
Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of application...
Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan...