Sciweavers

LREC
2010
13 years 9 months ago
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank
Corpora of sentences annotated with grammatical information have been deployed by extending the basic lexical and morphological data with increasingly complex information, such as...
António Branco, Francisco Costa, Joã...
LREC
2010
13 years 9 months ago
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH)
In this paper we report a way of constructing a translation corpus that contains not only source and target texts, but also draft and final versions of target texts, through the transl...
Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita, K...
LREC
2010
13 years 9 months ago
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese
Compilation of a 100-million-word balanced corpus called the Balanced Corpus of Contemporary Written Japanese (or BCCWJ) is underway at the National Institute for Japanese Langua...
Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama,...
LREC
2010
13 years 9 months ago
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank
In this paper, we present several ways to measure and evaluate the annotation and annotators, proposed and used during the building of the Czech part of the Prague Czech-English D...
Marie Mikulová, Jan Stepánek
LREC
2010
13 years 9 months ago
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
After a brief overview of the elements of modern grid computing, a number of common use-cases of natural language processing tasks running on the grid are presented, notably corpu...
Jan Jona Javorsek, Tomaz Erjavec
LREC
2010
13 years 9 months ago
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows
Many contemporary language technology systems are characterized by long pipelines of tools with complex dependencies. Too often, these workflows are implemented by ad hoc scripts;...
Jonathan H. Clark, Alon Lavie
LREC
2010
13 years 9 months ago
A Case Study on Interoperability for Language Resources and Applications
This paper reports our experience when integrating different resources and services into a grid environment. The use case we address implies the deployment of several NLP application...
Marta Villegas, Núria Bel, Santiago Bel, V...
LREC
2010
13 years 9 months ago
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
In this paper we investigate the impact of data size on a Word Sense Disambiguation (WSD) task. We question the assumption that the knowledge acquisition bottleneck, which is known...
Ines Rehbein, Josef Ruppenhofer
LREC
2010
13 years 9 months ago
Enhancing Language Resources with Maps
We will look at how maps can be integrated in research resources, such as language databases and language corpora. By using maps, search results can be illustrated in a way that i...
Janne Bondi Johannessen, Kristin Hagen, Anders N...