Sciweavers

LREC
2010
Linguistically Motivated Unsupervised Segmentation for Machine Translation
In this paper we use statistical machine translation and morphology information from two different morphological analyzers to try to improve translation quality by linguistically ...
Mark Fishel, Harri Kirik
LREC
2010
A Pilot Arabic CCGbank
We describe a process for converting the Penn Arabic Treebank into the CCG formalism. Previous efforts have yielded CCGbanks in English, German, and Turkish, thus opening these la...
Stephen A. Boxwell, Chris Brew
LREC
2010
Generating FrameNets of Various Granularities: The FrameNet Transformer
We present a method and a software tool, the FrameNet Transformer, for deriving customized versions of the FrameNet database based on frame and frame element relations. The FrameN...
Josef Ruppenhofer, Jonas Sunde, Manfred Pinkal
LREC
2010
Building a Node of the Accessible Language Technology Infrastructure
We present a limited prototype of the CLARIN Language Technology Infrastructure (LTI) node, which provides several types of web services for Polish. The functionality encompasses ...
Bartosz Broda, Michal Marcinczuk, Maciej Piasecki
LREC
2010
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet
Semantic lexicons and lexical ontologies are major resources in natural language processing. Developing such resources is a time-consuming task for which some automatic metho...
Mehrnoush Shamsfard, Hakimeh Fadaei, Elham Fekri
LREC
2010
A Japanese Particle Corpus Built by Example-Based Annotation
This paper is a report on an on-going project of creating a new corpus focusing on Japanese particles. The corpus will provide deeper syntactic/semantic information than the exist...
Hiroki Hanaoka, Hideki Mima, Jun-ichi Tsujii
LREC
2010
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a l...
Silvana Marianela Bernaola Biggio, Manuela Speranz...
LREC
2010
GikiCLEF: Crosscultural Issues in Multilingual Information Access
In this paper we describe GikiCLEF, the first evaluation contest that, to our knowledge, was specifically designed to expose and investigate cultural and linguistic issues involve...
Diana Santos, Luís Miguel Cabral, Corina Fo...
LREC
2010
Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus
After three years of work the Dutch Parallel Corpus (DPC) project has reached an end. The finalized corpus is a ten-million-word high-quality sentence-aligned bidirectional parall...
Orphée De Clercq, Maribel Montero Perez
LREC
2010
Using Comparable Corpora to Adapt a Translation Model to Domains
Statistical machine translation (SMT) requires a large parallel corpus, which is available only for restricted language pairs and domains. To expand the language pairs and domains...
Hiroyuki Kaji, Takashi Tsunakawa, Daisuke Okada