Sciweavers

59 search results - page 9 / 12
» Acquisition of Morphology of an Indic Language from Text Cor...
Sort
View
NLE
2008
118views more  NLE 2008»
13 years 7 months ago
Part-of-speech tagging of Modern Hebrew text
Words in Semitic texts often consist of a concatenation of word segments, each corresponding to a Part-of-Speech (POS) category. Semitic words may be ambiguous with regard to thei...
Roy Bar-Haim, Khalil Sima'an, Yoad Winter
LREC
2008
157views Education» more  LREC 2008»
13 years 8 months ago
AnCora: Multilevel Annotated Corpora for Catalan and Spanish
This paper presents AnCora, a multilingual corpus annotated at different linguistic levels consisting of 500,000 words in Catalan (AnCora-Ca) and in Spanish (AnCora-Es). At presen...
Mariona Taulé, Maria Antònia Mart&ia...
LREC
2008
117views Education» more  LREC 2008»
13 years 8 months ago
Swedish-Turkish Parallel Treebank
In this paper, we describe our work on building a parallel treebank for a less studied and typologically dissimilar language pair, namely Swedish and Turkish. The treebank is a ba...
Beáta Megyesi, Bengt Dahlqvist, Eva Petters...
EACL
2003
ACL Anthology
13 years 8 months ago
Detecting Novel Compounds: The Role of Distributional Evidence
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addr...
Mirella Lapata, Alex Lascarides
IR
2010
13 years 4 months ago
Sentence-level event classification in unstructured texts
The ability to correctly classify sentences that describe events is an important task for many natural language applications such as Question Answering (QA) and Text Summarisation....
Martina Naughton, Nicola Stokes, Joe Carthy