Sciweavers

59 search results - page 4 / 12
» Acquisition of Morphology of an Indic Language from Text Cor...
Sort
View
LREC
2008
70views Education» more  LREC 2008»
13 years 8 months ago
Process Model for Composing High-quality Text Corpora
The Teko corpus composing model offers a decentralized, dynamic way of collecting high-quality text corpora for linguistic research. The resulting corpus consists of independent t...
Mikko Lounela
ANLP
2000
126views more  ANLP 2000»
13 years 8 months ago
Compound Noun Segmentation Based on Lexical Data Extracted from Corpus
Compound noun analysis is one of the crucial problems in Korean language processing because a series of nouns in Korean may appear without white space in real texts, which makes i...
Juntae Yoon
COLING
2008
13 years 8 months ago
Verification and Implementation of Language-Based Deception Indicators in Civil and Criminal Narratives
Our goal is to use natural language processing to identify deceptive and nondeceptive passages in transcribed narratives. We begin by motivating an analysis of language-based dece...
Joan Bachenko, Eileen Fitzpatrick, Michael Schonwe...
LREC
2008
132views Education» more  LREC 2008»
13 years 8 months ago
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...
Michael Mohler, Rada Mihalcea
ACL
2009
13 years 5 months ago
Part of Speech Tagger for Assamese Text
Assamese is a morphologically rich, agglutinative and relatively free word order Indic language. Although spoken by nearly 30 million people, very little computational linguistic ...
Navanath Saharia, Dhrubajyoti Das, Utpal Sharma, J...