Sciweavers

700 search results - page 39 / 140
» Language Model Based Arabic Word Segmentation
Sort
View
CICLING
2004
Springer
14 years 7 days ago
Language-Independent Methods for Compiling Monolingual Lexical Data
Abstract: In this paper we describe a flexible, portable and languageindependent infrastructure for setting up large monolingual language corpora. The approach is based on collecti...
Christian Biemann, Stefan Bordag, Gerhard Heyer, U...
EMNLP
2010
13 years 6 months ago
It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment
We reveal a previously unnoticed connection between dependency parsing and statistical machine translation (SMT), by formulating the dependency parsing task as a problem of word a...
Samuel Brody
CLEF
2008
Springer
13 years 10 months ago
Allomorfessor: Towards Unsupervised Morpheme Analysis
Many modern natural language processing applications would benefit from automatic morphological analysis of words, especially when dealing with morphologically rich languages. Con...
Oskar Kohonen, Sami Virpioja, Mikaela Klami
EMNLP
2009
13 years 6 months ago
Discriminative Corpus Weight Estimation for Machine Translation
Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...
ACL
1998
13 years 10 months ago
The Production of Code-Mixed Discourse
We propose a comprehensive theory of codemixed discourse, encompassing equivalencepoint and insertional code-switching, palindromic constructions and lexical borrowing. The starti...
David Sankoff