Sciweavers

700 search results for "Language Model Based Arabic Word Segmentation" (page 26 of 140)
FLAIRS
2007
13 years 10 months ago
Combining Machine Learning with Linguistic Heuristics for Chinese Word Segmentation
This paper describes a hybrid model that combines machine learning with linguistic heuristics for integrating unknown word identification with Chinese word segmentation. The model...
Xiaofei Lu
ICASSP
2008
IEEE
14 years 2 months ago
Sentence segmentation and punctuation recovery for spoken language translation
Sentence segmentation and punctuation recovery are critical components for effective spoken language translation (SLT). In this paper we describe our recent work on sentence segme...
Matthias Paulik, Sharath Rao, Ian R. Lane, Stephan...
IALP
2010
13 years 5 months ago
A Proposed Model for Constructing a Yami WordNet
This paper describes an attempt to build a lexical database for the Yami language, an Austronesian endangered language. As the Yami language documentation and conservation project...
Meng-Chien Yang, D. Victoria Rau, Ann Hui-Huan Cha...
ICDAR
2007
IEEE
14 years 10 days ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
FSMNLP
2005
Springer
14 years 2 months ago
TAGH: A Complete Morphology for German Based on Weighted Finite State Automata
TAGH is a system for automatic recognition of German word forms. It is based on a stem lexicon with allomorphs and a concatenative mechanism for inflection and word formation. Wei...
Alexander Geyken, Thomas Hanneforth