Sciweavers

144 search results - page 9 / 29
» Improved Source-Channel Models for Chinese Word Segmentation
NAACL
2007
13 years 9 months ago
Hybrid Models for Semantic Classification of Chinese Unknown Words
This paper addresses the problem of classifying Chinese unknown words into fine-grained semantic categories defined in a Chinese thesaurus. We describe three novel knowledge-based...
Xiaofei Lu
ACL
2010
13 years 6 months ago
Jointly Optimizing a Two-Step Conditional Random Field Model for Machine Transliteration and Its Fast Decoding Algorithm
This paper presents a joint optimization method of a two-step conditional random field (CRF) model for machine transliteration and a fast decoding algorithm for the proposed metho...
Dong Yang, Paul R. Dixon, Sadaoki Furui
EMNLP
2004
13 years 9 months ago
A New Approach for English-Chinese Named Entity Alignment
Traditional word alignment approaches cannot come up with satisfactory results for Named Entities. In this paper, we propose a novel approach using a maximum entropy model for nam...
Donghui Feng, Yajuan Lü, Ming Zhou
IRAL
2003
ACM
14 years 1 months ago
Issues in pre- and post-translation document expansion: untranslatable cognates and missegmented words
Query expansion by pseudo-relevance feedback is a well-established technique in both mono- and cross-lingual information retrieval, enriching and disambiguating the typically ter...
Gina-Anne Levow
NAACL
2010
13 years 5 months ago
Joint Parsing and Alignment with Weakly Synchronized Grammars
Syntactic machine translation systems extract rules from bilingual, word-aligned, syntactically parsed text, but current systems for parsing and word alignment are at best cascade...
David Burkett, John Blitzer, Dan Klein