Sciweavers

14577 search results - page 17 / 2916
» Statistical Language Modelling
Sort
View
EMNLP
2010
13 years 8 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han
ACL
2001
13 years 11 months ago
Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters
In this paper, a new language model, the Multi-Class Composite N-gram, is proposed to avoid a data sparseness problem for spoken language in that it is difficult to collect traini...
Hirofumi Yamamoto, Shuntaro Isogai, Yoshinori Sagi...
ECIR
2007
Springer
13 years 11 months ago
Natural Language Processing for Usage Based Indexing of Web Resources
Abstract. The identification of reliable and interesting items on Internet becomes more and more difficult and time consuming. This paper is a position paper describing our intend...
Anne Boyer, Armelle Brun
CICLING
2009
Springer
14 years 10 months ago
Enriching Statistical Translation Models Using a Domain-Independent Multilingual Lexical Knowledge Base
This paper presents a method for improving phrase-based Statistical Machine Translation systems by enriching the original translation model with information derived from a multilin...
Miguel García, Jesús Giménez,...
ACL
2009
13 years 8 months ago
Active Learning for Multilingual Statistical Machine Translation
Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneous...
Gholamreza Haffari, Anoop Sarkar