Sciweavers

142 search results - page 3 / 29
» Contemporaneous text as side-information in statistical lang...
Sort
View
EMNLP
2010
13 years 8 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han
SIGIR
2009
ACM
14 years 5 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
GRAPHICSINTERFACE
2003
14 years 7 days ago
Input-based Language Modelling in the Design of High Performance Text Input Techniques
We present a critique of language-based modelling for text input research, and propose an alternative inputbased approach. Current language-based statistical models are derived fr...
R. William Soukoreff, I. Scott MacKenzie
IBERAMIA
2010
Springer
13 years 9 months ago
Improved Text Generation Using N-gram Statistics
Abstract. In Natural Language Generation (NLG) systems, a generalpurpose surface realisation module will usually require the underlying application to provide highly detailed input...
Eder Miranda de Novais, Thiago Dias Tadeu, Ivandr&...
ACL
2003
14 years 7 days ago
Generalized Algorithms for Constructing Statistical Language Models
Recent text and speech processing applications such as speech mining raise new and more general problems related to the construction of language models. We present and describe in...
Cyril Allauzen, Mehryar Mohri, Brian Roark