Sciweavers

1313 search results - page 10 / 263
» Intelligent Selection of Language Model Training Data
Sort
View
INLG
2010
Springer
15 years 10 days ago
Cross-linguistic Attribute Selection for REG: Comparing Dutch and English
In this paper we describe a cross-linguistic experiment in attribute selection for referring expression generation. We used a graph-based attribute selection algorithm that was tr...
Mariët Theune, Ruud Koolen, Emiel Krahmer
125
Voted
PICS
2003
15 years 3 months ago
Selection of Training Sets for the Characterisation of Multispectral Imaging Systems
To establish a correlation between the system output and the corresponding reflectance, the system characterisation functionDeriving the actual multispectral data from the output o...
Paolo Pellegri, Gianluca Novati, Raimondo Schettin...
AI
2009
Springer
15 years 9 months ago
Training Global Linear Models for Chinese Word Segmentation
This paper examines how one can obtain state of the art Chinese word segmentation using global linear models. We provide experimental comparisons that give a detailed road-map for ...
Dong Song, Anoop Sarkar
110
Voted
TALIP
2002
108views more  TALIP 2002»
15 years 2 months ago
Toward a unified approach to statistical language modeling for Chinese
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...
EMNLP
2009
15 years 6 days ago
Less is More: Significance-Based N-gram Selection for Smaller, Better Language Models
The recent availability of large corpora for training N-gram language models has shown the utility of models of higher order than just trigrams. In this paper, we investigate meth...
Robert C. Moore, Chris Quirk