Sciweavers

45 search results - page 6 / 9
» Building a Web Corpus of Czech
Sort
View
EMNLP
2009
13 years 5 months ago
Using the Web for Language Independent Spellchecking and Autocorrection
We have designed, implemented and evaluated an end-to-end system spellchecking and autocorrection system that does not require any manually annotated training data. The World Wide...
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged E...
ICASSP
2009
IEEE
14 years 2 months ago
Efficacy of a constantly adaptive language modeling technique for web-scale applications
In this paper, we describe CALM, a method for building statistical language models for the Web. CALM addresses several unique challenges dealing with the Web contents. First, CALM...
Kuansan Wang, Xiaolong Li
ACL
2008
13 years 9 months ago
Mining Parenthetical Translations from the Web by Word Alignment
Documents in languages such as Chinese, Japanese and Korean sometimes annotate terms with their translations in English inside a pair of parentheses. We present a method to extrac...
Dekang Lin, Shaojun Zhao, Benjamin Van Durme, Mari...
INLG
2010
Springer
13 years 5 months ago
Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation
Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the ...
Anja Belz, Eric Kow
EUROCON
2011
281views Hardware» more  EUROCON 2011»
12 years 11 months ago
Browsing videos by automatically detected audio events
—This paper focuses on Audio Event Detection (AED), a research area which aims to substantially enhance the access to audio in multimedia content. With the ever-growing quantity ...
Virginia Barbosa, Thomas Pellegrini, Miguel Bugalh...