In this paper, we present a learning approach to the scenario template task of information extraction, where information filling one template could come from multiple sentences. ...
Word alignment plays a crucial role in statistical machine translation. Word-aligned corpora have been found to be an excellent source of translation-related knowledge. We present...
This paper proposes the use of uncertainty reduction in machine learning methods such as co-training and bilingual bootstrapping, which are referred to, in a general term, as ‘c...
Pipelined Natural Language Generation (NLG) systems have grown increasingly complex as architectural modules were added to support language functionalities such as referring expre...
This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of ...
Recent text and speech processing applications such as speech mining raise new and more general problems related to the construction of language models. We present and describe in...