Morphologically complex terms composed from Greek or Latin elements are frequent in scientific and technical texts. Word forming units are thus relevant cues for the identificatio...
Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the ...
This paper presents a methodology for automatic learning of ontologies from Thai text corpora, by extraction of terms and relations. A shallow parser is used to chunk texts on whic...
This paper discusses the influence of the corpus on the automatic identification of proper names in texts. Techniques developed for the newswire genre are generally not sufficient...
: We demonstrate an approach and an accompanying UNIX toolbox for performing wtrious kinds of Knowledge tT,xlractions and Structuring. The goal is to "practically" enhanc...
Antoine Ogonowski, Marie Luce Herviou, Eva Dauphin