This paper presents the structure of a cross-linguistic database of production data. The database contains annotated texts collected from a sample of fifteen different langu...
This paper presents DiZer, an automatic DIscourse analyZER for Brazilian Portuguese. Given a source text, the system automatically produces its corresponding rhetorical analysis, f...
Thiago Alexandre Salgueiro Pardo, Maria das Gra&cc...
This paper discusses the use of character images to determine the parameters of an image degradation model. The acute angles in character images provide information used to find ...
Hierarchies provide a means of organizing, summarizing and accessing information. We describe a method for automatically generating hierarchies from small collections of text, and...
We describe a system for extracting mentions of terms, such as company and product names, from a large and noisy corpus of documents such as the World Wide Web. Since natural langua...
Einat Amitay, Rani Nelken, Wayne Niblack, Ron Siva...