Systems that automatically process sign language rely on appropriate data. We therefore present the ATIS sign language corpus that is based on the domain of air travel information...
Jan Bungeroth, Daniel Stein, Philippe Dreuw, Herma...
Truecasing is the process of restoring case information to badly-cased or noncased text. This paper explores truecasing issues and proposes a statistical, language modeling based ...
Lucian Vlad Lita, Abraham Ittycheriah, Salim Rouko...
Supporting multiple languages and natural language processing are of high importance in information systems. This paper discusses software tools for the generation of languages pro...
The implementation of a recently suggested class of structural change tests, which test for parameter instability in general parametric models, in the R language for statistical c...
Real-world natural language sentences are long and complex, and always contain unexpected grammatical constructions. They even include noise and ungrammaticality. This paper descri...