Sentence-level aligned parallel texts are important resources for a number of natural language processing (NLP) tasks and applications such as statistical machine translation and ...
Annotated corpora are valuable resources for NLP which are often costly to create. We introduce a method for transferring annotation from a morphologically annotated corpus of a so...
Structured document retrieval makes use of document components as the basis of the retrieval process, rather than complete documents. The inherent relationships between these comp...
Jane Reid, Mounia Lalmas, Karen Finesilver, Morten...
In spoken language processing, considerable research has been accomplished on language identi cation. Singing language identi cation is an important yet challenging area that has ...
XML databases often contain documents comprising structured text. Therefore, it is important to integrate "information retrieval style" query evaluation, which is well-s...