In the "Sandglass" MT architecture, we identify the class of monosemous Japanese functional expressions and use it in translating Japanese functional exp...
Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, T...
The Enron Email Corpus provides real-world text in the business email domain, which is a target domain for many speech and language applications. We present a section ...
A large effort has been devoted to the development of textual knowledge acquisition (KA) tools, but it is still difficult to assess the progress that has been made. The lack of we...
In this paper we bring to light a novel intersection between corpus linguistics and behavioral data that can be employed as an evaluation metric for resources for low-density lang...
This paper introduces a new lexicographic resource, the MuLeXFoR database, which aims to present word-formation processes in a multilingual environment. Morphological items repres...