Performance of n-gram language models depends to a large extent on the amount of training text material available for building the models and the degree to which this text matches...
A Bloom filter (BF) is a randomised data structure for set membership queries. Its space requirements are significantly below lossless information-theoretic lower bounds but it ...
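To make the set-membership idea concrete, here is a minimal Python sketch of a Bloom filter. It is illustrative only and not the implementation from the abstract above: the class name `BloomFilter`, the parameters `num_bits` and `num_hashes`, and the choice of deriving hash positions by double hashing a SHA-256 digest are all assumptions made for this example. Adding an item sets a few bits; a query reports membership only if all of those bits are set, so an added item is never reported missing, while an item that was never added may occasionally be reported present.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: num_hashes bit positions per item in a num_bits array."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # Bit array packed into bytes, initially all zero.
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item: str):
        # Assumption for this sketch: derive the positions by double hashing
        # two 64-bit halves of a single SHA-256 digest.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        # Set every bit position associated with the item.
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # Report membership only if all associated bits are set.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


# Hypothetical usage: store a few n-grams and query them.
ngrams = ["the cat sat", "cat sat on", "sat on the"]
bf = BloomFilter(num_bits=1024, num_hashes=3)
for gram in ngrams:
    bf.add(gram)
print(all(gram in bf for gram in ngrams))
```

The filter stores only bits, never the items themselves, which is where the space saving comes from; the price is that queries for items that were never added can sometimes return a false positive.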
In this paper, we demonstrate that accurate machine translation is possible without the concept of “words,” treating MT as a problem of transformation between character string...
Graham Neubig, Taro Watanabe, Shinsuke Mori, Tatsu...