There are many accurate methods for language identification of long text samples, but identification of very short strings still presents a challenge. This paper studies a language identification task, in which the test samples have only 5
Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpi