Language identification is the task of identifying the language a given document is written in. This paper describes a detailed examination of what models perform best under diffe...
There are many accurate methods for language identification of long text samples, but identification of very short strings still presents a challenge. This paper studies a languag...
Word form normalization through lemmatization or stemming is a standard procedure in information retrieval because morphological variation needs to be accounted for and several la...
It is frequently claimed that the human mind is organized in a modular fashion, a hypothesis linked historically, though not inevitably, to the claim that many aspects of the huma...