Sciweavers

CLEF
2011
Springer
12 years 11 months ago
Adapting Statistical Language Identification Methods for Short Queries
This paper describes the participation of the UAIC team in the LogCLEF 2011 initiative, language identification task. Our approach is an aggregation of known methods for recognizing la...
Alexandru-Lucian Gînsca, Emanuela Boros, Adr...
LREC
2010
13 years 6 months ago
Language Identification of Short Text Segments with N-gram Models
There are many accurate methods for language identification of long text samples, but identification of very short strings still presents a challenge. This paper studies a languag...
Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpi...
NAACL
2010
13 years 9 months ago
Language Identification: The Long and the Short of the Matter
Language identification is the task of identifying the language a given document is written in. This paper describes a detailed examination of what models perform best under diffe...
Timothy Baldwin, Marco Lui
NAACL
2010
13 years 9 months ago
Language identification of names with SVMs
The task of identifying the language of text or utterances has a number of applications in natural language processing. Language identification has traditionally been approached w...
Aditya Bhargava, Grzegorz Kondrak
ENGL
2008
13 years 11 months ago
Identifying Perceptually Similar Languages Using Teager Energy Based Cepstrum
Language Identification (LID) refers to the task of identifying an unknown language from the test utterances. In this paper, a new method of feature extraction, viz., Teager Energy...
Hemant A. Patil, T. K. Basu
CLEF
2010
Springer
14 years 16 days ago
Language Identification Strategies for Cross Language Information Retrieval
In our participation in the 2010 LogCLEF track we focused on the analysis of the European Library (TEL) logs and in particular we experimented with the identification of the natura...
Alessio Bosca, Luca Dini
CLIN
2001
14 years 25 days ago
Applying Monte Carlo Techniques to Language Identification
Two major stages in language identification systems can be identified: the language modeling stage, where the distinctive features of languages are determined and stored in...
Arjen Poutsma
LREC
2008
14 years 26 days ago
A Comparative Study on Language Identification Methods
In this paper we present two experiments conducted for comparison of different language identification algorithms. Short words-, frequent words- and n-gram-based approaches are co...
Lena Grothe, Ernesto William De Luca, Andreas Nü...
LREC
2010
14 years 27 days ago
The Problems of Language Identification within Hugely Multilingual Data Sets
As the data for more and more languages is finding its way into digital form, with an increasing amount of this data being posted to the Web, it has become possible to collect lan...
Fei Xia, Carrie Lewis, William D. Lewis
CIMCA
2006
IEEE
14 years 1 months ago
Identification of Document Language is Not yet a Completely Solved Problem
Existing Language Identification (LID) approaches do reach 100% precision, in most common situations, when dealing with documents written in just one language, and when those docu...
Joaquim Ferreira da Silva, Gabriel Pereira Lopes