Language Identification of Short Text Segments with N-gram Models


Download www.lrec-conf.org

There are many accurate methods for language identification of long text samples, but identification of very short strings still presents a challenge. This paper studies a language identification task in which the test samples have only 5

Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpioja


Education | Language Identification | LREC 2010 | N-gram Model | Ranking Method


» Language Identification on the Web: Extending the Dictionary Method

» A Machine Text-Inspired Machine Learning Approach for Identification of Transmembrane Helix...

» Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

» A Handwritten Character Extraction Algorithm for Multilanguage Document Image

» Web article extraction for web printing: a DOM+visual based approach

» German Compound Analysis with wfsc

» Mining models of human activities from the web

» Creation of topic map by identifying topic chain in Chinese

Post Info

Added	20 May 2011
Updated	20 May 2011
Type	Journal
Year	2010
Where	LREC
Authors	Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpioja
