Sciweavers

139 search results - page 12 / 28
» Information-Theoretic Segmentation of Natural Language
Sort
View
TSD
2001
Springer
15 years 8 months ago
Text Segmentation into Paragraphs Based on Local Text Cohesion
The problem of automatic text segmentation is subcategorized into two different problems: thematic segmentation into rather large topically selfcontained sections and splitting int...
Igor A. Bolshakov, Alexander F. Gelbukh
FSMNLP
2005
Springer
15 years 9 months ago
TAGH: A Complete Morphology for German Based on Weighted Finite State Automata
TAGH is a system for automatic recognition of German word forms. It is based on a stem lexicon with allomorphs and a concatenative mechanism for inflection and word formation. Wei...
Alexander Geyken, Thomas Hanneforth
EMNLP
2010
15 years 2 months ago
Predicting the Semantic Compositionality of Prefix Verbs
In many applications, replacing a complex word form by its stem can reduce sparsity, revealing connections in the data that would not otherwise be apparent. In this paper, we focu...
Shane Bergsma, Aditya Bhargava, Hua He, Grzegorz K...
EMNLP
2010
15 years 2 months ago
Lessons Learned in Part-of-Speech Tagging of Conversational Speech
This paper examines tagging models for spontaneous English speech transcripts. We analyze the performance of state-of-the-art tagging models, either generative or discriminative, ...
Vladimir Eidelman, Zhongqiang Huang, Mary P. Harpe...
CICLING
2009
Springer
15 years 8 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus