Sciweavers

Search results for: Urdu Word Segmentation (689 results, page 27 of 138)
COLING
1996
13 years 8 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages such as Thai, Japanese and Korean that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
ACL
1998
13 years 8 months ago
Text Segmentation Using Reiteration and Collocation
A method is presented for segmenting text into subtopic areas. The proportion of related pairwise words is calculated between adjacent windows of text to determine their lexical s...
Amanda C. Jobbins, Lindsay J. Evett
EMNLP
2010
13 years 5 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han
PKDD
2009
Springer
14 years 1 month ago
Omiotis: A Thesaurus-Based Measure of Text Relatedness
In this paper we present a new approach for measuring the relatedness between text segments, based on implicit semantic links between their words, as offered by a word thesaurus, n...
George Tsatsaronis, Iraklis Varlamis, Michalis Vaz...
EMNLP
2009
13 years 5 months ago
Using Morphological and Syntactic Structures for Chinese Opinion Analysis
This paper employs morphological structures and relations between sentence segments for opinion analysis on words and sentences. Chinese words are classified into eight morphologi...
Lun-Wei Ku, Ting-Hao Huang, Hsin-Hsi Chen