Sciweavers

620 search results - page 57 / 124
» Computing with words for text processing: An approach to the...
Sort
View
COLING
2002
13 years 8 months ago
Unknown Word Extraction for Chinese Documents
There is no blank to mark word boundaries in Chinese text. As a result, identifying words is difficult, because of segmentation ambiguities and occurrences of unknown words. Conve...
Keh-Jiann Chen, Wei-Yun Ma
WWW
2009
ACM
14 years 9 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
LREC
2010
183views Education» more  LREC 2010»
13 years 10 months ago
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet
Semantic lexicons and lexical ontologies are some major resources in natural language processing. Developing such resources are time consuming tasks for which some automatic metho...
Mehrnoush Shamsfard, Hakimeh Fadaei, Elham Fekri
LLL
1999
Springer
14 years 1 months ago
Learning to Lemmatise Slovene Words
Abstract. Automatic lemmatisation is a core application for many language processing tasks. In inflectionally rich languages, such as Slovene, assigning the correct lemma to each ...
Saso Dzeroski, Tomaz Erjavec
CICLING
2004
Springer
14 years 20 days ago
A Syllabification Algorithm for Spanish
This paper presents an algorithm for dividing Spanish words into syllables. This algorithm is based on grammatical rules which were translated into a simple algorithm, easy to impl...
Heriberto Cuayáhuitl