Sciweavers

684 search results - page 32 / 137
» Vietnamese Word Segmentation
Sort
View
PVLDB
2008
136views more  PVLDB 2008»
13 years 8 months ago
Keyword query cleaning
Unlike traditional database queries, keyword queries do not adhere to predefined syntax and are often dirty with irrelevant words from natural languages. This makes accurate and e...
Ken Q. Pu, Xiaohui Yu
MLMI
2005
Springer
14 years 2 months ago
Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings
We present baseline results for the joint segmentation and classification of dialog acts (DAs) of the ICSI Meeting Corpus. Two simple approaches based on word information are inve...
Matthias Zimmermann, Yang Liu, Elizabeth Shriberg,...
COLING
1996
13 years 10 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korea that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
ACL
1998
13 years 10 months ago
Text Segmentation Using Reiteration and Collocation
A method is presented for segmenting text into subtopic areas. The proportion of related pairwise words is calculated between adjacent windows of text to determine their lexical s...
Amanda C. Jobbins, Lindsay J. Evett
PKDD
2009
Springer
150views Data Mining» more  PKDD 2009»
14 years 3 months ago
Omiotis: A Thesaurus-Based Measure of Text Relatedness
In this paper we present a new approach for measuring the relatedness between text segments, based on implicit semantic links between their words, as offered by a word thesaurus, n...
George Tsatsaronis, Iraklis Varlamis, Michalis Vaz...