Sciweavers

139 search results - page 5 / 28
» Information-Theoretic Segmentation of Natural Language
Sort
View
CICLING
2004
Springer
15 years 7 months ago
Language-Independent Methods for Compiling Monolingual Lexical Data
Abstract: In this paper we describe a flexible, portable and languageindependent infrastructure for setting up large monolingual language corpora. The approach is based on collecti...
Christian Biemann, Stefan Bordag, Gerhard Heyer, U...
148
Voted
EMNLP
2010
15 years 2 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han
EMNLP
2010
15 years 2 months ago
An Efficient Algorithm for Unsupervised Word Segmentation with Branching Entropy and MDL
This paper proposes a fast and simple unsupervised word segmentation algorithm that utilizes the local predictability of adjacent character sequences, while searching for a leaste...
Valentin Zhikov, Hiroya Takamura, Manabu Okumura
NLPRS
2001
Springer
15 years 8 months ago
Topic Segmentation : A First Stage to Dialog-Based Information Extraction
We study the problem of topic segmentation of manually transcribed speech in order to facilitate information extraction from dialogs. Our approach is based on a combination of mul...
Narjès Boufaden, Guy Lapalme, Yoshua Bengio
117
Voted
ACL
2006
15 years 5 months ago
Semantic Discourse Segmentation and Labeling for Route Instructions
In order to build a simulated robot that accepts instructions in unconstrained natural language, a corpus of 427 route instructions was collected from human subjects in the office...
Nobuyuki Shimizu