COLING
2002
13 years 11 months ago
Building a Large-Scale Annotated Chinese Corpus
In this paper we address issues related to building a large-scale Chinese corpus. We try to answer four questions: (i) how to speed up annotation, (ii) how to maintain high annota...
Nianwen Xue, Fu-Dong Chiou, Martha Stone Palmer
COLING
2002
13 years 11 months ago
Structure Alignment Using Bilingual Chunking
A new statistical method called "bilingual chunking" for structure alignment is proposed. Unlike existing approaches, which align hierarchical structures like...
Wei Wang, Ming Zhou, Jin-Xia Huang, Changning Huan...
COLING
2002
13 years 11 months ago
A Chart-Parsing Algorithm for Efficient Semantic Analysis
In some contexts, well-formed natural language cannot be expected as input to information or communication systems. In these contexts, the use of grammar-independent input (sequen...
Pascal Vaillant
COLING
2002
13 years 11 months ago
Combining Unsupervised and Supervised Methods for PP Attachment Disambiguation
Statistical methods for PP attachment fall into two classes according to the training material used: first, unsupervised methods trained on raw text corpora and second, supervised...
Martin Volk
COLING
2002
13 years 11 months ago
Text Generation from Keywords
We describe a method for generating sentences from "keywords" or "headwords". This method consists of two main parts, candidate-text construction and evaluatio...
Kiyotaka Uchimoto, Satoshi Sekine, Hitoshi Isahara
COLING
2002
13 years 11 months ago
Morphological Analysis of the Spontaneous Speech Corpus
This paper describes a project tagging a spontaneous speech corpus with morphological information such as word segmentation and parts-of-speech. We use a morphological analysis sys...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
COLING
2002
13 years 11 months ago
A Cheap and Fast Way to Build Useful Translation Lexicons
The paper presents a statistical approach to automatic building of translation lexicons from parallel corpora. We briefly describe the pre-processing steps, a baseline iterative m...
Dan Tufis
COLING
2002
13 years 11 months ago
Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Syllable-to-word (STW) conversion is important in Chinese phonetic input methods and speech recognition. There are two major problems in the STW conversion: (1) resolving the ambi...
Jia-Lin Tsai, Wen-Lian Hsu
COLING
2002
13 years 11 months ago
Multi-Dimensional Text Classification
This paper proposes a multi-dimensional framework for classifying text documents. In this framework, the concept of a multi-dimensional category model is introduced for representing ...
Thanaruk Theeramunkong, Verayuth Lertnattee