COLING
2002
13 years 11 months ago
A Robust Cross-Style Bilingual Sentences Alignment Model
Most current sentence alignment approaches adopt sentence length and cognates as the alignment features, and they are mostly trained and tested on documents of the same style...
Tz-Liang Kueng, Keh-Yih Su
COLING
2002
13 years 11 months ago
Linking Syntactic and Semantic Arguments in a Dependency-based Formalism
We propose a formal characterization of variation in the syntactic realization of semantic arguments, using hierarchies of syntactic relations and thematic roles, and a mechanism ...
Christian Korthals, Ralph Debusmann
COLING
2002
13 years 11 months ago
Determining Recurrent Sound Correspondences by Inducing Translation Models
I present a novel approach to the determination of recurrent sound correspondences in bilingual wordlists. The idea is to relate correspondences between sounds in wordlists to tra...
Grzegorz Kondrak
COLING
2002
13 years 11 months ago
Automatic Text Categorization using the Importance of Sentences
Automatic text categorization is the problem of automatically assigning text documents to predefined categories. In order to classify text documents, we must extract good features f...
Youngjoong Ko, Jinwoo Park, Jungyun Seo
COLING
2002
13 years 11 months ago
Robust Interpretation of User Requests for Text Retrieval in a Multimodal Environment
We describe a parser for robust and flexible interpretation of user utterances in a multi-modal system for web search in newspaper databases. Users can speak or type, and they can...
Alexandra Klein, Estela Puig-Waldmüller, Hara...
COLING
2002
13 years 11 months ago
"Dialog Navigator": A Question Answering System Based on Large Text Knowledge Base
This paper describes a dialog-based QA system, Dialog Navigator, which can answer questions based on a large text knowledge base. In real-world QA systems, vagueness of questions is...
Yoji Kiyota, Sadao Kurohashi, Fuyuko Kido
COLING
2002
13 years 11 months ago
Scaled Log Likelihood Ratios for the Detection of Abbreviations in Text Corpora
We describe a language-independent, flexible, and accurate method for the detection of abbreviations in text corpora. It is based on the idea that an abbreviation can be viewed as...
Tibor Kiss, Jan Strunk
COLING
2002
13 years 11 months ago
Unsupervised Named Entity Classification Models and their Ensembles
This paper proposes an unsupervised learning model for classifying named entities. This model uses a training set, built automatically by means of a small-scale named entity dicti...
Jae-Ho Kim, In-Ho Kang, Key-Sun Choi
COLING
2002
13 years 11 months ago
A Comparative Evaluation of Data-driven Models in Translation Selection of Machine Translation
We present a comparative evaluation of two data-driven models used in translation selection of English-Korean machine translation. Latent semantic analysis (LSA) and probabilistic ...
Yuseop Kim, Jeong Ho Chang, Byoung-Tak Zhang
COLING
2002
13 years 11 months ago
A Novel Disambiguation Method for Unification-Based Grammars Using Probabilistic Context-Free Approximations
We present a novel disambiguation method for unification-based grammars (UBGs). In contrast to other methods, our approach obviates the need for probability models on the UBG side...
Bernd Kiefer, Hans-Ulrich Krieger, Detlef Prescher