Sciweavers

IJCNLP
2005
Springer
14 years 1 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
IJCNLP
2005
Springer
14 years 1 months ago
Web-Based Unsupervised Learning for Query Formulation in Question Answering
Yi-Chia Wang, Jian-Cheng Wu, Tyne Liang, Jason S. ...
IJCNLP
2005
Springer
14 years 1 months ago
Assigning Polarity Scores to Reviews Using Machine Learning Techniques
We propose a novel type of document classification task that quantifies how much a given document (review) appreciates the target object using not binary polarity (good or bad) b...
Daisuke Okanohara, Jun-ichi Tsujii
IJCNLP
2005
Springer
14 years 1 months ago
Automatic Image Annotation Using Maximum Entropy Model
Automatic image annotation is a newly developed and promising technique to provide semantic image retrieval via text descriptions. It concerns a process of automatically labeling t...
Wei Li, Maosong Sun
IJCNLP
2005
Springer
14 years 1 months ago
Web-Based Terminology Translation Mining
Mining terminology translation from a large amount of Web data can be applied in many fields such as reading/writing assistant, machine translation and cross-language information r...
Gaolin Fang, Hao Yu, Fumihito Nishino
IJCNLP
2005
Springer
14 years 1 months ago
Classifying Chinese Texts in Two Steps
Abstract. This paper proposes a two-step method for Chinese text categorization (TC). In the first step, a Naïve Bayesian classifier is used to fix the fuzzy area between two cate...
Xinghua Fan, Maosong Sun, Key-Sun Choi, Qin Zhang
IJCNLP
2005
Springer
14 years 1 months ago
A Method of Recognizing Entity and Relation
The entity and relation recognition, i.e. (1) assigning semantic classes to entities in a sentence, and (2) determining the relations held between entities, is an important task in...
Xinghua Fan, Maosong Sun
IJCNLP
2005
Springer
14 years 1 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domain-dependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
IJCNLP
2005
Springer
14 years 1 months ago
A Case-Based Reasoning Approach for Speech Corpus Generation
Corpus-based stochastic language models have achieved significant success in speech recognition, but construction of a corpus pertaining to a specific application is a difficult ta...
Yandong Fan, Elizabeth A. Kendall