Sciweavers

AIRS
2004
Springer
14 years 4 months ago
Combining Sentence Length with Location Information to Align Monolingual Parallel Texts
Abundant Chinese paraphrasing resources on the Internet can be obtained from different Chinese translations of one foreign masterpiece. A paraphrase corpus is a corpus that includes s...
Weigang Li, Ting Liu, Sheng Li
AIRS
2004
Springer
14 years 4 months ago
Effective Topic Distillation with Key Resource Pre-selection
Topic distillation aims at finding key resources, which are high-quality pages for certain topics. Based on an analysis of non-content features of key resources, a pre-selection method is ...
Yiqun Liu, Min Zhang, Shaoping Ma
AIRS
2004
Springer
14 years 5 months ago
Using Verb Dependency Matching in a Reading Comprehension System
In this paper, we describe a reading comprehension system. This system can return a sentence in a given document as the answer to a given question. This system applies bag-of-words...
Kui Xu, Helen Meng
AIRS
2004
Springer
14 years 5 months ago
Automatic Word Clustering for Text Categorization Using Global Information
This paper presents a cluster-based text categorization system which uses class distributional clustering of words. We propose a new clustering model which considers the global in...
Wenliang Chen, Xingzhi Chang, Huizhen Wang, Jingbo...
AIRS
2004
Springer
14 years 5 months ago
A Bootstrapping Approach for Geographic Named Entity Annotation
Geographic named entities can be classified into many subtypes that are useful for applications such as information extraction and question answering. In this paper, we ...
Seungwoo Lee, Gary Geunbae Lee
AIRS
2004
Springer
14 years 5 months ago
Document Clustering Using Linear Partitioning Hyperplanes and Reallocation
This paper presents a novel algorithm for document clustering based on a combinatorial framework of the Principal Direction Divisive Partitioning (PDDP) algorithm [1] and a simpli...
Canasai Kruengkrai, Virach Sornlertlamvanich, Hito...
AIRS
2004
Springer
14 years 5 months ago
On Bit-Parallel Processing of Multi-byte Text
There exist practical bit-parallel algorithms for several types of pair-wise string processing, such as longest common subsequence computation or approximate string matching. The b...
Heikki Hyyrö, Jun Takaba, Ayumi Shinohara, Ma...
AIRS
2004
Springer
14 years 5 months ago
Multilingual Relevant Sentence Detection Using Reference Corpus
IR with a reference corpus is one approach to relevant sentence detection; it takes the result of IR as the representation of a query (sentence). Lack of informatio...
Ming-Hung Hsu, Ming-Feng Tsai, Hsin-Hsi Chen
AIRS
2004
Springer
14 years 5 months ago
Improving Transliteration with Precise Alignment of Phoneme Chunks and Using Contextual Features
Automatic transliteration of foreign names is basically regarded as a diminutive clone of the machine translation (MT) problem. It thus follows IBM’s conventional MT mo...
Wei Gao, Kam-Fai Wong, Wai Lam
AIRS
2004
Springer
14 years 5 months ago
Applying CLIR Techniques to Event Tracking
Cross-lingual event tracking from a very large number of information sources (thousands of Web sites, for example) is an open challenge. In this paper we investigate effe...
Nianli Ma, Yiming Yang, Monica Rogati