Sciweavers

CICLING
2010
Springer
13 years 6 months ago
A Chunk-Driven Bootstrapping Approach to Extracting Translation Patterns
Abstract. We present a linguistically-motivated sub-sentential alignment system that extends the intersected IBM Model 4 word alignments. The alignment system is chunk-driven and r...
Lieve Macken, Walter Daelemans
CICLING
2010
Springer
13 years 6 months ago
Who's the Thief? Automatic Detection of the Direction of Plagiarism
Determining the direction of plagiarism (who plagiarized whom in a given pair of documents) is one of the most interesting problems in the field of automatic plagiarism detection. ...
Cristian Grozea, Marius Popescu
CICLING
2010
Springer
13 years 6 months ago
An Empirical Study on the Feature's Type Effect on the Automatic Classification of Arabic Documents
The Arabic language is a highly flexional and morphologically very rich language. It presents serious challenges to the automatic classification of documents, one of which is deter...
Saeed Raheel, Joseph Dichy
CICLING
2010
Springer
13 years 8 months ago
Emotion Holder for Emotional Verbs - The Role of Subject and Syntax
Abstract. Human-like holder plays an important role in identifying actual emotion expressed in text. This paper presents a baseline followed by syntactic approach for capturing emo...
Dipankar Das, Sivaji Bandyopadhyay
CICLING
2010
Springer
13 years 9 months ago
A Syntactic Textual Entailment System Based on Dependency Parser
Abstract. The development of a syntactic textual entailment system that compares the dependency relations in both the text and the hypothesis has been reported. The Stanford Depend...
Partha Pakray, Alexander F. Gelbukh, Sivaji Bandyo...
CICLING
2010
Springer
13 years 9 months ago
Ontological Parsing of Encyclopedia Information
Victor Bocharov, Lidia Pivovarova, Valery Rubashki...
CICLING
2010
Springer
13 years 11 months ago
Issues in Analyzing Telugu Sentences towards Building a Telugu Treebank
This paper describes an effort towards building a Telugu Dependency Treebank. We discuss the basic framework and issues we encountered while annotating. 1487 sentences have been an...
Chaitanya Vempaty, Viswanatha Naidu, Samar Husain,...
CICLING
2010
Springer
13 years 11 months ago
A Distributional Semantics Approach to Simultaneous Recognition of Multiple Classes of Named Entities
Named Entity Recognition and Classification is being studied for last two decades. Since semantic features take huge amount of training time and are slow in inference, the existing...
Siddhartha Jonnalagadda, Robert Leaman, Trevor Coh...
CICLING
2010
Springer
14 years 1 months ago
Systematic Processing of Long Sentences in Rule Based Portuguese-Chinese Machine Translation
The translation quality and parsing efficiency are often disappointed when Rule based Machine Translation systems deal with long sentences. Due to the complicated syntactic structu...
Francisco Oliveira, Fai Wong, Iok-Sai Hong
CICLING
2010
Springer
14 years 2 months ago
ETL Ensembles for Chunking, NER and SRL
We present a new ensemble method that uses Entropy Guided Transformation Learning (ETL) as the base learner. The proposed approach, ETL Committee, combines the main ideas of Baggin...
Cícero Nogueira dos Santos, Ruy Luiz Milidi...