Sciweavers

IAJIT
2011
13 years 2 months ago
Improving the accuracy of English-Arabic statistical sentence alignment
Multilingual natural language processing systems are increasingly relying on parallel corpus to ameliorate their output. Parallel corpora constitute the basic block for training ...
Mohammad Salameh, Rached Zantout, Nashat Mansour
CICLING
2010
Springer
13 years 2 months ago
A Chunk-Driven Bootstrapping Approach to Extracting Translation Patterns
We present a linguistically-motivated sub-sentential alignment system that extends the intersected IBM Model 4 word alignments. The alignment system is chunk-driven and r...
Lieve Macken, Walter Daelemans
CICLING
2010
Springer
13 years 2 months ago
Who's the Thief? Automatic Detection of the Direction of Plagiarism
Determining the direction of plagiarism (who plagiarized whom in a given pair of documents) is one of the most interesting problems in the field of automatic plagiarism detection. ...
Cristian Grozea, Marius Popescu
CICLING
2010
Springer
13 years 2 months ago
An Empirical Study on the Feature's Type Effect on the Automatic Classification of Arabic Documents
The Arabic language is a highly flexional and morphologically very rich language. It presents serious challenges to the automatic classification of documents, one of which is deter...
Saeed Raheel, Joseph Dichy
CORR
2011
Springer
13 years 2 months ago
Compressed String Dictionaries
The problem of storing a set of strings – a string dictionary – in compact form appears naturally in many cases. While classically it has represented a small part of the whole ...
Nieves R. Brisaboa, Rodrigo Cánovas, Miguel...
CICLING
2010
Springer
13 years 4 months ago
Emotion Holder for Emotional Verbs - The Role of Subject and Syntax
Human-like holder plays an important role in identifying actual emotion expressed in text. This paper presents a baseline followed by syntactic approach for capturing emo...
Dipankar Das, Sivaji Bandyopadhyay
IR
2010
13 years 4 months ago
FIDJI: using syntax for validating answers in multiple documents
This article presents FIDJI, a question-answering (QA) system for French. FIDJI combines syntactic information with traditional QA techniques such as named entity recognition and t...
Véronique Moriceau, Xavier Tannier
IR
2010
13 years 4 months ago
Statistical query expansion for sentence retrieval and its effects on weak and strong queries
The retrieval of sentences that are relevant to a given information need is a challenging passage retrieval task. In this context, the well-known vocabulary mismatch problem arises...
David E. Losada
IR
2010
13 years 4 months ago
Sentence-level event classification in unstructured texts
The ability to correctly classify sentences that describe events is an important task for many natural language applications such as Question Answering (QA) and Text Summarisation....
Martina Naughton, Nicola Stokes, Joe Carthy