The Internet constitutes a potential huge store of parallel text that may be collected to be exploited by many applications such as multilingual information retrieval, machine tran...
We have aligned Japanese and English news articles and sentences to make a large parallel corpus. We first used a method based on cross-language information retrieval (CLIR) to a...
We describe a word alignment platform which ensures text pre-processing (tokenization, POS-tagging, lemmatization, chunking, sentence alignment) as required by an accurate word al...
Dan Tufis, Radu Ion, Alexandru Ceausu, Dan Stefane...
We present the procedures we implemented to carry out system oriented evaluation of a syntax-based word aligner --ALIBI. We take the approach of regarding cross-corpus evaluation ...
This paper describes the work achieved in the rst half of a 4-year cooperative research project ARCADE, nanced by AUPELF-UREF. The project is devoted to the evaluation of parallel...