Sciweavers

260 search results - page 48 / 52
» The Alignment Template Approach to Statistical Machine Trans...
COLING
2000
13 years 9 months ago
Identifying Terms by their Family and Friends
Multi-word terms are traditionally identified using statistical techniques or, more recently, using hybrid techniques combining statistics with shallow linguistic information. Al...
Diana Maynard, Sophia Ananiadou
SIGIR
2011
ACM
12 years 10 months ago
No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity
This work explores the problem of cross-lingual pairwise similarity, where the task is to extract similar pairs of documents across two different languages. Solutions to this pro...
Ferhan Ture, Tamer Elsayed, Jimmy J. Lin
ACL
2006
13 years 9 months ago
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
Named entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised d...
Alexandre Klementiev, Dan Roth
LREC
2008
13 years 9 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek
COLING
2010
13 years 2 months ago
Plagiarism Detection across Distant Language Pairs
Plagiarism, the unacknowledged reuse of text, does not end at language boundaries. Cross-language plagiarism occurs if a text is translated from a fragment written in a different ...
Alberto Barrón-Cedeño, Paolo Rosso, ...