Sciweavers

138 search results - page 7 / 28
» Data Cleaning for Word Alignment
Sort
View
SEMWEB
2007
Springer
14 years 2 months ago
Potluck: Data Mash-Up Tool for Casual Users
As more and more reusable structured data appears on the Web, casual users will want to take into their own hands the task of mashing up data rather than wait for mash-up sites to ...
David F. Huynh, Robert C. Miller, David R. Karger
LREC
2010
162views Education» more  LREC 2010»
13 years 10 months ago
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation
Given the recent trend to evaluate the performance of word sense disambiguation systems in a more application-oriented set-up, we report on the construction of a multilingual benc...
Els Lefever, Véronique Hoste
ACL
2012
11 years 10 months ago
Learning Better Rule Extraction with Translation Span Alignment
This paper presents an unsupervised approach to learning translation span alignments from parallel data that improves syntactic rule extraction by deleting spurious word alignment...
Jingbo Zhu, Tong Xiao, Chunliang Zhang
IJCNLP
2005
Springer
14 years 1 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
EMNLP
2010
13 years 6 months ago
Combining Unsupervised and Supervised Alignments for MT: An Empirical Study
Word alignment plays a central role in statistical MT (SMT) since almost all SMT systems extract translation rules from word aligned parallel training data. While most SMT systems...
Jinxi Xu, Antti-Veikko I. Rosti