Sciweavers

107 search results - page 5 / 22
» Evaluation of Alignment Methods for HTML Parallel Text
Sort
View
ISM
2006
IEEE
209views Multimedia» more  ISM 2006»
14 years 1 months ago
Holistic Comparison of Text Images for Content-Based Retrieval
The accurate recognition of text that appears in images/videos using analytical character recognition methods is often very difficult, despite the fact that the text might be corr...
Julinda Gllavata, Ermir Qeli, Bernd Freisleben
EMNLP
2007
13 years 9 months ago
Improving Word Alignment with Bridge Languages
We describe an approach to improve Statistical Machine Translation (SMT) performance using multi-lingual, parallel, sentence-aligned corpora in several bridge languages. Our appro...
Shankar Kumar, Franz Josef Och, Wolfgang Macherey
COLING
2010
13 years 2 months ago
Automatic analysis of semantic similarity in comparable text through syntactic tree matching
We propose to analyse semantic similarity in comparable text by matching syntactic trees and labeling the alignments according to one of five semantic similarity relations. We pre...
Erwin Marsi, Emiel Krahmer
JCDL
2006
ACM
167views Education» more  JCDL 2006»
14 years 1 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
CICLING
2009
Springer
13 years 11 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus