Sciweavers

321 search results - page 52 / 65
» Prefiltering techniques for efficient XML document processin...
Sort
View
134
Voted
ICDAR
2009
IEEE
14 years 12 months ago
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...
Ahmad Abdulkader, Mathew R. Casey
XIMEP
2004
ACM
108views Database» more  XIMEP 2004»
15 years 7 months ago
The Joy of SAX
Most current XQuery implementations require that all XML data reside in memory in one form or another before they start processing the data. This is unacceptable for large XML doc...
Leonidas Fegaras
106
Voted
WWW
2001
ACM
16 years 2 months ago
Towards second and third generation web-based multimedia
First generation Web-content encodes information in handwritten (HTML) Web pages. Second generation Web content generates HTML pages on demand, e.g. by filling in templates with c...
Jacco van Ossenbruggen, Joost Geurts, Frank Cornel...
134
Voted
CIKM
2011
Springer
14 years 2 months ago
An efficient method for using machine translation technologies in cross-language patent search
Topics in prior-art patent search are typically full patent applications and relevant items are patents often taken from sources in different languages. Cross language patent retr...
Walid Magdy, Gareth J. F. Jones
CICLING
2009
Springer
15 years 6 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus