Sciweavers

321 search results - page 52 / 65
» Prefiltering techniques for efficient XML document processin...
Sort
View
ICDAR
2009
IEEE
13 years 6 months ago
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...
Ahmad Abdulkader, Mathew R. Casey
XIMEP
2004
ACM
108views Database» more  XIMEP 2004»
14 years 2 months ago
The Joy of SAX
Most current XQuery implementations require that all XML data reside in memory in one form or another before they start processing the data. This is unacceptable for large XML doc...
Leonidas Fegaras
WWW
2001
ACM
14 years 9 months ago
Towards second and third generation web-based multimedia
First generation Web-content encodes information in handwritten (HTML) Web pages. Second generation Web content generates HTML pages on demand, e.g. by filling in templates with c...
Jacco van Ossenbruggen, Joost Geurts, Frank Cornel...
CIKM
2011
Springer
12 years 8 months ago
An efficient method for using machine translation technologies in cross-language patent search
Topics in prior-art patent search are typically full patent applications and relevant items are patents often taken from sources in different languages. Cross language patent retr...
Walid Magdy, Gareth J. F. Jones
CICLING
2009
Springer
14 years 16 days ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus