A Game-based Approach to Transcribing Images of Text

We present a methodology that takes scanned documents of typed or handwritten text as input and produces transcriptions of the text as output. Instead of using OCR technology, the methodology is game-based and produces the transcriptions as a by-product of game play. The approach is intended particularly for languages for which language technology and resources are scarce and for which reliable OCR technology may not exist. It can be used in place of OCR to transcribe individual documents, or to create the corpora of paired images and transcriptions required to train OCR tools. We present Minefield, a prototype implementation of the approach, which is currently collecting Arabic transcriptions.

Khalil Dahab, Anja Belz

Education | Input Scanned Documents | LREC 2010 | OCR Technology | Reliable OCR Technology

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Khalil Dahab, Anja Belz
