Sciweavers

IJDAR
2011

Grammar-based techniques for creating ground-truthed sketch corpora

13 years 6 months ago
Grammar-based techniques for creating ground-truthed sketch corpora
Although publicly-available, ground-truthed corpora have proven useful for training, evaluating, and comparing recognition systems in many domains, the availability of such corpora for sketch recognizers, and math recognizers in particular, is currently quite poor. This paper presents a general approach to creating large, ground-truthed corpora for structured sketch domains such as mathematics. In the approach, random sketch templates are generated automatically using a grammar model of the sketch domain. These templates are transcribed manually, then automatically annotated with ground-truth. The annotation procedure uses the generated sketch templates to nd a matching between transcribed and generated symbols. A large, ground-truthed corpus of handwritten mathematical expressions presented in the paper illustrates the utility of the approach.
Scott MacLean, George Labahn, Edward Lank, Mirette
Added 14 May 2011
Updated 14 May 2011
Type Journal
Year 2011
Where IJDAR
Authors Scott MacLean, George Labahn, Edward Lank, Mirette S. Marzouk, David Tausky
Comments (0)