Sciweavers

ICDAR
2011
IEEE

Embedding a Mathematical OCR Module into OCRopus

12 years 11 months ago
Embedding a Mathematical OCR Module into OCRopus
—This paper describes embedding a mathematical formula recognition module into the OCR system OCRopus aiming at developing a OCR system for scientific and technical documents which include mathematical formulas. OCRopus is a open source OCR system emphasizing modularity, easy extensibility, and reuse. This system has several basic components such as preprocessing, layout analysis, and text line recognition, so it is a challenging project to embed the mathematical formula recognition module into the OCRopus system. We have developed the math OCR module, then report how to embed our module into the OCRopus system in order to realize a math OCR which can deal with wide variety of documents including mathematical formulas. Keywords-OCR; OCRopus; Mathematical formula recognition;
Shinpei Yamazaki, Fumihiro Furukori, Qinzheng Zhao
Added 24 Dec 2011
Updated 24 Dec 2011
Type Journal
Year 2011
Where ICDAR
Authors Shinpei Yamazaki, Fumihiro Furukori, Qinzheng Zhao, Keiichiro Shirai, Masayuki Okamoto
Comments (0)