In this paper, we propose a novel segmentation-free approach for keyword search in historical typewritten documents combining image preprocessing, synthetic data creation, word sp...
Basilios Gatos, Thomas Konidaris, Kostas Ntzios, I...
We turn to the viewpoint of users of a DAU system. Out of the view of users we sketch a picture of “Document Analysis and Understanding” (DAU), only a simple division of DAU i...
Search engine technology plays an important role in Web information retrieval. However, with Internet information explosion, traditional searching techniques cannot provide satisfa...
Baile Shi, Guoyu Hao, Hongtao Xu, Mei Wang, Qi Zha...
Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it...
: Documents such as spreadsheets are easy to create, edit, and exchange. However, their use causes a set of well known problems such as poor data quality, lack of multi user suppor...