Sciweavers

JODL
2000

Automatic page analysis for the creation of a digital library from newspaper archives

13 years 11 months ago
Automatic page analysis for the creation of a digital library from newspaper archives
Digital preservation of newspaper archives aims both at the salvation of endangered material (paper) and at the creation of digital library services that will allow full utilization of the archives by all interested parties. In this paper, we address a series of issues pertaining to the retro-conversion of newspapers, i.e., the conversion of newspaper pages into digital resources. An integrated approach is presented that provides solutions to problems related to newspaper page image enhancement, segmentation of pages into various items (titles, text, images etc), article identification and reconstruction, and, finally, recognition of the textual components. Emphasis is placed on the most difficult intermediate stages of page segmentation and article identification and reconstruction. Detailed experimental results, obtained from a large testbed of old newspaper issues, are presented which clearly demonstrate the applicability of our methodology to the successful retro-conversion of news...
Basilios Gatos, S. L. Mantzaris, Stavros J. Perant
Added 19 Dec 2010
Updated 19 Dec 2010
Type Journal
Year 2000
Where JODL
Authors Basilios Gatos, S. L. Mantzaris, Stavros J. Perantonis, A. Tsigris
Comments (0)