Research and development of information access technology for scanned paper documents has been hampered by the lack of public test collections of realistic scope and complexity. As part of a project to create a prototype system for search and mining of
David D. Lewis, Gady Agam, Shlomo Argamon, Ophir F