We present a system to automatically generate RSS feeds from HTML documents that consist of time-series items with date expressions, e.g., archives of weblogs, BBSs, chats, mailin...
When faced with many documents, people often use systems that characterize documents as read or unread. Most email and document management systems treat this distinction as a bina...
This paper proposes a document image binarization method, which is especially robust to the images degraded by uneven light condition, such as the camera captured document images....
This paper presents a pioneering effort towards machine authentication of security documents like bank cheques, legal deeds, certificates, etc. that fall under the same class as f...
Various approaches have been recently proposed for storing the evolution of an XML document, thereby preserving useful past information about the document and thus the ability to ...