Some large scale topical digital libraries, such as CiteSeer, harvest online academic documents by crawling open-access archives, university and author homepages, and authors’ s...
Latency is a fundamental problem for all distributed systems including digital libraries. To reduce user perceived delays both caching – keeping accessed objects for future use â...
Large collections of scanned documents (books and journals) are now available in Digital Libraries. The most common method for retrieving relevant information from these collectio...
This paper covers a method for capturing documents using a digital camera. A typical cheap VGA digital camera (resolution 640 by 480 pixels) does not have adequate resolution to c...
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...