Skew detection via principal components is proposed as an e ective methodforimageswhich contain other parts than text. It is shown that the negative of the image leads to much mor...
Abstract. XML query processors suffer from main-memory limitations that prevent them from processing large XML documents. While content-based predicates can be used to project down...
We describe a new corpus collected for comparative evaluation of OCR-software and postcorrection techniques. The corpus is freely available for academic groups and use. The major ...
Stoyan Mihov, Klaus U. Schulz, Christoph Ringlstet...
A formal top down model shall be presented to aid documentation and harmonization of information security requirements. The model formalizes layered development of inn security, w...
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...