This paper describes a method for hiding data inside printed text documents that is resilient to print/scan and photocopying operations. Using the principle of channel coding with side information, the embedder inserts a message into a text document while treating the content of the document as known interference. The data is embedded by making small changes to text characters before the document is printed. Using a simple correction-based detector in conjuncion with an error correction code, the hidden data can be extracted from a photocopy of the printed document. By enhancing the detector with an optical character recognition algorithm, the embedded data can be extracted even after multiple rounds of photocopying. Results from subjective tests show that the changes made by the embedding algorithm, while perceptible, are not obtrusive to a lay reader. IEEE International Conference on Acoustics, Speech and Signal Processing This work may not be copied or reproduced in whole or in par...
Avinash L. Varna, Shantanu Rane, Anthony Vetro