Text documents, in electronic and hardcopy forms, are and will probably remain the most widely used kind of content in our digital age. The goal of this paper is to overview protocols for text data-hiding based “smart documents”, achieving document self-authentication, self-recovery, selfannotation and automatic processing. We argue that document security, recovery and embedded annotation are the most promising data-hiding based frameworks.