Understanding the source, data, and documentation files associated with legacy systems in preparation for maintenance or reengineering is an increasingly important problem for many organizations. A useful tool to support these activities considers the collection of files as text and supports analysis and navigation by exploiting connections that result from the occurrence of exact matches of substrings that derive from the languages used, the design process, the use of common idioms, and maintenance activity. By navigating these paths some understanding of the overall structure can be inferred. A tool for data visualization and navigation of the textual redundancy web based on the technology of HTML and the World Wide Web is
J. Howard Johnson