Data on modern networks are massive and are applied in the area of monitoring and analyzing activities at the network element, network-wide, and customer and service levels for a heavily increasing number of networks since network technology is used in almost every personal computer. This results in very large log-files containing important data about the network behavior such as http accesses, e-mail headers, routing information of backbones, firewall alarms, or messages. Finding interesting patterns in network data is an important task for network analysts and managers to recognize and respond to changing conditions quickly; within minutes when possible. This situation creates new challenges in coping with scale. Firstly, the analysis of the huge amounts (usually tera-bytes) of the ever-growing network data in detail and the extraction of interesting knowledge or general characteristics about the network behavior is a very difficult task. Secondly, in practice, network data with geog...
Roland Heilmann, Daniel A. Keim, Christian Panse,