How much do two profiles of the same program differ? When has a profile changed enough to warrant reexamination of the profiled program? And how should two or more profiles be combined to make a better hybrid profile? To answer these questions, we borrow concepts from information theory that measure the differences between the distributions of random variables. Treating the distribution of execution frequencies in a profile like the distribution of a random variable lets us use metrics of difference between distributions and induces a mathematically well-founded way to combine distributions. For concreteness, we report results based on branch profiles, but our algorithms apply to any frequency-based profiling method. Our metric agrees in character with prior metrics while allowing us to adjust the relative importance of coverage and conflict issues. Our hybrid distributions do better than previous profile blending methods.
Serap A. Savari, Cliff Young