Sciweavers

EPEW
2006
Springer

Precise Regression Benchmarking with Random Effects: Improving Mono Benchmark Results

14 years 3 months ago
Precise Regression Benchmarking with Random Effects: Improving Mono Benchmark Results
Benchmarking as a method of assessing software performance is known to suffer from random fluctuations that distort the observed performance. In this paper, we focus on the fluctuations caused by compilation. We show that the design of a benchmarking experiment must reflect the existence of the fluctuations if the performance observed during the experiment is to be representative of reality. We present a new statistical model of a benchmark experiment that reflects the presence of the fluctuations in compilation, execution and measurement. The model describes the observed performance and makes it possible to calculate the optimum dimensions of the experiment that yield the best precision within a given amount of time. Using a variety of benchmarks, we evaluate the model within the context of regression benchmarking. We show that the model significantly decreases the number of erroneously detected performance changes in regression benchmarking. Key words: performance evaluation, benchma...
Tomás Kalibera, Petr Tuma
Added 22 Aug 2010
Updated 22 Aug 2010
Type Conference
Year 2006
Where EPEW
Authors Tomás Kalibera, Petr Tuma
Comments (0)