For getting reproducible benchmark results, you just need to ensurethat thermal throttling never kicks in. If the kernel is compiled
with cpufreq stats enabled, you can compare these stats before/after
your benchmark to ensure that it spent all the time running at the
same designated clock frequency.
Anyway, I recommend you to start the tests for the hardware
robustness with:
wget https://raw.github.com/ssvb/cpuburn/master/cpuburn-a9.S
arm-linux-gnueabihf-gcc -o cpuburn-a9 cpuburn-a9.S