On Thu, Aug 25, 2011 at 09:17:59AM +0100, Richard Sandiford wrote:
Revital Eres revital.eres@linaro.org writes:
btw, do you also have numbers of how much SMS (hopefully) improves performance on top of the vectorized code?
OK, here's a comparison of:
-mcpu=cortex-a8 -mfpu=neon -mfloat-abi=softfp -mvectorize-with-neon-quad -fno-auto-inc-dec
vs:
-mcpu=cortex-a8 -mfpu=neon -mfloat-abi=softfp -mvectorize-with-neon-quad -fmodulo-sched -fmodulo-sched-allow-regmoves -fno-auto-inc-dec
(including the register-scheduling patch). As you can see, it's a bit of a mixed bag.
Hmm, a mixed bag, really? It looks like only aes and resample truly benefit..