On 29 November 2010 00:18, Michael Hope michael.hope@linaro.org wrote:
To add to the mix:
Some ideas that are logged as blueprints: Using ARMv5 saturated instructions (https://blueprints.launchpad.net/gcc-linaro/+spec/armv5-saturated-ops) Using ARMv6 SIMD instructions (https://blueprints.launchpad.net/gcc-linaro/+spec/armv6-simd)
Those are quite nice instructions; certainly they seem useful for string ops of various types if misued creatively.
Using ARMv7 unaligned accesses (https://blueprints.launchpad.net/gcc-linaro/+spec/unaligned-accesses) Changing the built-in memcpy to use unaligned (https://blueprints.launchpad.net/gcc-linaro/+spec/unaligned-memcpy)
The interesting challenge here is figuring out how expensive unaligned's are and if the cost trade offs are the same on different chips.
The following areas have been suggested. I don't know if they're still valid:
Register allocator: The register allocator is designed around the needs of architectures with a low register count and restrictive register classes. The ARM architecture has many general purpose registers. Different assumptions may give better code.
Conditional instructions: The ARM and, to a lesser extent, Thumb-2 ISAs allow conditional execution of instructions. This can be used in many situations to eliminate an expensive branch. The middle end expands and transforms branches. The ARM backend tries to recombine the RTL back into conditional instructions, but often can't due to the middle end transforms.
GCC is quite creative in avoiding branches by doing lots of masking and logic; it'll be interesting how much this has to gain.
Dave