== Bug fighting == * Tracked bug 774175 (apt segfault on armel on oneiric) down to the cortex-a8 branch erratum bug that we found as part of the bug jam a few weeks ago (affecting the more obscure vtk package) - Richard's existing binutils fix should fix this.
== String routines == * Struggled to get 'perf' to get sane results from profiling spec; some of the samples are obviously being associated with the wrong process somewhere along the process (e.g. it's showing significant samples in the sh process but in a library that's used by the actual benchmark.
* latrace on spec still running on ursa2
* Wrote a non-neon memcpy; as expected it's aligned performance is very similar to libc/kernel - it's a bit faster in some places but slower in some odd places (e.g. n*32+1 bytes is a lot slower for some reason). It's also really bad on mis-aligned cases, I tried to take advantage of the v7's ability to do misaligned loads - but they really are quite slow.
Dave
Can there be something using pragma option to disable neon for each function?
发自我的 iPad
在 2011-5-7,上午1:29,David Gilbert david.gilbert@linaro.org 写道:
== Bug fighting ==
- Tracked bug 774175 (apt segfault on armel on oneiric) down to the
cortex-a8 branch erratum bug that we found as part of the bug jam a few weeks ago (affecting the more obscure vtk package) - Richard's existing binutils fix should fix this.
== String routines ==
- Struggled to get 'perf' to get sane results from profiling spec;
some of the samples are obviously being associated with the wrong process somewhere along the process (e.g. it's showing significant samples in the sh process but in a library that's used by the actual benchmark.
latrace on spec still running on ursa2
Wrote a non-neon memcpy; as expected it's aligned performance is
very similar to libc/kernel - it's a bit faster in some places but slower in some odd places (e.g. n*32+1 bytes is a lot slower for some reason). It's also really bad on mis-aligned cases, I tried to take advantage of the v7's ability to do misaligned loads - but they really are quite slow.
Dave
linaro-toolchain mailing list linaro-toolchain@lists.linaro.org http://lists.linaro.org/mailman/listinfo/linaro-toolchain
On 8 May 2011 13:55, Hakehuang hakehuang@gmail.com wrote:
Can there be something using pragma option to disable neon for each function?
I don't think there is a pragma like that for ARM at the moment; Gcc does seem to have a
#pragma GCC target
and also function attributes for target options; but at the moment these are documented as only being used on x86 (where they are used to turn things like sse on and off).
What is your use case?
Dave
linaro-toolchain@lists.linaro.org