On Fri, Aug 23, 2019 at 2:15 AM Maxim Kuvyrkov maxim.kuvyrkov@linaro.org wrote:
... The other 1/3 will require much more work -- neon intrinsics needs to be converted from inline asms to GCC builtins, so that we can attach scheduler descriptions to them.
Please do.
I know of at least two libraries that heavily rely on ARMv8 intrinsics in some areas. Performance is better than C/C++, but it is kind of flat overall. Stack Overflow also has several questions about poor intrinsics performance on ARM.
Jeff