Ok, I see what went wrong. I had tested allyesconfig and allmodconfig on 5.4.y, but neither of those selects CONFIG_KVM=y for arm; axm55xx_defconfig is literally the only config (out of 109) that does.
32b ARM KVM support was ripped out in commit 541ad0150ca4 ("arm: Remove 32bit KVM host support") which landed in v5.7-rc1. So when commit 2cbd1cc3dcd3 ("ARM: 8991/1: use VFP assembler mnemonics if available") was written, arch/arm/kvm/ no longer existed. If it did, then 2cbd1cc3dcd3 would have needed something like https://gist.github.com/nickdesaulniers/980e68e9c0680fff06b1b64f2b973171. And allmodconfig/allyesconfig testing wouldn't have caught this, only testing axm55xx_defconfig would have. Before KVM support was dropped, that was the only config that explicitly enabled the config that failed.
On Mon, Mar 15, 2021 at 10:53 AM Ard Biesheuvel ardb@kernel.org wrote:
On Mon, 15 Mar 2021 at 18:43, Nick Desaulniers ndesaulniers@google.com wrote:
f77ac2e378be doesn't apply cleanly to linux-5.4.y. There's a conflict in arch/arm/vfp/vfphw.S due to 5.4 missing commit 2cbd1cc3dcd3 ("ARM: 8991/1: use VFP assembler mnemonics if available") which is one of the patches I sent in the 5.4 series in this thread. That was 1 of a 3 patch series: https://lore.kernel.org/linux-arm-kernel/cover.1593205699.git.stefan@agner.c...
Should I separate out just those 3 and f77ac2e378be and send that for 5.4, or manually backport just f77ac2e378be and note in the commit message the conflict?
I assume we still want a fix for THUMB2, so I'll send a backport of just f77ac2e378be modified to note that there was a fixup against 2cbd1cc3dcd3, since 2cbd1cc3dcd3 is problematic for CONFIG_KVM=y on 5.4.
You haven't explained why all this effort is justified to begin with.
Who cares about being able to build 4.19 or 5.4 mainline with Clang 12 and IAS?
Ah, sorry, ChromeOS and Android very much do so. (Google's production kernels as well, though I don't think they have any 32b ARM machines). Android is already building 4.19+ with LLVM_IAS=1 for ARCH=arm64,x86_64,i686. ChromeOS is doing so for 5.4+ for ARCH=arm64,x86_64 as well. I'm not sure precisely what's going on in prodkernel land, but I know they have LLVM_IAS=1 enabled for x86_64. So when Greg says this is "for no real users" I disagree. Maybe no one is using LLVM_IAS=1 for ARCH=arm at this moment, but that was the point of the backports, to enable more distros to do so.
Stable has already accepted patches to 4.19+ for these architectures where it was made explicit that this was for LLVM_IAS=1 support. https://lore.kernel.org/stable/CAKwvOdk_U6SEwOC-ykaVTMu1ZmEjWC8cCiTetvU2k2dQ... https://lore.kernel.org/stable/CAKwvOd=F_wWLxhnV3J8jx1L3SXPd8NFYyOKzAh7rL0iR... https://lore.kernel.org/stable/CAKwvOdmEcjjw78K0Avj-7s5BBXcT7ARhEMMEYqpCP-ZT... https://lore.kernel.org/stable/CAKwvOdnGDHn+Y+g5AsKvOFiuF7iVAJ8+x53SgWxH9ejq... https://lore.kernel.org/stable/CAKwvOdkK1LgLC4ChptzUTC45WvE9-Sn0OqtgF7-odNSw... https://lore.kernel.org/stable/CAKwvOd=x+fVo1_mMJUGHYXpmGf8UM5yx+uWD-Ci=y=0o... https://lore.kernel.org/stable/CAKwvOdn78WAUiRtyPxW7oEhUz8GN6MkL=Jt+n17jEQXP...
Now it's just down to ARM and THUMB2 support. Then we will be using a similar toolchain regardless of ISA. We will also then have an evergreen toolchain, rather than one that will not be receiving future updates and is unsupported (which becomes a problem when folks need new things and is a liability to be removed), and more wood behind fewer arrows so we can focus on starting on the feature requests we have piling up (like kernel GCC plugin-like features in LLVM, like a code model for the kernel, etc).
This has been communicated to Android OEMs (https://android.googlesource.com/platform/prebuilts/clang/host/linux-x86/+/m...) for them to help test and report issues and likely will also happen for the next release of the NDK (https://github.com/android/ndk/wiki/Changelog-r23#announcements).
I am aware that Clang enablement is a prerequisite for CFI and LTO etcetera, and so I am fully on board with this activity for current and future kernels.
LTO does require LLVM_IAS=1 at the moment; there are no plans I know of yet to port LTO to ARCH=arm, but maybe if Sami is bored? :P
Stable kernels are a different matter, though. I tend to get stable-kernel-rules.rst thrown in my face for proposing backports that aren't nearly as large or intrusive as this stuff, but for some reason, those rules do not seem to apply here.
I understand. I'm also balancing shipping patches for toolchain support out of tree, vs upstreaming. Everything so far has been upstreamed, but 32b ARM support has been...more involved. But I'm hopeful that we will be able to expand our staff soon to better improve that.
I think all of these patches would be useful to CrOS, so my plan was to send the series upstream. We can keep it downstream, where the number of supported configurations and toolchains is more limited. 4.19+ is of interest for new Android devices this year, but 5.4 in particular will live much longer, so we will have to carry the divergence for longer. I think some of the strictly UAL related changes are relatively lower risk.
So my suggestion would be to forget about 4.19 and 5.4 entirely for these changes, unless there is an obvious benefit to all consumers of these stable trees. Otherwise, exposing them to ongoing breakage like this is indefensible IMHO.