On Mon, Sep 15, 2025 at 05:18:25AM +0000, K Prateek Nayak wrote:
commit cba4262a19afae21665ee242b3404bcede5a94d7 upstream.
Support for parsing the topology on AMD/Hygon processors using CPUID leaf 0xb was added in
3986a0a805e6 ("x86/CPU/AMD: Derive CPU topology from CPUID function 0xB when available").
In an effort to keep all the topology parsing bits in one place, this commit also introduced a pseudo dependency on the TOPOEXT feature to parse the CPUID leaf 0xb.
The TOPOEXT feature (CPUID 0x80000001 ECX[22]) advertises the support for Cache Properties leaf 0x8000001d and the CPUID leaf 0x8000001e EAX for "Extended APIC ID" however support for 0xb was introduced alongside the x2APIC support not only on AMD [1], but also historically on x86 [2].
The support for the 0xb leaf is expected to be confirmed by ensuring
leaf <= max supported cpuid_level
and then parsing the level 0 of the leaf to confirm EBX[15:0] (LogProcAtThisLevel) is non-zero as stated in the definition of "CPUID_Fn0000000B_EAX_x00 [Extended Topology Enumeration] (Core::X86::Cpuid::ExtTopEnumEax0)" in Processor Programming Reference (PPR) for AMD Family 19h Model 01h Rev B1 Vol1 [3] Sec. 2.1.15.1 "CPUID Instruction Functions".
This has not been a problem on baremetal platforms since support for TOPOEXT (Fam 0x15 and later) predates the support for CPUID leaf 0xb (Fam 0x17[Zen2] and later), however, for AMD guests on QEMU, the "x2apic" feature can be enabled independent of the "topoext" feature where QEMU expects topology and the initial APICID to be parsed using the CPUID leaf 0xb (especially when number of cores > 255) which is populated independent of the "topoext" feature flag.
Unconditionally call detect_extended_topology() on AMD processors to first parse the topology using the extended topology leaf 0xb before using the TOPOEXT leaf (0x8000001e).
Parsing of "DIE_TYPE" in detect_extended_topology() is specific to CPUID leaf 0x1f which is only supported on Intel platforms. Continue using the TOPOEXT leaf (0x8000001e) to derive the "cpu_die_id" on AMD platforms.
[ prateek: Adapted the fix from the original commit to stable kernel which doesn't contain the x86 topology rewrite, renamed cpu_parse_topology_ext() with the erstwhile detect_extended_topology() function in commit message, dropped references to extended topology leaf 0x80000026 which the stable kernels aren't aware of, make a note of "cpu_die_id" parsing nuances in detect_extended_topology() and why AMD processors should still rely on TOPOEXT leaf for "cpu_die_id". ]
That's a lot of changes. Why not just use a newer kernel for this new hardware? Why backport this in such a different way? That is going to cause other changes in the future to be harder to backport in the future.
What's driving the requirement to run new hardware on old kernels?
thanks,
greg k-h