Bagas Sanjaya bagasdotme@gmail.com writes:
Hi,
I notice a regression report on Bugzilla [1]. Quoting from it:
Hello, we recently got a few kernel crashes with following backtrace. Happened on 6.4.12 (and 6.4.11 I think) but did not happen (I think) on 6.4.4.
[293790.928007] ------------[ cut here ]------------ [293790.929905] rq->clock_update_flags & RQCF_ACT_SKIP [293790.929919] WARNING: CPU: 13 PID: 3837105 at kernel/sched/sched.h:1561 __cfsb_csd_unthrottle+0x149/0x160 [293790.933694] Modules linked in: [...] [293790.946262] Unloaded tainted modules: edac_mce_amd(E):1 [293790.956625] CPU: 13 PID: 3837105 Comm: QueryWorker-30f Tainted: G W E 6.4.12-1.gdc.el9.x86_64 #1 [293790.957963] Hardware name: RDO OpenStack Compute/RHEL, BIOS edk2-20230301gitf80f052277c8-2.el9 03/01/2023 [293790.959681] RIP: 0010:__cfsb_csd_unthrottle+0x149/0x160
See Bugzilla for the full thread.
Anyway, I'm adding this regression to regzbot:
#regzbot introduced: ebb83d84e49b54 https://bugzilla.kernel.org/show_bug.cgi?id=217843
Thanks.
The code in question is literally "rq_lock; update_rq_clock; rq_clock_start_loop_update (the warning)", which suggests to me that RQCF_ACT_SKIP is somehow leaking from somewhere else?