#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable: - either the laptop freezes; - either the screen keeps blank; - either the screen is OK but mouse is frozen; - either display lags with several logs in dmesg: [ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?i...
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
For informatio, here is a part of the log around the suspend process:
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices failed May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03: Transfer while suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at drivers/i2c/ busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff ff e9 2d 9c 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ? newidle_balance.constprop.0+0x1f7/0x3b0 May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ? acpi_subsys_resume_early+0x50/0x50 May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: failed to change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM: failed to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel: [drm:amdgpu_device_ip_resume_phase2] *ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed to resume async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command 0xfc20 tx timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw command failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in /etc/dnsmasq.d/ dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:18 geek500 kernel: [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices failed May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03: Transfer while suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at drivers/i2c/ busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff ff e9 2d 9c 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ? newidle_balance.constprop.0+0x1f7/0x3b0 May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ? acpi_subsys_resume_early+0x50/0x50 May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: failed to change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM: failed to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel: [drm:amdgpu_device_ip_resume_phase2] *ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed to resume async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command 0xfc20 tx timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw command failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in /etc/dnsmasq.d/ dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:18 geek500 kernel: [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
On 5/15/22 21:47, Kai-Heng Feng wrote:
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend process:
Is it possible to attach full dmesg?
This is AMD APU and NVIDIA dGPU? Or is this A+A?
I notice in the log snippet here this message:
May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power
consumption
will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup.
This means that you are manually picking s2idle but the system was designed for using S3. Did you manually choose s2idle?
Would you mind bringing all the details of this regression over to a bug report at https://gitlab.freedesktop.org/drm/amd/-/issues/ instead?
I would like to see: * acpidump * full dmesg from the working scenario and * full dmesg of failing scenario (including failure if possible)
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices failed May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03: Transfer while suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at drivers/i2c/ busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff ff e9 2d 9c 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ? newidle_balance.constprop.0+0x1f7/0x3b0 May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ? acpi_subsys_resume_early+0x50/0x50 May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: failed to change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM: failed to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel: [drm:amdgpu_device_ip_resume_phase2] *ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed to resume async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command 0xfc20 tx timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw command failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in /etc/dnsmasq.d/ dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:18 geek500 kernel: [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitlab.fre...
And a one related to it: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitlab.fre...
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.kernel... d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch have all commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to the point to get a console for now).
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices failed May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03: Transfer while suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at drivers/i2c/ busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff ff e9 2d 9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ? newidle_balance.constprop.0+0x1f7/0x3b0 May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ? acpi_subsys_resume_early+0x50/0x50 May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: failed to change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM: failed to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel: [drm:amdgpu_device_ip_resume_phase2] *ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed to resume async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command 0xfc20 tx timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw command failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in /etc/dnsmasq.d/ dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:18 geek500 kernel: [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch have all commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to the point to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices failed May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03: Transfer while suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at drivers/i2c/ busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff ff e9 2d 9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ? newidle_balance.constprop.0+0x1f7/0x3b0 May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ? acpi_subsys_resume_early+0x50/0x50 May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: failed to change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM: failed to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel: [drm:amdgpu_device_ip_resume_phase2] *ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed to resume async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44 00 00 48 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d 87 c4 ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48 0f a3 05 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d50 EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:41 geek500 kernel: ------------[ cut here ]------------ May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at drivers/clk/ clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last unloaded: acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion Gaming Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP: 0010:clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48 85 db 74 a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35 87 c4 ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f a3 05 ea 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc1c47d60 EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel: acpi_subsys_runtime_suspend+0x9/0x20 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ? kthread_complete_and_exit+0x20/0x20 May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000 ]--- May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: Power consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in BIOS setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't change power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no GSI May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command 0xfc20 tx timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw command failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in /etc/dnsmasq.d/ dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer expired on ring gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0 May 14 19:22:18 geek500 kernel: [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer expired on ring sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was suspected as the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might be related to your problem (I'm not directly involved in any of this, I don't know the details, it's just that 887f75cfd0da looked familiar to me). If it is, a fix for these two bugs was committed to master earlier this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the over-next release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore. At resume, symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR* Waiting for fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring sdma0 [ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never been fixed and plagues us when trying to diagnose another regression on touchpad not resuming in the past). The screen goes blank and I can get it OK when pressing the power button, this seems to reset it. After that all suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop working again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated + NVidia 1650Ti (turned off with ACPI call in order to get more battery, I'm not using NVidia driver).
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for
the report.
On 14.05.22 16:41, Christian Casteyde wrote:
#regzbot introduced v5.17.3..v5.17.4 #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was
suspected as
the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might
be
related to your problem (I'm not directly involved in any of
this, I
don't know the details, it's just that 887f75cfd0da looked
familiar to
me). If it is, a fix for these two bugs was committed to master
earlier
this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the
over-next
release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a
lot of
reports and sometimes miss something important when writing
mails like
this. If that's the case here, don't hesitate to tell me in a
public
reply, it's in everyone's interest to set the public record
straight.
Hello Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
symptoms are variable:
- either the laptop freezes;
- either the screen keeps blank;
- either the screen is OK but mouse is frozen;
- either display lags with several logs in dmesg:
[ 228.275492] [drm] Fence fallback timer expired on ring gfx [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
fences timed out! [ 228.779490] [drm] Fence fallback timer expired on ring gfx [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
[ 229.283485] [drm] Fence fallback timer expired on ring gfx [ 229.787487] [drm] Fence fallback timer expired on ring gfx ...
I've bisected the problem.
Please note this laptop has a strange behaviour on suspend: The first suspend request always fails (this point has never
been
fixed and plagues us when trying to diagnose another regression on
touchpad not
resuming in the past). The screen goes blank and I can get it
OK when
pressing the power button, this seems to reset it. After that
all
suspend/resume works OK.
Since 5.17.4, it is not possible anymore to get the laptop
working
again after the first suspend failure.
HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
1650Ti (turned off with ACPI call in order to get more battery, I'm
not using
NVidia driver).
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for
the report.
On 14.05.22 16:41, Christian Casteyde wrote: > #regzbot introduced v5.17.3..v5.17.4 > #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was
suspected as
the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might
be
related to your problem (I'm not directly involved in any of
this, I
don't know the details, it's just that 887f75cfd0da looked
familiar to
me). If it is, a fix for these two bugs was committed to master
earlier
this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the
over-next
release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a
lot of
reports and sometimes miss something important when writing
mails like
this. If that's the case here, don't hesitate to tell me in a
public
reply, it's in everyone's interest to set the public record
straight.
> Hello > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> symptoms are variable: > - either the laptop freezes; > - either the screen keeps blank; > - either the screen is OK but mouse is frozen; > - either display lags with several logs in dmesg: > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> fences timed out! > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> [ 229.283485] [drm] Fence fallback timer expired on ring gfx > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > ... > > I've bisected the problem. > > Please note this laptop has a strange behaviour on suspend: > The first suspend request always fails (this point has never
been
> fixed > and > plagues us when trying to diagnose another regression on
touchpad not
> resuming in the past). The screen goes blank and I can get it
OK when
> pressing the power button, this seems to reset it. After that
all
> suspend/resume works OK. > > Since 5.17.4, it is not possible anymore to get the laptop
working
> again > after the first suspend failure. > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> 1650Ti > (turned off with ACPI call in order to get more battery, I'm
not using
> NVidia driver).
Hello
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
Following are dmesg for 5.18-rc7: - when it fails (dmesg-bad.txt); - when it works (dmesg-good.txt).
CC
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for
the report.
On 14.05.22 16:41, Christian Casteyde wrote: > #regzbot introduced v5.17.3..v5.17.4 > #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was
suspected as
the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might
be
related to your problem (I'm not directly involved in any of
this, I
don't know the details, it's just that 887f75cfd0da looked
familiar to
me). If it is, a fix for these two bugs was committed to master
earlier
this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the
over-next
release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a
lot of
reports and sometimes miss something important when writing
mails like
this. If that's the case here, don't hesitate to tell me in a
public
reply, it's in everyone's interest to set the public record
straight.
> Hello > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> symptoms are variable: > - either the laptop freezes; > - either the screen keeps blank; > - either the screen is OK but mouse is frozen; > - either display lags with several logs in dmesg: > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> fences timed out! > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> [ 229.283485] [drm] Fence fallback timer expired on ring gfx > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > ... > > I've bisected the problem. > > Please note this laptop has a strange behaviour on suspend: > The first suspend request always fails (this point has never
been
> fixed > and > plagues us when trying to diagnose another regression on
touchpad not
> resuming in the past). The screen goes blank and I can get it
OK when
> pressing the power button, this seems to reset it. After that
all
> suspend/resume works OK. > > Since 5.17.4, it is not possible anymore to get the laptop
working
> again > after the first suspend failure. > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> 1650Ti > (turned off with ACPI call in order to get more battery, I'm
not using
> NVidia driver).
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit. The suspend on the system doesn't work properly at the first place.
Kai-Heng
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
[AMD Official Use Only - General]
-----Original Message----- From: Thorsten Leemhuis regressions@leemhuis.info Sent: Wednesday, May 18, 2022 01:37 To: Kai-Heng Feng kai.heng.feng@canonical.com Cc: casteyde.christian@free.fr; stable@vger.kernel.org; regressions@lists.linux.dev; Deucher, Alexander Alexander.Deucher@amd.com; gregkh@linuxfoundation.org; Limonciello, Mario Mario.Limonciello@amd.com Subject: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"),
and
the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
I think the right way to focus on this is to fix the original suspend issue. The fact that the first suspend is failing with s3 should be a red flag that the system is in a pretty bad state.
Maybe can we get /sys/power/pm_debug_messages turned on as well as /sys/power/pm_print_times. Then we should have a better idea on what is going on that triggers that first failure.
Again, it would be much better to put all this in a bug report somewhere. It's really hard to associate dmesgs in a threaded email with what's going on. Kernel Bugzilla, AMD's Gitlab, it doesn't matter where really. Anywhere is better than email threads IMO.
In this case the revert would causes problems for the resume of any dGPU. So it's a tradeoff of many dGPU resume failures vs one APU resume failure in s2idle after a failed suspend in s3.
BTW - I'm not really sure why the system is picking s2idle for "the second try". Is that the kernel doing this, or is this userspace causing it? We really shouldn't be seeing different suspend modes across attempts without a user consciously selecting one.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of the change in the test from: if (!adev->in_s0ix) to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as was intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be in suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like this and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev); + int r;
if (amdgpu_acpi_is_s0ix_active(adev)) adev->in_s0ix = true; else adev->in_s3 = true; - return amdgpu_device_suspend(drm_dev, true); + r = amdgpu_device_suspend(drm_dev, true); + if (r) + return r; + if (!adev->in_s0ix) + return amdgpu_asic_reset(adev); return 0; }
static int amdgpu_pmops_suspend_noirq(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev);
return 0; }
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
[Public]
-----Original Message----- From: Christian Casteyde casteyde.christian@free.fr Sent: Monday, May 23, 2022 08:03 To: Kai-Heng Feng kai.heng.feng@canonical.com; Thorsten Leemhuis regressions@leemhuis.info Cc: stable@vger.kernel.org; regressions@lists.linux.dev; Deucher, Alexander Alexander.Deucher@amd.com; gregkh@linuxfoundation.org; Limonciello, Mario Mario.Limonciello@amd.com Subject: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of the change in the test from: if (!adev->in_s0ix) to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as was intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be in suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like this and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
int r;
if (amdgpu_acpi_is_s0ix_active(adev)) adev->in_s0ix = true; else adev->in_s3 = true;
- return amdgpu_device_suspend(drm_dev, true);
- r = amdgpu_device_suspend(drm_dev, true);
- if (r)
return r;
- if (!adev->in_s0ix)
return 0;return amdgpu_asic_reset(adev);
}
static int amdgpu_pmops_suspend_noirq(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev);
return 0; }
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
The core of this problem is that your first suspend fails and the GPU is in a bad state for the next suspend.
I'm not sure why you're ignoring my other emails, but I did suggest a different approach in this thread here:
https://patchwork.freedesktop.org/patch/486836/
Thanks,
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
I've opened the gitlab entry for this discussion: https://gitlab.freedesktop.org/drm/amd/-/issues/2023
I confirm I 'm not receiving mails anymore from the mailing list but I'll follow gitlab.
In reply to the patch proposed by Mario: https://patchwork.freedesktop.org/patch/486836/ With this patch applied on vanilla 5.18 kernel: - suspend still fails; - after suspend attempt, the screen comes back with only the cursor; - switching to a console let me get the following dmesg file.
CC
Le lundi 23 mai 2022, 15:02:53 CEST Christian Casteyde a écrit :
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of the change in the test from: if (!adev->in_s0ix) to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as was intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be in suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like this and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
int r;
if (amdgpu_acpi_is_s0ix_active(adev)) adev->in_s0ix = true; else adev->in_s3 = true;
- return amdgpu_device_suspend(drm_dev, true);
- r = amdgpu_device_suspend(drm_dev, true);
- if (r)
return r;
- if (!adev->in_s0ix)
return 0;return amdgpu_asic_reset(adev);
}
static int amdgpu_pmops_suspend_noirq(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev);
return 0; }
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
[Public]
-----Original Message----- From: Christian Casteyde casteyde.christian@free.fr Sent: Monday, May 23, 2022 12:03 To: Kai-Heng Feng kai.heng.feng@canonical.com; Thorsten Leemhuis regressions@leemhuis.info Cc: stable@vger.kernel.org; regressions@lists.linux.dev; Deucher, Alexander Alexander.Deucher@amd.com; gregkh@linuxfoundation.org; Limonciello, Mario Mario.Limonciello@amd.com Subject: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
I've opened the gitlab entry for this discussion: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla b.freedesktop.org%2Fdrm%2Famd%2F- %2Fissues%2F2023&data=05%7C01%7Cmario.limonciello%40amd.com% 7C7a9cf928dd1e491f0c2c08da3cde2e21%7C3dd8961fe4884e608e11a82d994e 183d%7C0%7C0%7C637889222210477502%7CUnknown%7CTWFpbGZsb3d8ey JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D% 7C3000%7C%7C%7C&sdata=WnQwbF8J4j2F69GINpz49Zg5Qg0tpVCmUG i1FjXrCu4%3D&reserved=0
I confirm I 'm not receiving mails anymore from the mailing list but I'll follow gitlab.
In reply to the patch proposed by Mario: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fpatc hwork.freedesktop.org%2Fpatch%2F486836%2F&data=05%7C01%7Cma rio.limonciello%40amd.com%7C7a9cf928dd1e491f0c2c08da3cde2e21%7C3dd 8961fe4884e608e11a82d994e183d%7C0%7C0%7C637889222210477502%7CUn known%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6 Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=FuhEnUyC8nv ycpaGr4yfBbGpuXHXoKqImBTQ0PL8caY%3D&reserved=0 With this patch applied on vanilla 5.18 kernel:
- suspend still fails;
- after suspend attempt, the screen comes back with only the cursor;
- switching to a console let me get the following dmesg file.
I spent some time with Christian today on that Gitlab issue, and want to update this audience to a few things.
1) The first suspend failure is caused by something wrong with acpi_tad driver on his system. Blacklisting the driver everything works properly.
2) Failing the first time with deep and trying s2idle instead is caused by something in his userspace (unknown right now).
3) This is not a regression IMO. The GPU was in a bad state from the problem caused by acpi-tad. We introduced poking around and resetting the GPU to help with aborted suspends, but they've led to a pile of "Oh but not this case", fix this ordering problem, deal with this repercussion. We shouldn't be dropping any of those incremental solutions to deal with a path like this; the direction should be for fixing acpi_tad or whatever userspace is using it incorrectly on Christian's system.
CC
Le lundi 23 mai 2022, 15:02:53 CEST Christian Casteyde a écrit :
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of the change in the test from: if (!adev->in_s0ix) to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as
was
intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be
in
suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like
this
and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
int r;
if (amdgpu_acpi_is_s0ix_active(adev)) adev->in_s0ix = true; else adev->in_s3 = true;
- return amdgpu_device_suspend(drm_dev, true);
- r = amdgpu_device_suspend(drm_dev, true);
- if (r)
return r;
- if (!adev->in_s0ix)
return 0;return amdgpu_asic_reset(adev);
}
static int amdgpu_pmops_suspend_noirq(struct device *dev) { struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev);
return 0; }
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC
reset"),
and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails
like
this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
Conclusion from gitlab discussion
There are three different problems at stake here:
1. First suspend failure after boot is due to TAD driver. When it is loaded, first suspend deep fails. This is the root cause of the following sequence. TAD seems to be not completely functionnal on my laptop, sysfs returns IO errors (missing handlers in dmesg). After the first failure, every "deep" attempt work.
2. When suspend deep fails, elogind (used by Slackware 15) falls back to s2idle. This behaviour is documented in logind.conf man page, and normaly can be configured (side note: I didn't manage to do so, it ignores my configuration file).
3. When suspend to s2idle, the GPU fails to suspend and need a reset. This reset must be done in pm_suspend (not totally ok if reset at resume). However, the laptop indeed goes into s2idle. In this state, the power button awakes it: this part is handled by the BIOS and not the distribution (which shut downs if not suspended). This is what doesn't work anymore un 5.17.4 and 5.18.
The root cause is therefore the ACPI TAD preventing the first deep suspend to complete, then elogind asking for a s2idle in fallback, then s2idle leaving the APU in inconsistent state, that can only be fixed by a reset in pm_suspend, and not pm_suspend_noirq or pm_suspend_late.
I will open a separate bug for the ACPI TAD problem. For now I will run without this driver, as deep suspend works fine in this case and s2idle is therefore useless for me.
CC
Le lundi 23 mai 2022, 19:03:27 CEST Christian Casteyde a écrit :
I've opened the gitlab entry for this discussion: https://gitlab.freedesktop.org/drm/amd/-/issues/2023
I confirm I 'm not receiving mails anymore from the mailing list but I'll follow gitlab.
In reply to the patch proposed by Mario: https://patchwork.freedesktop.org/patch/486836/ With this patch applied on vanilla 5.18 kernel:
- suspend still fails;
- after suspend attempt, the screen comes back with only the cursor;
- switching to a console let me get the following dmesg file.
CC
Le lundi 23 mai 2022, 15:02:53 CEST Christian Casteyde a écrit :
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of
the change in the test from: if (!adev->in_s0ix)
to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as was intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be in suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like this and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) {
struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
int r;
if (amdgpu_acpi_is_s0ix_active(adev))
adev->in_s0ix = true;
else
adev->in_s3 = true;
- return amdgpu_device_suspend(drm_dev, true);
r = amdgpu_device_suspend(drm_dev, true);
if (r)
return r;
if (!adev->in_s0ix)
return amdgpu_asic_reset(adev);
return 0;
}
static int amdgpu_pmops_suspend_noirq(struct device *dev) {
struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev); if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev); return 0;
}
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote:
I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), and the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
On 24.05.22 22:54, Christian Casteyde wrote:
Conclusion from gitlab discussion
Christian, Mario, thx for the updates. I'll remove the issue from the list of tracked regressions then:
#regzbot invalid: tricky situation with other problems; the issue thus not really qualifies as regression
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
There are three different problems at stake here:
- First suspend failure after boot is due to TAD driver.
When it is loaded, first suspend deep fails. This is the root cause of the following sequence. TAD seems to be not completely functionnal on my laptop, sysfs returns IO errors (missing handlers in dmesg). After the first failure, every "deep" attempt work.
- When suspend deep fails, elogind (used by Slackware 15) falls back to
s2idle. This behaviour is documented in logind.conf man page, and normaly can be configured (side note: I didn't manage to do so, it ignores my configuration file).
- When suspend to s2idle, the GPU fails to suspend and need a reset.
This reset must be done in pm_suspend (not totally ok if reset at resume). However, the laptop indeed goes into s2idle. In this state, the power button awakes it: this part is handled by the BIOS and not the distribution (which shut downs if not suspended). This is what doesn't work anymore un 5.17.4 and 5.18.
The root cause is therefore the ACPI TAD preventing the first deep suspend to complete, then elogind asking for a s2idle in fallback, then s2idle leaving the APU in inconsistent state, that can only be fixed by a reset in pm_suspend, and not pm_suspend_noirq or pm_suspend_late.
I will open a separate bug for the ACPI TAD problem. For now I will run without this driver, as deep suspend works fine in this case and s2idle is therefore useless for me.
CC
Le lundi 23 mai 2022, 19:03:27 CEST Christian Casteyde a écrit :
I've opened the gitlab entry for this discussion: https://gitlab.freedesktop.org/drm/amd/-/issues/2023
I confirm I 'm not receiving mails anymore from the mailing list but I'll follow gitlab.
In reply to the patch proposed by Mario: https://patchwork.freedesktop.org/patch/486836/ With this patch applied on vanilla 5.18 kernel:
- suspend still fails;
- after suspend attempt, the screen comes back with only the cursor;
- switching to a console let me get the following dmesg file.
CC
Le lundi 23 mai 2022, 15:02:53 CEST Christian Casteyde a écrit :
Hello
I've checked with 5.18 the problem is still there. Interestingly, I tried to revert the commit but it was rejected because of
the change in the test from: if (!adev->in_s0ix)
to: if (amdgpu_acpi_should_gpu_reset(adev))
in amdgpu_pmops_suspend.
I fixed the rejection, keeping shoud_gpu_reset, but it still fails. Then I changed to restore test of in_s0ix as it was in 5.17, and it works. I tried with a call to amd_gpu_asic_reset without testing at all in_s0ix, it works.
Therefore, my APU wants a reset in amdgpu_pmops_suspend.
By curiosity, I tried to do the reset in amdgpu_pmops_suspend_noirq as was intended in 5.18 original code, commenting out the test of amdgpu_acpi_should_gpu_reset(adev) (since this APU wants a reset). This does not work, I got the Fence timeout errors or freezes.
If I leave noirq function unchanged (original 5.18 code), and just add a reset in suspend() as was done in 5.17, it works.
Therefore, my GPU does NOT want to be reset in noirq, the reset must be in suspend.
In other words, I modified amdgpu_pmops_suspend (partial revert) like this and this works on my laptop:
static int amdgpu_pmops_suspend(struct device *dev) {
struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev);
int r;
if (amdgpu_acpi_is_s0ix_active(adev))
adev->in_s0ix = true;
else
adev->in_s3 = true;
- return amdgpu_device_suspend(drm_dev, true);
r = amdgpu_device_suspend(drm_dev, true);
if (r)
return r;
if (!adev->in_s0ix)
return amdgpu_asic_reset(adev);
return 0;
}
static int amdgpu_pmops_suspend_noirq(struct device *dev) {
struct drm_device *drm_dev = dev_get_drvdata(dev); struct amdgpu_device *adev = drm_to_adev(drm_dev); if (amdgpu_acpi_should_gpu_reset(adev)) return amdgpu_asic_reset(adev); return 0;
}
I don't know if other APU want a reset, in the same context, and how to differentiate all the cases, so I cannot go further, but I can test patches if needed.
CC
Le mercredi 18 mai 2022, 08:37:27 CEST Thorsten Leemhuis a écrit :
On 18.05.22 07:54, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:52 PM Thorsten Leemhuis
regressions@leemhuis.info wrote:
On 17.05.22 19:37, casteyde.christian@free.fr wrote: > I've tryied to revert the offending commit on 5.18-rc7 (887f75cfd0da > ("drm/amdgpu: Ensure HDA function is suspended before ASIC reset"), > and > the problem disappears so it's really this commit that breaks.
In that case I'll update the regzbot status to make sure it's visible as regression introduced in the 5.18 cycle:
#regzbot introduced: 887f75cfd0da
BTW: obviously would be nice to get this fixed before 5.18 is released (which might already happen on Sunday), especially as the culprit apparently was already backported to stable, but I guess that won't be easy...
Which made me wondering: is reverting the culprit temporarily in mainline (and reapplying it later with a fix) a option here?
It's too soon to call it's the culprit.
Well, sure, the root-cause might be somewhere else. But from the point of kernel regressions (and tracking them) it's the culprit, as that's the change that triggers the misbehavior. And that's how Linus approaches these things as well when it comes to reverting to fix regressions -- and he even might...
The suspend on the system doesn't work properly at the first place.
...ignore things like this, as long as a revert is unlikely to cause more damage than good.
Ciao. Thorsten
Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a lot of reports and sometimes miss something important when writing mails like this. If that's the case here, don't hesitate to tell me in a public reply, it's in everyone's interest to set the public record straight.
dmesg logs
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit :
Hi, this is your Linux kernel regression tracker. Thanks for
the report.
On 14.05.22 16:41, Christian Casteyde wrote: > #regzbot introduced v5.17.3..v5.17.4 > #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9
FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
suspended before ASIC reset") upstream.
Recently a regression was reported where 887f75cfd0da was
suspected as
the culprit: https://gitlab.freedesktop.org/drm/amd/-/issues/2008
And a one related to it: https://gitlab.freedesktop.org/drm/amd/-/issues/1982
You might want to take a look if what was discussed there might
be
related to your problem (I'm not directly involved in any of
this, I
don't know the details, it's just that 887f75cfd0da looked
familiar to
me). If it is, a fix for these two bugs was committed to master
earlier
this week:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
t/?i d=a56f445f807b0276
It will likely be backported to 5.17.y, maybe already in the
over-next
release. HTH.
Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
P.S.: As the Linux kernel's regression tracker I deal with a
lot of
reports and sometimes miss something important when writing
mails like
this. If that's the case here, don't hesitate to tell me in a
public
reply, it's in everyone's interest to set the public record
straight.
> Hello > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> symptoms are variable: > - either the laptop freezes; > - either the screen keeps blank; > - either the screen is OK but mouse is frozen; > - either display lags with several logs in dmesg: > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> fences timed out! > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> [ 229.283485] [drm] Fence fallback timer expired on ring gfx > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > ... > > I've bisected the problem. > > Please note this laptop has a strange behaviour on suspend: > The first suspend request always fails (this point has never
been
> fixed > and > plagues us when trying to diagnose another regression on
touchpad not
> resuming in the past). The screen goes blank and I can get it
OK when
> pressing the power button, this seems to reset it. After that
all
> suspend/resume works OK. > > Since 5.17.4, it is not possible anymore to get the laptop
working
> again > after the first suspend failure. > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> 1650Ti > (turned off with ACPI call in order to get more battery, I'm
not using
> NVidia driver).
[Public]
dmesg logs
The thing getting reset here is an APU, so the device link stuff is irrelevant to help with suspend ordering as it pertains to HDA which is what that commit you reverted was supposed to help. It seems that commit had some collateral damage to APU in the S3 path (which is relatively "uncommon" now).
Can you have a try with 5.18-rc7 + https://patchwork.freedesktop.org/patch/486595/
That was for a different issue, but I think it may have the same outcome for you in what it's helping. If it doesn't, please another full log.
We were discussing taking out all of this ASIC reset stuff on suspend too, so if that doesn't help you maybe it's the impetus to do so.
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the
NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening
(basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng
kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit : > Hi, this is your Linux kernel regression tracker. Thanks for
the report.
> > On 14.05.22 16:41, Christian Casteyde wrote: > > #regzbot introduced v5.17.3..v5.17.4 > > #regzbot introduced:
001828fb3084379f3c3e228b905223c50bc237f9
> > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> suspended before ASIC reset") upstream. > > Recently a regression was reported where 887f75cfd0da was
suspected as
> the culprit: >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla b.freedesktop.org%2Fdrm%2Famd%2F- %2Fissues%2F2008&data=05%7C01%7Cmario.limonciello%40amd.com% 7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e 183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D% 7C3000%7C%7C%7C&sdata=TzxAC63GZLbPsf1oZcj9PAIvEplX84r2VPbkb6 T47jo%3D&reserved=0
> > And a one related to it: >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla b.freedesktop.org%2Fdrm%2Famd%2F- %2Fissues%2F1982&data=05%7C01%7Cmario.limonciello%40amd.com% 7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e 183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D% 7C3000%7C%7C%7C&sdata=Kh1u7tEddBkeJ9NO6EFCKPrykcYW5GbQ6Fr tAGFK72M%3D&reserved=0
> > You might want to take a look if what was discussed there might
be
> related to your problem (I'm not directly involved in any of
this, I
> don't know the details, it's just that 887f75cfd0da looked
familiar to
> me). If it is, a fix for these two bugs was committed to master
earlier
> this week: > >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.
kernel.org%2Fpub%2Fscm%2Flinux%2Fkernel%2Fgit%2Ftorvalds%2Flinux.git %2Fcommi&data=05%7C01%7Cmario.limonciello%40amd.com%7C1bec0 68bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e183d%7 C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8eyJWIjoi MC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C300 0%7C%7C%7C&sdata=Q3hdn9Q9DYu9CFTjn6ge5VFcxIbHClwLvy8qFPbiCi w%3D&reserved=0
> t/?i d=a56f445f807b0276 > > It will likely be backported to 5.17.y, maybe already in the
over-next
> release. HTH. > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > P.S.: As the Linux kernel's regression tracker I deal with a
lot of
> reports and sometimes miss something important when writing
mails like
> this. If that's the case here, don't hesitate to tell me in a
public
> reply, it's in everyone's interest to set the public record
straight.
> > > Hello > > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> > symptoms are variable: > > - either the laptop freezes; > > - either the screen keeps blank; > > - either the screen is OK but mouse is frozen; > > - either display lags with several logs in dmesg: > > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> > fences timed out! > > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> > [ 229.283485] [drm] Fence fallback timer expired on ring gfx > > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > > ... > > > > I've bisected the problem. > > > > Please note this laptop has a strange behaviour on suspend: > > The first suspend request always fails (this point has never
been
> > fixed > > and > > plagues us when trying to diagnose another regression on
touchpad not
> > resuming in the past). The screen goes blank and I can get it
OK when
> > pressing the power button, this seems to reset it. After that
all
> > suspend/resume works OK. > > > > Since 5.17.4, it is not possible anymore to get the laptop
working
> > again > > after the first suspend failure. > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> > 1650Ti > > (turned off with ACPI call in order to get more battery, I'm
not using
> > NVidia driver).
Can you have a try with 5.18-rc7 + https://patchwork.freedesktop.org/patch/486595/
This doesn't fix my problem.
Appended dmesg log with this patch.
CC
----- Mail original ----- De: "Mario Limonciello" Mario.Limonciello@amd.com À: "casteyde christian" casteyde.christian@free.fr, "Kai-Heng Feng" kai.heng.feng@canonical.com Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "Alexander Deucher" Alexander.Deucher@amd.com, gregkh@linuxfoundation.org Envoyé: Mardi 17 Mai 2022 20:13:34 Objet: RE: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
[Public]
dmesg logs
The thing getting reset here is an APU, so the device link stuff is irrelevant to help with suspend ordering as it pertains to HDA which is what that commit you reverted was supposed to help. It seems that commit had some collateral damage to APU in the S3 path (which is relatively "uncommon" now).
Can you have a try with 5.18-rc7 + https://patchwork.freedesktop.org/patch/486595/
That was for a different issue, but I think it may have the same outcome for you in what it's helping. If it doesn't, please another full log.
We were discussing taking out all of this ASIC reset stuff on suspend too, so if that doesn't help you maybe it's the impetus to do so.
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the
NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening
(basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng
kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit : > Hi, this is your Linux kernel regression tracker. Thanks for
the report.
> > On 14.05.22 16:41, Christian Casteyde wrote: > > #regzbot introduced v5.17.3..v5.17.4 > > #regzbot introduced:
001828fb3084379f3c3e228b905223c50bc237f9
> > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> suspended before ASIC reset") upstream. > > Recently a regression was reported where 887f75cfd0da was
suspected as
> the culprit: >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla b.freedesktop.org%2Fdrm%2Famd%2F- %2Fissues%2F2008&data=05%7C01%7Cmario.limonciello%40amd.com% 7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e 183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D% 7C3000%7C%7C%7C&sdata=TzxAC63GZLbPsf1oZcj9PAIvEplX84r2VPbkb6 T47jo%3D&reserved=0
> > And a one related to it: >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla b.freedesktop.org%2Fdrm%2Famd%2F- %2Fissues%2F1982&data=05%7C01%7Cmario.limonciello%40amd.com% 7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e 183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D% 7C3000%7C%7C%7C&sdata=Kh1u7tEddBkeJ9NO6EFCKPrykcYW5GbQ6Fr tAGFK72M%3D&reserved=0
> > You might want to take a look if what was discussed there might
be
> related to your problem (I'm not directly involved in any of
this, I
> don't know the details, it's just that 887f75cfd0da looked
familiar to
> me). If it is, a fix for these two bugs was committed to master
earlier
> this week: > >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.
kernel.org%2Fpub%2Fscm%2Flinux%2Fkernel%2Fgit%2Ftorvalds%2Flinux.git %2Fcommi&data=05%7C01%7Cmario.limonciello%40amd.com%7C1bec0 68bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e183d%7 C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8eyJWIjoi MC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C300 0%7C%7C%7C&sdata=Q3hdn9Q9DYu9CFTjn6ge5VFcxIbHClwLvy8qFPbiCi w%3D&reserved=0
> t/?i d=a56f445f807b0276 > > It will likely be backported to 5.17.y, maybe already in the
over-next
> release. HTH. > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > P.S.: As the Linux kernel's regression tracker I deal with a
lot of
> reports and sometimes miss something important when writing
mails like
> this. If that's the case here, don't hesitate to tell me in a
public
> reply, it's in everyone's interest to set the public record
straight.
> > > Hello > > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> > symptoms are variable: > > - either the laptop freezes; > > - either the screen keeps blank; > > - either the screen is OK but mouse is frozen; > > - either display lags with several logs in dmesg: > > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> > fences timed out! > > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> > [ 229.283485] [drm] Fence fallback timer expired on ring gfx > > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > > ... > > > > I've bisected the problem. > > > > Please note this laptop has a strange behaviour on suspend: > > The first suspend request always fails (this point has never
been
> > fixed > > and > > plagues us when trying to diagnose another regression on
touchpad not
> > resuming in the past). The screen goes blank and I can get it
OK when
> > pressing the power button, this seems to reset it. After that
all
> > suspend/resume works OK. > > > > Since 5.17.4, it is not possible anymore to get the laptop
working
> > again > > after the first suspend failure. > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> > 1650Ti > > (turned off with ACPI call in order to get more battery, I'm
not using
> > NVidia driver).
[Public]
I did compare your 3 logs (good, bad, bad + patch destined for 5.18-rc8 or 5.19-rc1).
I notice that in your "good" log, the part that makes things "good" is that the reset happens. I think this is just happenstance where you end up because that reset exists. A cleaner way would be to reset if the resume actually failed.
In addition to my other ask somewhere in this thread for increased debugging output to find your "real" problem, please give a try of this combo:
1) 5.18-rc7 2) Patch turning off reset on dGPUs in s2idle 3) The below patch
This patch should do pretty much the same thing, try to recover your GPU when it's in this bad state. Now it happens somewhere more sustainable and sensible: on the resume path. I think this might get you a functional display rather than that "black screen" from the failed suspend cycle. It's still papering over your real issue of whatever caused the failed suspend.
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c index 7fd0277b2805..ecb2c6aacf78 100644 --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c @@ -2353,6 +2353,11 @@ static int amdgpu_pmops_resume(struct device *dev) adev->no_hw_access = true;
r = amdgpu_device_resume(drm_dev, true); + if (r) { + dev_err(adev->dev, "resume failed with %d; attempting to reset ASIC\n", r); + amdgpu_asic_reset(adev); + } + if (amdgpu_acpi_is_s0ix_active(adev)) adev->in_s0ix = false; else
-----Original Message----- From: casteyde.christian@free.fr casteyde.christian@free.fr Sent: Wednesday, May 18, 2022 06:02 To: Limonciello, Mario Mario.Limonciello@amd.com Cc: stable@vger.kernel.org; Thorsten Leemhuis regressions@leemhuis.info; regressions@lists.linux.dev; Deucher, Alexander Alexander.Deucher@amd.com; gregkh@linuxfoundation.org; Kai-Heng Feng kai.heng.feng@canonical.com Subject: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
Can you have a try with 5.18-rc7 +
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fpatc hwork.freedesktop.org%2Fpatch%2F486595%2F&data=05%7C01%7CMa rio.Limonciello%40amd.com%7C8b917efde13a43372f7f08da38bddd20%7C3d d8961fe4884e608e11a82d994e183d%7C0%7C0%7C637884685679831558%7CU nknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI 6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=37BNeVXvua eG3ANquM53yyGU5qqWZkB%2BAgc4%2F5pdhVo%3D&reserved=0 This doesn't fix my problem.
Appended dmesg log with this patch.
CC
----- Mail original ----- De: "Mario Limonciello" Mario.Limonciello@amd.com À: "casteyde christian" casteyde.christian@free.fr, "Kai-Heng Feng" kai.heng.feng@canonical.com Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "Alexander Deucher" Alexander.Deucher@amd.com, gregkh@linuxfoundation.org Envoyé: Mardi 17 Mai 2022 20:13:34 Objet: RE: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
[Public]
dmesg logs
The thing getting reset here is an APU, so the device link stuff is irrelevant to help with suspend ordering as it pertains to HDA which is what that commit you reverted was supposed to help. It seems that commit had some collateral damage to APU in the S3 path (which is relatively "uncommon" now).
Can you have a try with 5.18-rc7 + https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fpatc hwork.freedesktop.org%2Fpatch%2F486595%2F&data=05%7C01%7CMa rio.Limonciello%40amd.com%7C8b917efde13a43372f7f08da38bddd20%7C3d d8961fe4884e608e11a82d994e183d%7C0%7C0%7C637884685679831558%7CU nknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI 6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=37BNeVXvua eG3ANquM53yyGU5qqWZkB%2BAgc4%2F5pdhVo%3D&reserved=0
That was for a different issue, but I think it may have the same outcome for you in what it's helping. If it doesn't, please another full log.
We were discussing taking out all of this ASIC reset stuff on suspend too, so if that doesn't help you maybe it's the impetus to do so.
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown
the
NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening
(basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng
kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote: > I've applied the commit a56f445f807b0276 on 5.17.7 and tested. > This does not fix the problem on my laptop.
Maybe some commits are still missing?
> For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
> May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
> suspended > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
> PCI INT A > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
> busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: > kworker/u32:18 Tainted: G O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: events_unbound > async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: > 0010:i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
> 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc > > ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
> 9c> > 4b 00 83 f8 01 74 > May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
> 00010286 > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff888540f170e8 > RCX: 0000000000000be5 > May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: > 0000000000000086 > RDI: ffffffffac858df8 > May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: > ffffffffabe46d60 > R09: 00000000ac86a0f6 > May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: > ffffffffffffffff > R12: ffff888540f5c070 > May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: > 00000000ffffff94 > R15: ffff888540f17028 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f640000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: > 0000000045e0c000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 > May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
> May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 > May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 > May 14 19:21:41 geek500 kernel:
__i2c_hid_command+0x106/0x2d0
> May 14 19:21:41 geek500 kernel: ?
amd_gpio_irq_enable+0x19/0x50
> May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 > May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 > May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
> May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 > May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 > May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 > May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
> change power setting. > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > acpi_subsys_resume+0x0/0x50 returns -108 > May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
PM:
failed
> to > resume async: error -108 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: > [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-
> May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
> *ERROR* resume of IP block <gfx_v9_0> failed -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: > amdgpu_device_ip_resume failed (-110). > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > pci_pm_resume+0x0/0x120 returns -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
> async: error -110 > May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff8885401b6300 > RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 > May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 > May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: done. > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 > May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 > May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. > May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
> consumption will be higher as BIOS has not been configured for > suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
> setup. > May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
> PCI INT A > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
> May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
> timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
> expired on ring sdma0 > May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
> failed (-110) > May 14 19:21:59 geek500 kernel: done. > May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
> dnsmasq-resolv.conf, will retry > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:02 geek500 last message buffered 2 times > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:05 geek500 last message buffered 2 times > May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 last message buffered 1 times > ... > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
> Waiting for fences timed out! > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > > CC > > Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit : > > Hi, this is your Linux kernel regression tracker. Thanks for
the report.
> > > > On 14.05.22 16:41, Christian Casteyde wrote: > > > #regzbot introduced v5.17.3..v5.17.4 > > > #regzbot introduced:
001828fb3084379f3c3e228b905223c50bc237f9
> > > > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> > suspended before ASIC reset") upstream. > > > > Recently a regression was reported where 887f75cfd0da was
suspected as
> > the culprit: > >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla
b.freedesktop.org%2Fdrm%2Famd%2F-
%2Fissues%2F2008&data=05%7C01%7Cmario.limonciello%40amd.com%
7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e
183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey
JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%
7C3000%7C%7C%7C&sdata=TzxAC63GZLbPsf1oZcj9PAIvEplX84r2VPbkb6
T47jo%3D&reserved=0
> > > > And a one related to it: > >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitla
b.freedesktop.org%2Fdrm%2Famd%2F-
%2Fissues%2F1982&data=05%7C01%7Cmario.limonciello%40amd.com%
7C1bec068bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e
183d%7C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8ey
JWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%
7C3000%7C%7C%7C&sdata=Kh1u7tEddBkeJ9NO6EFCKPrykcYW5GbQ6Fr
tAGFK72M%3D&reserved=0
> > > > You might want to take a look if what was discussed there might
be
> > related to your problem (I'm not directly involved in any of
this, I
> > don't know the details, it's just that 887f75cfd0da looked
familiar to
> > me). If it is, a fix for these two bugs was committed to master
earlier
> > this week: > > > >
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.
kernel.org%2Fpub%2Fscm%2Flinux%2Fkernel%2Fgit%2Ftorvalds%2Flinux.git
%2Fcommi&data=05%7C01%7Cmario.limonciello%40amd.com%7C1bec0
68bb98a4378972f08da382c147a%7C3dd8961fe4884e608e11a82d994e183d%7
C0%7C0%7C637884059824032387%7CUnknown%7CTWFpbGZsb3d8eyJWIjoi
MC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C300
0%7C%7C%7C&sdata=Q3hdn9Q9DYu9CFTjn6ge5VFcxIbHClwLvy8qFPbiCi
w%3D&reserved=0
> > t/?i d=a56f445f807b0276 > > > > It will likely be backported to 5.17.y, maybe already in the
over-next
> > release. HTH. > > > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > > > P.S.: As the Linux kernel's regression tracker I deal with a
lot of
> > reports and sometimes miss something important when writing
mails like
> > this. If that's the case here, don't hesitate to tell me in a
public
> > reply, it's in everyone's interest to set the public record
straight.
> > > > > Hello > > > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> > > symptoms are variable: > > > - either the laptop freezes; > > > - either the screen keeps blank; > > > - either the screen is OK but mouse is frozen; > > > - either display lags with several logs in dmesg: > > > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail]
*ERROR*
Waiting for
> > > fences timed out! > > > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > > > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> > > [ 229.283485] [drm] Fence fallback timer expired on ring gfx > > > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > > > ... > > > > > > I've bisected the problem. > > > > > > Please note this laptop has a strange behaviour on suspend: > > > The first suspend request always fails (this point has never
been
> > > fixed > > > and > > > plagues us when trying to diagnose another regression on
touchpad not
> > > resuming in the past). The screen goes blank and I can get it
OK when
> > > pressing the power button, this seems to reset it. After that
all
> > > suspend/resume works OK. > > > > > > Since 5.17.4, it is not possible anymore to get the laptop
working
> > > again > > > after the first suspend failure. > > > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> > > 1650Ti > > > (turned off with ACPI call in order to get more battery, I'm
not using
> > > NVidia driver).
[Public]
In addition to my other ask somewhere in this thread for increased debugging output to find your "real" problem, please give a try of this combo:
- 5.18-rc7
- Patch turning off reset on dGPUs in s2idle
- The below patch
This patch should do pretty much the same thing, try to recover your GPU when it's in this bad state. Now it happens somewhere more sustainable and sensible: on the resume path. I think this might get you a functional display rather than that "black screen" from the failed suspend cycle. It's still papering over your real issue of whatever caused the failed suspend.
I noticed the reset doesn't redo the HW init, that's still another needed step.
So here's a proper link to the patch you can apply as an mbox: https://patchwork.freedesktop.org/patch/486836/ This is untested.
On Wed, May 18, 2022 at 1:38 AM casteyde.christian@free.fr wrote:
dmesg logs
Actually, the "good" is still no good: [ 43.375323] PM: suspend entry (deep) ... [ 43.695342] PM: late suspend of devices failed ... [ 44.554108] PM: suspend exit [ 44.554168] PM: suspend entry (s2idle)
So we need to find out why the suspend failed at first place.
Kai-Heng
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote:
I've applied the commit a56f445f807b0276 on 5.17.7 and tested. This does not fix the problem on my laptop.
Maybe some commits are still missing?
For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
suspended May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: kworker/u32:18 Tainted: G O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: events_unbound async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: 0010:i2c_dw_xfer+0x3f6/0x440 May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc
ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
9c> 4b 00 83 f8 01 74 May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
00010286 May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff888540f170e8 RCX: 0000000000000be5 May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: 0000000000000086 RDI: ffffffffac858df8 May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: ffffffffabe46d60 R09: 00000000ac86a0f6 May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888540f5c070 May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: 00000000ffffff94 R15: ffff888540f17028 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f640000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: 0000000045e0c000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
change power setting. May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): acpi_subsys_resume+0x0/0x50 returns -108 May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
to resume async: error -108 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
*ERROR* resume of IP block <gfx_v9_0> failed -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0x120 returns -110 May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
async: error -110 May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: done. May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:971 clk_core_disable+0x80/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: ffff8885401b6300 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
clk.c:829 clk_core_unprepare+0xb1/0x1a0 May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: kworker/6:3 Tainted: G W O 5.17.7+ #7 May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
Laptop 15-ec1xxx/87B2, BIOS F.25 08/18/2021 May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
RBX: ffff8885401b6300 RCX: 0000000000000027 May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: 0000000000000001 RDI: ffff88885f59f460 May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: ffffffffabf26da8 R09: 00000000ffffdfff May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: ffffffffabe46dc0 R12: 0000000000000000 May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: 0000000000000008 R15: 0000000000000000 May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) GS:ffff88885f580000(0000) knlGS:0000000000000000 May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: 0000000102956000 CR4: 0000000000350ee0 May 14 19:21:41 geek500 kernel: Call Trace: May 14 19:21:41 geek500 kernel: <TASK> May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 May 14 19:21:41 geek500 kernel: </TASK> May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
consumption will be higher as BIOS has not been configured for suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
setup. May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
power state from D3cold to D0 (config space inaccessible) May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
PCI INT A May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
expired on ring sdma0 May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
failed (-110) May 14 19:21:59 geek500 kernel: done. May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
sync register 0x4f0800. -5 May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
dnsmasq-resolv.conf, will retry May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:02 geek500 last message buffered 2 times May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:05 geek500 last message buffered 2 times May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
ring sdma0 May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
gfx May 14 19:22:06 geek500 last message buffered 1 times ... May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0 May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for fences timed out! May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
sdma0
CC
Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit : > Hi, this is your Linux kernel regression tracker. Thanks for
the report.
> > On 14.05.22 16:41, Christian Casteyde wrote: > > #regzbot introduced v5.17.3..v5.17.4 > > #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9 > > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> suspended before ASIC reset") upstream. > > Recently a regression was reported where 887f75cfd0da was
suspected as
> the culprit: > https://gitlab.freedesktop.org/drm/amd/-/issues/2008 > > And a one related to it: > https://gitlab.freedesktop.org/drm/amd/-/issues/1982 > > You might want to take a look if what was discussed there might
be
> related to your problem (I'm not directly involved in any of
this, I
> don't know the details, it's just that 887f75cfd0da looked
familiar to
> me). If it is, a fix for these two bugs was committed to master
earlier
> this week: > >
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
> t/?i d=a56f445f807b0276 > > It will likely be backported to 5.17.y, maybe already in the
over-next
> release. HTH. > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > P.S.: As the Linux kernel's regression tracker I deal with a
lot of
> reports and sometimes miss something important when writing
mails like
> this. If that's the case here, don't hesitate to tell me in a
public
> reply, it's in everyone's interest to set the public record
straight.
> > > Hello > > Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
> > symptoms are variable: > > - either the laptop freezes; > > - either the screen keeps blank; > > - either the screen is OK but mouse is frozen; > > - either display lags with several logs in dmesg: > > [ 228.275492] [drm] Fence fallback timer expired on ring gfx > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
> > fences timed out! > > [ 228.779490] [drm] Fence fallback timer expired on ring gfx > > [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
> > [ 229.283485] [drm] Fence fallback timer expired on ring gfx > > [ 229.787487] [drm] Fence fallback timer expired on ring gfx > > ... > > > > I've bisected the problem. > > > > Please note this laptop has a strange behaviour on suspend: > > The first suspend request always fails (this point has never
been
> > fixed > > and > > plagues us when trying to diagnose another regression on
touchpad not
> > resuming in the past). The screen goes blank and I can get it
OK when
> > pressing the power button, this seems to reset it. After that
all
> > suspend/resume works OK. > > > > Since 5.17.4, it is not possible anymore to get the laptop
working
> > again > > after the first suspend failure. > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
> > 1650Ti > > (turned off with ACPI call in order to get more battery, I'm
not using
> > NVidia driver).
On 5/17/22 21:08, Kai-Heng Feng wrote:
On Wed, May 18, 2022 at 1:38 AM casteyde.christian@free.fr wrote:
dmesg logs
Actually, the "good" is still no good: [ 43.375323] PM: suspend entry (deep) ... [ 43.695342] PM: late suspend of devices failed ... [ 44.554108] PM: suspend exit [ 44.554168] PM: suspend entry (s2idle)
So we need to find out why the suspend failed at first place.
I noticed that too; but the patch I suggested will completely avoid the GPU reset for the APU, which is my guess at why this suspend fails in the first place even in "good" scenario.
Kai-Heng
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18 branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote: > I've applied the commit a56f445f807b0276 on 5.17.7 and tested. > This does not fix the problem on my laptop.
Maybe some commits are still missing?
> For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
> May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
> suspended > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
> PCI INT A > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
> busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972 Comm: > kworker/u32:18 Tainted: G O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: events_unbound > async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: > 0010:i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01 4c 8b
67 50 4d
> 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01 cc > > ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc fc ff
ff e9 2d
> 9c> > 4b 00 83 f8 01 74 > May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
> 00010286 > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff888540f170e8 > RCX: 0000000000000be5 > May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: > 0000000000000086 > RDI: ffffffffac858df8 > May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: > ffffffffabe46d60 > R09: 00000000ac86a0f6 > May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: > ffffffffffffffff > R12: ffff888540f5c070 > May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: > 00000000ffffff94 > R15: ffff888540f17028 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f640000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: > 0000000045e0c000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 > May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
> May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 > May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 > May 14 19:21:41 geek500 kernel: __i2c_hid_command+0x106/0x2d0 > May 14 19:21:41 geek500 kernel: ? amd_gpio_irq_enable+0x19/0x50 > May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 > May 14 19:21:41 geek500 kernel: i2c_hid_core_resume+0x60/0xb0 > May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
> May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 > May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 > May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 > May 14 19:21:41 geek500 kernel: async_run_entry_fn+0x1b/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00:
failed to
> change power setting. > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > acpi_subsys_resume+0x0/0x50 returns -108 > May 14 19:21:41 geek500 kernel: i2c_hid_acpi i2c-ELAN0718:00: PM:
failed
> to > resume async: error -108 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: > [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed (-110) > May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
> *ERROR* resume of IP block <gfx_v9_0> failed -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: > amdgpu_device_ip_resume failed (-110). > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > pci_pm_resume+0x0/0x120 returns -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM: failed
to resume
> async: error -110 > May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work > May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff8885401b6300 > RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x74/0xd0 > May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 > May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work > May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x7c/0xd0 > May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 > May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: done. > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work > May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f 1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7 c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89 c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x88/0xd0 > May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 > May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already unprepared > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091 Comm: > kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm pm_runtime_work > May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b 30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7 c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0 48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: > 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel: i2c_dw_prepare_clk+0x90/0xd0 > May 14 19:21:41 geek500 kernel: dw_i2c_plat_suspend+0x2e/0x40 > May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ? acpi_dev_suspend+0x160/0x160 > May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel: process_one_work+0x1d3/0x3a0 > May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ? rescuer_thread+0x380/0x380 > May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace 0000000000000000
]---
> May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds) done. > May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
> consumption will be higher as BIOS has not been configured for > suspend-to-idle. To use suspend-to-idle change the sleep mode in
BIOS
> setup. > May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1: can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't derive
routing for
> PCI INT A > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A: no
GSI
> May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
> timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
> expired on ring sdma0 > May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL: download fw
command
> failed (-110) > May 14 19:21:59 geek500 kernel: done. > May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
> dnsmasq-resolv.conf, will retry > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:02 geek500 last message buffered 2 times > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:05 geek500 last message buffered 2 times > May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on
> ring sdma0 > May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 last message buffered 1 times > ... > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
> Waiting for fences timed out! > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > > CC > > Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a écrit : >> Hi, this is your Linux kernel regression tracker. Thanks for
the report.
>> >> On 14.05.22 16:41, Christian Casteyde wrote: >>> #regzbot introduced v5.17.3..v5.17.4 >>> #regzbot introduced: 001828fb3084379f3c3e228b905223c50bc237f9 >> >> FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
>> suspended before ASIC reset") upstream. >> >> Recently a regression was reported where 887f75cfd0da was
suspected as
>> the culprit: >> https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitlab.fre... >> >> And a one related to it: >> https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgitlab.fre... >> >> You might want to take a look if what was discussed there might
be
>> related to your problem (I'm not directly involved in any of
this, I
>> don't know the details, it's just that 887f75cfd0da looked
familiar to
>> me). If it is, a fix for these two bugs was committed to master
earlier
>> this week: >> >>
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgit.kernel...
>> t/?i d=a56f445f807b0276 >> >> It will likely be backported to 5.17.y, maybe already in the
over-next
>> release. HTH. >> >> Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
>> >> P.S.: As the Linux kernel's regression tracker I deal with a
lot of
>> reports and sometimes miss something important when writing
mails like
>> this. If that's the case here, don't hesitate to tell me in a
public
>> reply, it's in everyone's interest to set the public record
straight.
>> >>> Hello >>> Since 5.17.4 my laptop doesn't resume from suspend anymore.
At resume,
>>> symptoms are variable: >>> - either the laptop freezes; >>> - either the screen keeps blank; >>> - either the screen is OK but mouse is frozen; >>> - either display lags with several logs in dmesg: >>> [ 228.275492] [drm] Fence fallback timer expired on ring gfx >>> [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail] *ERROR*
Waiting for
>>> fences timed out! >>> [ 228.779490] [drm] Fence fallback timer expired on ring gfx >>> [ 229.283484] [drm] Fence fallback timer expired on ring
sdma0
>>> [ 229.283485] [drm] Fence fallback timer expired on ring gfx >>> [ 229.787487] [drm] Fence fallback timer expired on ring gfx >>> ... >>> >>> I've bisected the problem. >>> >>> Please note this laptop has a strange behaviour on suspend: >>> The first suspend request always fails (this point has never
been
>>> fixed >>> and >>> plagues us when trying to diagnose another regression on
touchpad not
>>> resuming in the past). The screen goes blank and I can get it
OK when
>>> pressing the power button, this seems to reset it. After that
all
>>> suspend/resume works OK. >>> >>> Since 5.17.4, it is not possible anymore to get the laptop
working
>>> again >>> after the first suspend failure. >>> >>> HW : HP Pavilion / Ryzen 4600H with AMD graphics integrated +
NVidia
>>> 1650Ti >>> (turned off with ACPI call in order to get more battery, I'm
not using
>>> NVidia driver).
This laptop has never managed to suspend correctly at first try. However on second try without the commit, it does work. What I do is: - try first, the suspend fails but the screen remains blank. - press the power button, that does something that resumes the screen - try second, and this times it works. I will append a dmesg output with the second pass also when it works. FYI we also tried to find the first pass failure while chasing another previous regression but we didn't managed. With the regression, I cannot resume from the first try at all (either the laptop remains stuck, or it resumes the screen but it lags with all the timeouts in dmesg). So it 'doesnt work worse'.
CC
Télécharger BlueMail pour Android
Le 18 mai 2022 à 04:08, à 04:08, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Wed, May 18, 2022 at 1:38 AM casteyde.christian@free.fr wrote:
dmesg logs
Actually, the "good" is still no good: [ 43.375323] PM: suspend entry (deep) ... [ 43.695342] PM: late suspend of devices failed ... [ 44.554108] PM: suspend exit [ 44.554168] PM: suspend entry (s2idle)
So we need to find out why the suspend failed at first place.
Kai-Heng
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis"
regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com
Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video
since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to
shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this
evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng
kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18
branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up
to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints
kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote: > I've applied the commit a56f445f807b0276 on 5.17.7 and
tested.
> This does not fix the problem on my laptop.
Maybe some commits are still missing?
> For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
> May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1:
can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
> suspended > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't
derive
routing for
> PCI INT A > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A:
no
GSI
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
> busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972
Comm:
> kworker/u32:18 Tainted: G O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: events_unbound > async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: > 0010:i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01
4c 8b
67 50 4d
> 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01
cc
> > ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc
fc ff
ff e9 2d
> 9c> > 4b 00 83 f8 01 74 > May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
> 00010286 > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff888540f170e8 > RCX: 0000000000000be5 > May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: > 0000000000000086 > RDI: ffffffffac858df8 > May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: > ffffffffabe46d60 > R09: 00000000ac86a0f6 > May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: > ffffffffffffffff > R12: ffff888540f5c070 > May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: > 00000000ffffff94 > R15: ffff888540f17028 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f640000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: > 0000000045e0c000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 > May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
> May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 > May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 > May 14 19:21:41 geek500 kernel:
__i2c_hid_command+0x106/0x2d0
> May 14 19:21:41 geek500 kernel: ?
amd_gpio_irq_enable+0x19/0x50
> May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 > May 14 19:21:41 geek500 kernel:
i2c_hid_core_resume+0x60/0xb0
> May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
> May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 > May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 > May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 > May 14 19:21:41 geek500 kernel:
async_run_entry_fn+0x1b/0xa0
> May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: i2c_hid_acpi
i2c-ELAN0718:00:
failed to
> change power setting. > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > acpi_subsys_resume+0x0/0x50 returns -108 > May 14 19:21:41 geek500 kernel: i2c_hid_acpi
i2c-ELAN0718:00: PM:
failed
> to > resume async: error -108 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: > [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed
(-110)
> May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
> *ERROR* resume of IP block <gfx_v9_0> failed -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: > amdgpu_device_ip_resume failed (-110). > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > pci_pm_resume+0x0/0x120 returns -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM:
failed
to resume
> async: error -110 > May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f
1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7
c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89
c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff8885401b6300 > RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x74/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already
unprepared
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b
30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7
c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0
48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x7c/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: done. > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f
1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7
c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89
c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x88/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already
unprepared
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b
30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7
c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0
48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x90/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds)
done.
> May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
> consumption will be higher as BIOS has not been configured
for
> suspend-to-idle. To use suspend-to-idle change the sleep
mode in
BIOS
> setup. > May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1:
can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't
derive
routing for
> PCI INT A > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A:
no
GSI
> May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
> timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
> expired on ring sdma0 > May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL:
download fw
command
> failed (-110) > May 14 19:21:59 geek500 kernel: done. > May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
> dnsmasq-resolv.conf, will retry > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:02 geek500 last message buffered 2 times > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:05 geek500 last message buffered 2 times > May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 last message buffered 1 times > ... > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
> Waiting for fences timed out! > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > > CC > > Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a
écrit :
> > Hi, this is your Linux kernel regression tracker. Thanks
for
the report.
> > > > On 14.05.22 16:41, Christian Casteyde wrote: > > > #regzbot introduced v5.17.3..v5.17.4 > > > #regzbot introduced:
001828fb3084379f3c3e228b905223c50bc237f9
> > > > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> > suspended before ASIC reset") upstream. > > > > Recently a regression was reported where 887f75cfd0da was
suspected as
> > the culprit: > > https://gitlab.freedesktop.org/drm/amd/-/issues/2008 > > > > And a one related to it: > > https://gitlab.freedesktop.org/drm/amd/-/issues/1982 > > > > You might want to take a look if what was discussed there
might
be
> > related to your problem (I'm not directly involved in any
of
this, I
> > don't know the details, it's just that 887f75cfd0da looked
familiar to
> > me). If it is, a fix for these two bugs was committed to
master
earlier
> > this week: > > > >
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
> > t/?i d=a56f445f807b0276 > > > > It will likely be backported to 5.17.y, maybe already in
the
over-next
> > release. HTH. > > > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > > > P.S.: As the Linux kernel's regression tracker I deal with
a
lot of
> > reports and sometimes miss something important when
writing
mails like
> > this. If that's the case here, don't hesitate to tell me
in a
public
> > reply, it's in everyone's interest to set the public
record
straight.
> > > > > Hello > > > Since 5.17.4 my laptop doesn't resume from suspend
anymore.
At resume,
> > > symptoms are variable: > > > - either the laptop freezes; > > > - either the screen keeps blank; > > > - either the screen is OK but mouse is frozen; > > > - either display lags with several logs in dmesg: > > > [ 228.275492] [drm] Fence fallback timer expired on
ring gfx
> > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail]
*ERROR*
Waiting for
> > > fences timed out! > > > [ 228.779490] [drm] Fence fallback timer expired on
ring gfx
> > > [ 229.283484] [drm] Fence fallback timer expired on
ring
sdma0
> > > [ 229.283485] [drm] Fence fallback timer expired on
ring gfx
> > > [ 229.787487] [drm] Fence fallback timer expired on
ring gfx
> > > ... > > > > > > I've bisected the problem. > > > > > > Please note this laptop has a strange behaviour on
suspend:
> > > The first suspend request always fails (this point has
never
been
> > > fixed > > > and > > > plagues us when trying to diagnose another regression on
touchpad not
> > > resuming in the past). The screen goes blank and I can
get it
OK when
> > > pressing the power button, this seems to reset it. After
that
all
> > > suspend/resume works OK. > > > > > > Since 5.17.4, it is not possible anymore to get the
laptop
working
> > > again > > > after the first suspend failure. > > > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics
integrated +
NVidia
> > > 1650Ti > > > (turned off with ACPI call in order to get more battery,
I'm
not using
> > > NVidia driver).
Here is a dmesg output with: - the commit reverted; - first suspend failing; - second suspend working.
CC
----- Mail original ----- De: "Christian Casteyde" casteyde.christian@free.fr À: "Kai-Heng Feng" kai.heng.feng@canonical.com Cc: stable@vger.kernel.org, "Thorsten Leemhuis" regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com Envoyé: Mercredi 18 Mai 2022 09:15:33 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video since 5.17.4 (works 5.17.3)
This laptop has never managed to suspend correctly at first try. However on second try without the commit, it does work. What I do is: - try first, the suspend fails but the screen remains blank. - press the power button, that does something that resumes the screen - try second, and this times it works. I will append a dmesg output with the second pass also when it works. FYI we also tried to find the first pass failure while chasing another previous regression but we didn't managed. With the regression, I cannot resume from the first try at all (either the laptop remains stuck, or it resumes the screen but it lags with all the timeouts in dmesg). So it 'doesnt work worse'.
CC
Télécharger BlueMail pour Android
Le 18 mai 2022 à 04:08, à 04:08, Kai-Heng Feng kai.heng.feng@canonical.com a écrit:
On Wed, May 18, 2022 at 1:38 AM casteyde.christian@free.fr wrote:
dmesg logs
Actually, the "good" is still no good: [ 43.375323] PM: suspend entry (deep) ... [ 43.695342] PM: late suspend of devices failed ... [ 44.554108] PM: suspend exit [ 44.554168] PM: suspend entry (s2idle)
So we need to find out why the suspend failed at first place.
Kai-Heng
----- Mail original ----- De: "Kai-Heng Feng" kai.heng.feng@canonical.com À: "Christian Casteyde" casteyde.christian@free.fr Cc: stable@vger.kernel.org, "Thorsten Leemhuis"
regressions@leemhuis.info, regressions@lists.linux.dev, "alexander deucher" alexander.deucher@amd.com, gregkh@linuxfoundation.org, "Mario Limonciello" mario.limonciello@amd.com
Envoyé: Mardi 17 Mai 2022 08:58:30 Objet: Re: [REGRESSION] Laptop with Ryzen 4600H fails to resume video
since 5.17.4 (works 5.17.3)
On Tue, May 17, 2022 at 2:36 PM Christian Casteyde casteyde.christian@free.fr wrote:
No, the problem is there even without acpicall. Fyi I use it to
shutdown the NVidia card that eats the battery otherwise.
I managed to get a dmesg output with 2.18rc7 I will post it this
evening (basically exact same behavior as 2.17.4).
Can you please also attach dmesg without the offending commit (i.e. when it's working)?
Kai-Heng
CC
Télécharger BlueMail pour Android
Le 17 mai 2022 à 04:03, à 04:03, Kai-Heng Feng
kai.heng.feng@canonical.com a écrit:
On Tue, May 17, 2022 at 1:23 AM Christian Casteyde casteyde.christian@free.fr wrote:
I've tried with 5.18-rc7, it doesn't work either. I guess 5.18
branch
have all
commits.
full dmesg appended (not for 5.18, I didn't manage to resume up
to
the point
to get a console for now).
Interestingly, I found you are using acpi_call: [ 30.667348] acpi_call: loading out-of-tree module taints
kernel.
Does removing the acpi_call solve the issue?
Kai-Heng
CC
Le lundi 16 mai 2022, 04:47:25 CEST Kai-Heng Feng a écrit :
[+Cc Mario]
On Sun, May 15, 2022 at 1:34 AM Christian Casteyde
casteyde.christian@free.fr wrote: > I've applied the commit a56f445f807b0276 on 5.17.7 and
tested.
> This does not fix the problem on my laptop.
Maybe some commits are still missing?
> For informatio, here is a part of the log around the suspend
process:
Is it possible to attach full dmesg?
Kai-Heng
> May 14 19:21:41 geek500 kernel: snd_hda_intel 0000:01:00.1:
can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:41 geek500 kernel: PM: late suspend of devices
failed
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: i2c_designware AMDI0010:03:
Transfer while
> suspended > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: can't
derive
routing for
> PCI INT A > May 14 19:21:41 geek500 kernel: pci 0000:00:00.2: PCI INT A:
no
GSI
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 9 PID: 1972 at
drivers/i2c/
> busses/i2c-designware-master.c:570 i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 9 PID: 1972
Comm:
> kworker/u32:18 Tainted: G O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: events_unbound > async_run_entry_fn May 14 19:21:41 geek500 kernel: RIP: > 0010:i2c_dw_xfer+0x3f6/0x440 > May 14 19:21:41 geek500 kernel: Code: c6 05 db 31 45 01 01
4c 8b
67 50 4d
> 85 e4 75 03 4c 8b 27 e8 fc e1 e9 ff 4c 89 e2 48 c7 c7 00 01
cc
> > ab 48 89 c6 e8 b3 4f 45 00 <0f> 0b 41 be 94 ff ff ff e9 cc
fc ff
ff e9 2d
> 9c> > 4b 00 83 f8 01 74 > May 14 19:21:41 geek500 kernel: RSP: 0018:ffff8dbfc31e7c68
EFLAGS:
> 00010286 > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff888540f170e8 > RCX: 0000000000000be5 > May 14 19:21:41 geek500 kernel: RDX: 0000000000000000 RSI: > 0000000000000086 > RDI: ffffffffac858df8 > May 14 19:21:41 geek500 kernel: RBP: ffff888540f170e8 R08: > ffffffffabe46d60 > R09: 00000000ac86a0f6 > May 14 19:21:41 geek500 kernel: R10: ffffffffffffffff R11: > ffffffffffffffff > R12: ffff888540f5c070 > May 14 19:21:41 geek500 kernel: R13: ffff8dbfc31e7d70 R14: > 00000000ffffff94 > R15: ffff888540f17028 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f640000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00007f1984067028 CR3: > 0000000045e0c000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: ? dequeue_entity+0xd4/0x250 > May 14 19:21:41 geek500 kernel: ?
newidle_balance.constprop.0+0x1f7/0x3b0
> May 14 19:21:41 geek500 kernel: __i2c_transfer+0x16d/0x520 > May 14 19:21:41 geek500 kernel: i2c_transfer+0x7a/0xd0 > May 14 19:21:41 geek500 kernel:
__i2c_hid_command+0x106/0x2d0
> May 14 19:21:41 geek500 kernel: ?
amd_gpio_irq_enable+0x19/0x50
> May 14 19:21:41 geek500 kernel: i2c_hid_set_power+0x4a/0xd0 > May 14 19:21:41 geek500 kernel:
i2c_hid_core_resume+0x60/0xb0
> May 14 19:21:41 geek500 kernel: ?
acpi_subsys_resume_early+0x50/0x50
> May 14 19:21:41 geek500 kernel: dpm_run_callback+0x1d/0xd0 > May 14 19:21:41 geek500 kernel: device_resume+0x122/0x230 > May 14 19:21:41 geek500 kernel: async_resume+0x14/0x30 > May 14 19:21:41 geek500 kernel:
async_run_entry_fn+0x1b/0xa0
> May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: i2c_hid_acpi
i2c-ELAN0718:00:
failed to
> change power setting. > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > acpi_subsys_resume+0x0/0x50 returns -108 > May 14 19:21:41 geek500 kernel: i2c_hid_acpi
i2c-ELAN0718:00: PM:
failed
> to > resume async: error -108 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: > [drm:amdgpu_ring_test_helper] *ERROR* ring gfx test failed
(-110)
> May 14 19:21:41 geek500 kernel:
[drm:amdgpu_device_ip_resume_phase2]
> *ERROR* resume of IP block <gfx_v9_0> failed -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: amdgpu: > amdgpu_device_ip_resume failed (-110). > May 14 19:21:41 geek500 kernel: PM: dpm_run_callback(): > pci_pm_resume+0x0/0x120 returns -110 > May 14 19:21:41 geek500 kernel: amdgpu 0000:05:00.0: PM:
failed
to resume
> async: error -110 > May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f
1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7
c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89
c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: > May 14 19:21:41 geek500 kernel: RAX: 0000000000000000 RBX: > ffff8885401b6300 > RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000283 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x74/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already
unprepared
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b
30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7
c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0
48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x7c/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: done. > May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already disabled > May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:971 clk_core_disable+0x80/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_disable+0x80/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 10 e8 e4 4a d1 00 0f
1f 44
00 00 48
> 8b 5b 30 48 85 db 74 b6 8b 43 7c 85 c0 75 a4 48 8b 33 48 c7
c7 7d
87 c4
> ab e8 79 7a 9a 00 <0f> 0b 5b 5d c3 65 8b 05 5c a1 92 55 89
c0 48
0f a3 05
> 4a 61 9d 01 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d50
> EFLAGS: 00010082 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: 0000000000000287 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: ffff8885401b6300 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_disable+0x24/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x88/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:41 geek500 kernel: ------------[ cut here
]------------
> May 14 19:21:41 geek500 kernel: AMDI0010:03 already
unprepared
> May 14 19:21:41 geek500 kernel: WARNING: CPU: 6 PID: 1091 at
drivers/clk/
> clk.c:829 clk_core_unprepare+0xb1/0x1a0 > May 14 19:21:41 geek500 kernel: Modules linked in: [last
unloaded:
> acpi_call] May 14 19:21:41 geek500 kernel: CPU: 6 PID: 1091
Comm:
> kworker/6:3 Tainted: G W O 5.17.7+ #7 > May 14 19:21:41 geek500 kernel: Hardware name: HP HP
Pavilion
Gaming
> Laptop > 15-ec1xxx/87B2, BIOS F.25 08/18/2021 > May 14 19:21:41 geek500 kernel: Workqueue: pm
pm_runtime_work
> May 14 19:21:41 geek500 kernel: RIP:
0010:clk_core_unprepare+0xb1/0x1a0
> May 14 19:21:41 geek500 kernel: Code: 40 00 66 90 48 8b 5b
30 48
85 db 74
> a2 8b 83 80 00 00 00 85 c0 0f 85 79 ff ff ff 48 8b 33 48 c7
c7 35
87 c4
> ab e8 18 7c 9a 00 <0f> 0b 5b c3 65 8b 05 fc a2 92 55 89 c0
48 0f
a3 05 ea
> 62 9d 01 73 May 14 19:21:41 geek500 kernel: RSP:
0018:ffff8dbfc1c47d60
> EFLAGS: 00010286 May 14 19:21:41 geek500 kernel: RAX:
0000000000000000
> RBX: ffff8885401b6300 RCX: 0000000000000027 > May 14 19:21:41 geek500 kernel: RDX: ffff88885f59f468 RSI: > 0000000000000001 > RDI: ffff88885f59f460 > May 14 19:21:41 geek500 kernel: RBP: ffff8885401b6300 R08: > ffffffffabf26da8 > R09: 00000000ffffdfff > May 14 19:21:41 geek500 kernel: R10: ffffffffabe46dc0 R11: > ffffffffabe46dc0 > R12: 0000000000000000 > May 14 19:21:41 geek500 kernel: R13: ffff888540fc30f4 R14: > 0000000000000008 > R15: 0000000000000000 > May 14 19:21:41 geek500 kernel: FS: 0000000000000000(0000) > GS:ffff88885f580000(0000) knlGS:0000000000000000 > May 14 19:21:41 geek500 kernel: CS: 0010 DS: 0000 ES: 0000
CR0:
> 0000000080050033 > May 14 19:21:41 geek500 kernel: CR2: 00000000010fa990 CR3: > 0000000102956000 > CR4: 0000000000350ee0 > May 14 19:21:41 geek500 kernel: Call Trace: > May 14 19:21:41 geek500 kernel: <TASK> > May 14 19:21:41 geek500 kernel: clk_unprepare+0x1f/0x30 > May 14 19:21:41 geek500 kernel:
i2c_dw_prepare_clk+0x90/0xd0
> May 14 19:21:41 geek500 kernel:
dw_i2c_plat_suspend+0x2e/0x40
> May 14 19:21:41 geek500 kernel:
acpi_subsys_runtime_suspend+0x9/0x20
> May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: __rpm_callback+0x3f/0x150 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_callback+0x54/0x60 > May 14 19:21:41 geek500 kernel: ?
acpi_dev_suspend+0x160/0x160
> May 14 19:21:41 geek500 kernel: rpm_suspend+0x142/0x720 > May 14 19:21:41 geek500 kernel: pm_runtime_work+0x8f/0xa0 > May 14 19:21:41 geek500 kernel:
process_one_work+0x1d3/0x3a0
> May 14 19:21:41 geek500 kernel: worker_thread+0x48/0x3c0 > May 14 19:21:41 geek500 kernel: ?
rescuer_thread+0x380/0x380
> May 14 19:21:41 geek500 kernel: kthread+0xd3/0x100 > May 14 19:21:41 geek500 kernel: ?
kthread_complete_and_exit+0x20/0x20
> May 14 19:21:41 geek500 kernel: ret_from_fork+0x22/0x30 > May 14 19:21:41 geek500 kernel: </TASK> > May 14 19:21:41 geek500 kernel: ---[ end trace
0000000000000000
]---
> May 14 19:21:59 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:21:59 geek500 kernel: (elapsed 0.175 seconds)
done.
> May 14 19:21:59 geek500 kernel: amdgpu 0000:05:00.0: amdgpu:
Power
> consumption will be higher as BIOS has not been configured
for
> suspend-to-idle. To use suspend-to-idle change the sleep
mode in
BIOS
> setup. > May 14 19:21:59 geek500 kernel: snd_hda_intel 0000:01:00.1:
can't
change
> power state from D3cold to D0 (config space inaccessible) > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: can't
derive
routing for
> PCI INT A > May 14 19:21:59 geek500 kernel: pci 0000:00:00.2: PCI INT A:
no
GSI
> May 14 19:21:59 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:21:59 geek500 kernel: Bluetooth: hci0: command
0xfc20 tx
> timeout May 14 19:21:59 geek500 kernel: [drm] Fence fallback
timer
> expired on ring sdma0 > May 14 19:21:59 geek500 kernel: Bluetooth: hci0: RTL:
download fw
command
> failed (-110) > May 14 19:21:59 geek500 kernel: done. > May 14 19:22:00 geek500 kernel: snd_hda_codec_hdmi
hdaudioC1D0:
Unable to
> sync register 0x4f0800. -5 > May 14 19:22:00 geek500 dnsmasq[2079]: no servers found in
/etc/dnsmasq.d/
> dnsmasq-resolv.conf, will retry > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:01 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:01 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:02 geek500 last message buffered 2 times > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:03 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:03 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:04 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:04 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:05 geek500 last message buffered 2 times > May 14 19:22:05 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 kernel: [drm] Fence fallback
timer
expired on
> ring sdma0 > May 14 19:22:06 geek500 kernel: [drm] Fence fallback timer
expired on ring
> gfx May 14 19:22:06 geek500 last message buffered 1 times > ... > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > May 14 19:22:18 geek500 kernel:
[drm:amdgpu_dm_atomic_commit_tail] *ERROR*
> Waiting for fences timed out! > May 14 19:22:18 geek500 kernel: [drm] Fence fallback timer
expired on ring
> sdma0 > > CC > > Le samedi 14 mai 2022, 17:12:33 CEST Thorsten Leemhuis a
écrit :
> > Hi, this is your Linux kernel regression tracker. Thanks
for
the report.
> > > > On 14.05.22 16:41, Christian Casteyde wrote: > > > #regzbot introduced v5.17.3..v5.17.4 > > > #regzbot introduced:
001828fb3084379f3c3e228b905223c50bc237f9
> > > > FWIW, that's commit 887f75cfd0da ("drm/amdgpu: Ensure HDA
function is
> > suspended before ASIC reset") upstream. > > > > Recently a regression was reported where 887f75cfd0da was
suspected as
> > the culprit: > > https://gitlab.freedesktop.org/drm/amd/-/issues/2008 > > > > And a one related to it: > > https://gitlab.freedesktop.org/drm/amd/-/issues/1982 > > > > You might want to take a look if what was discussed there
might
be
> > related to your problem (I'm not directly involved in any
of
this, I
> > don't know the details, it's just that 887f75cfd0da looked
familiar to
> > me). If it is, a fix for these two bugs was committed to
master
earlier
> > this week: > > > >
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commi
> > t/?i d=a56f445f807b0276 > > > > It will likely be backported to 5.17.y, maybe already in
the
over-next
> > release. HTH. > > > > Ciao, Thorsten (wearing his 'the Linux kernel's regression
tracker' hat)
> > > > P.S.: As the Linux kernel's regression tracker I deal with
a
lot of
> > reports and sometimes miss something important when
writing
mails like
> > this. If that's the case here, don't hesitate to tell me
in a
public
> > reply, it's in everyone's interest to set the public
record
straight.
> > > > > Hello > > > Since 5.17.4 my laptop doesn't resume from suspend
anymore.
At resume,
> > > symptoms are variable: > > > - either the laptop freezes; > > > - either the screen keeps blank; > > > - either the screen is OK but mouse is frozen; > > > - either display lags with several logs in dmesg: > > > [ 228.275492] [drm] Fence fallback timer expired on
ring gfx
> > > [ 228.395466] [drm:amdgpu_dm_atomic_commit_tail]
*ERROR*
Waiting for
> > > fences timed out! > > > [ 228.779490] [drm] Fence fallback timer expired on
ring gfx
> > > [ 229.283484] [drm] Fence fallback timer expired on
ring
sdma0
> > > [ 229.283485] [drm] Fence fallback timer expired on
ring gfx
> > > [ 229.787487] [drm] Fence fallback timer expired on
ring gfx
> > > ... > > > > > > I've bisected the problem. > > > > > > Please note this laptop has a strange behaviour on
suspend:
> > > The first suspend request always fails (this point has
never
been
> > > fixed > > > and > > > plagues us when trying to diagnose another regression on
touchpad not
> > > resuming in the past). The screen goes blank and I can
get it
OK when
> > > pressing the power button, this seems to reset it. After
that
all
> > > suspend/resume works OK. > > > > > > Since 5.17.4, it is not possible anymore to get the
laptop
working
> > > again > > > after the first suspend failure. > > > > > > HW : HP Pavilion / Ryzen 4600H with AMD graphics
integrated +
NVidia
> > > 1650Ti > > > (turned off with ACPI call in order to get more battery,
I'm
not using
> > > NVidia driver).
linux-stable-mirror@lists.linaro.org