Re: [PATCH] powerpc/pseries/msi: Avoid reading PCI device registers in reduced power states

From: Venkat Rao Bagalkote
Date: Wed Mar 26 2025 - 13:34:18 EST



On 10/03/25 10:30 am, Vaibhav Jain wrote:
Gautam Menghani <gautam@xxxxxxxxxxxxx> writes:

When a system is being suspended to RAM, the PCI devices are also
suspended and the PPC code ends up calling pseries_msi_compose_msg() and
this triggers the BUG_ON() in __pci_read_msi_msg() because the device at
this point is in reduced power state. In reduced power state, the memory
mapped registers of the PCI device are not accessible.

To replicate the bug:
1. Make sure deep sleep is selected
# cat /sys/power/mem_sleep
s2idle [deep]

2. Make sure console is not suspended (so that dmesg logs are visible)
echo N > /sys/module/printk/parameters/console_suspend

3. Suspend the system
echo mem > /sys/power/state

To fix this behaviour, read the cached msi message of the device when the
device is not in PCI_D0 power state instead of touching the hardware.

Fixes: a5f3d2c17b07 ("powerpc/pseries/pci: Add MSI domains")
Cc: stable@xxxxxxxxxxxxxxx # v5.15+
Signed-off-by: Gautam Menghani <gautam@xxxxxxxxxxxxx>
I am able to reporduce this issue without this patch and with this pacth, there is no BUG_ON() in __pci_read_msi_msg(), but did see kernel warnings. not sure if its side effect of this patch or a seperate issue.

Without this patch: [ 96.888399] ------------[ cut here ]------------ [ 96.888402] kernel BUG at drivers/pci/msi/msi.c:158! [ 96.888407] Oops: Exception in kernel mode, sig: 5 [#1] [ 96.888410] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=8192 NUMA pSeries [ 96.888414] Modules linked in: nft_compat nf_tables nfnetlink bonding tls rfkill binfmt_misc kmem device_dax pseries_rng vmx_crypto dax_pmem drm drm_panel_orientation_quirks xfs dm_service_time sd_mod sg nd_pmem ibmvfc nd_btt ibmvscsi scsi_transport_fc ibmveth scsi_transport_srp papr_scm libnvdimm tg3 dm_multipath dm_mirror dm_region_hash dm_log dm_mod fuse [ 96.888473] CPU: 14 UID: 0 PID: 89 Comm: migration/14 Kdump: loaded Not tainted 6.14.0-auto #3 [ 96.888479] Hardware name: IBM,9009-42A POWER9 (architected) 0x4e0202 0xf000005 of:IBM,FW950.A0 (VL950_141) hv:phyp pSeries [ 96.888481] Stopper: multi_cpu_stop+0x0/0x22c <- __stop_cpus.constprop.0+0x68/0xc0 [ 96.888494] NIP: c000000000995aec LR: c00000000010ec20 CTR: c00000000010ebf8 [ 96.888498] REGS: c00000000680f830 TRAP: 0700 Not tainted (6.14.0-auto) [ 96.888501] MSR: 8000000002823033 <SF,VEC,VSX,FP,ME,IR,DR,RI,LE> CR: 44004208 XER: 00000000 [ 96.888520] CFAR: c00000000010ec1c IRQMASK: 3 GPR00: c00000000010ec20 c00000000680fad0 c000000001668100 c000000006c537e0 GPR04: c00000000680fb80 0000000000000000 0000000000000000 c009ffffff8325f0 GPR08: 0000000000000001 0000000000000001 0000000000000003 0000000000001003 GPR12: c00000000010ebf8 c00000000f7beb00 c0000000001acbe0 c000000004056d40 GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR20: 0000000000000000 0000000000000003 000000000000001d c000000002cfaa88 GPR24: c00000006545b800 c000000002cfc080 c000000001126bb0 0000000000000000 GPR28: 0000000000000010 c00000000ce790c8 c00000000680fb80 c000000006c537e0 [ 96.888586] NIP [c000000000995aec] __pci_read_msi_msg+0x48/0x278 [ 96.888592] LR [c00000000010ec20] pseries_msi_compose_msg+0x28/0x3c [ 96.888599] Call Trace: [ 96.888600] [c00000000680fad0] [000000000000001d] 0x1d (unreliable) [ 96.888608] [c00000000680fb20] [c00000006545b820] 0xc00000006545b820 [ 96.888613] [c00000000680fb40] [c00000000023b41c] irq_chip_compose_msi_msg+0x5c/0x90 [ 96.888620] [c00000000680fb60] [c000000000242aec] msi_domain_set_affinity+0xb8/0xf4 [ 96.888627] [c00000000680fbb0] [c000000000234634] irq_do_set_affinity+0x14c/0x25c [ 96.888633] [c00000000680fc10] [c000000000234870] irq_set_affinity_locked+0x12c/0x1c4 [ 96.888639] [c00000000680fc60] [c000000000234a84] irq_set_affinity+0x64/0xa0 [ 96.888644] [c00000000680fca0] [c0000000000c9d40] xics_migrate_irqs_away+0x27c/0x30c [ 96.888650] [c00000000680fd60] [c000000000111834] pseries_cpu_disable+0xc8/0xf0 [ 96.888657] [c00000000680fd90] [c0000000000611e0] __cpu_disable+0x54/0xb0 [ 96.888662] [c00000000680fdc0] [c0000000001715e8] take_cpu_down+0x4c/0xcc [ 96.888669] [c00000000680fe10] [c0000000002ebbc4] multi_cpu_stop+0xd8/0x22c [ 96.888676] [c00000000680fe80] [c0000000002eb898] cpu_stopper_thread+0x158/0x24c [ 96.888683] [c00000000680ff30] [c0000000001b7a0c] smpboot_thread_fn+0x1ec/0x25c [ 96.888691] [c00000000680ff90] [c0000000001acd04] kthread+0x12c/0x14c [ 96.888697] [c00000000680ffe0] [c00000000000df98] start_kernel_thread+0x14/0x18 [ 96.888703] Code: fba1ffe8 39200001 f821ffb1 7c7f1b78 7c9e2378 e94d0c78 f9410028 39400000 eba30008 815dffd8 2c0a0000 7d20489e <0b090000> a123004c 712a0001 41820168 [ 96.888730] ---[ end trace 0000000000000000 ]---

With this patch: No System crash observed. But below warnings were observed.

[ 99.450644] ------------[ cut here ]------------ [ 99.450648] WARNING: CPU: 0 PID: 17 at arch/powerpc/sysdev/xics/icp-hv.c:55 icp_hv_eoi+0xc4/0x120 [ 99.450659] Modules linked in: nft_compat nf_tables nfnetlink bonding tls rfkill binfmt_misc kmem device_dax pseries_rng vmx_crypto dax_pmem drm drm_panel_orientation_quirks xfs dm_service_time sd_mod sg nd_pmem ibmvfc nd_btt ibmvscsi scsi_transport_fc ibmveth scsi_transport_srp papr_scm libnvdimm tg3 dm_multipath dm_mirror dm_region_hash dm_log dm_mod fuse [ 99.450704] CPU: 0 UID: 0 PID: 17 Comm: ksoftirqd/0 Kdump: loaded Not tainted 6.14.0-auto-00001-g03419579f433 #4 [ 99.450712] Hardware name: IBM,9009-42A POWER9 (architected) 0x4e0202 0xf000005 of:IBM,FW950.A0 (VL950_141) hv:phyp pSeries [ 99.450717] NIP: c0000000000cadd4 LR: c0000000000cadd0 CTR: 00000000007088ec [ 99.450722] REGS: c000000004a2fa20 TRAP: 0700 Not tainted (6.14.0-auto-00001-g03419579f433) [ 99.450727] MSR: 800000000282b033 <SF,VEC,VSX,EE,FP,ME,IR,DR,RI,LE> CR: 2804424f XER: 00000010 [ 99.450743] CFAR: c000000000224da8 IRQMASK: 1 GPR00: c0000000000cadd0 c000000004a2fcc0 c000000001668100 000000000000003f GPR04: c0000007fd907c88 c0000007fd916000 c000000004a2fb08 00000007fb6a0000 GPR08: 0000000000000027 0000000000000000 0000000000000000 0000000000000001 GPR12: c000000002a37d48 c000000003000000 c0000000001acc60 c000000004052080 GPR16: 0000000000000006 0000000000000040 0000000000000006 0000000000000100 GPR20: 0000000004208040 0000000000000000 0000000000000001 c0000000002382c0 GPR24: 0000000000000001 0000000000000000 0000000000000006 0000000000000002 GPR28: c0000007fd9078b8 0000000000000000 c0000000010e69e8 00000000050a0002 [ 99.450802] NIP [c0000000000cadd4] icp_hv_eoi+0xc4/0x120 [ 99.450808] LR [c0000000000cadd0] icp_hv_eoi+0xc0/0x120 [ 99.450814] Call Trace: [ 99.450816] [c000000004a2fcc0] [c0000000000cadd0] icp_hv_eoi+0xc0/0x120 (unreliable) [ 99.450824] [c000000004a2fd30] [c000000000239eac] handle_fasteoi_irq+0x16c/0x344 [ 99.450832] [c000000004a2fd70] [c000000000238380] resend_irqs+0xc0/0x188 [ 99.450838] [c000000004a2fdb0] [c00000000017b054] tasklet_action_common+0x154/0x418 [ 99.450845] [c000000004a2fe20] [c00000000017a788] handle_softirqs+0x148/0x3b4 [ 99.450852] [c000000004a2ff10] [c00000000017aa58] run_ksoftirqd+0x64/0xa0 [ 99.450858] [c000000004a2ff30] [c0000000001b7a8c] smpboot_thread_fn+0x1ec/0x25c [ 99.450866] [c000000004a2ff90] [c0000000001acd84] kthread+0x12c/0x14c [ 99.450873] [c000000004a2ffe0] [c00000000000df98] start_kernel_thread+0x14/0x18 [ 99.450879] Code: ebe1fff8 7c0803a6 4e800020 3c82ffa8 3c62ffdc 7fc5f378 3fc2ffa8 3884e908 38632268 3bdee8e8 48159f95 60000000 <0fe00000> 7bff4622 38600068 7fe4fb78 [ 99.450900] ---[ end trace 0000000000000000 ]---

Please add below tag:

Tested-by: Venkat Rao Bagalkote <venkat88@xxxxxxxxxxxxx>

Regards,

Venkat.

LGTM. Hence
Reviewed-by: Vaibhav Jain <vaibhav@xxxxxxxxxxxxx>