Re: [PATCH v4 2/4] PCI/sysfs: Fix null pointer dereference during hotplug
From: duziming
Date: Thu Feb 26 2026 - 21:31:01 EST
On 2026/2/27 1:14, Bjorn Helgaas wrote:
On Fri, Jan 16, 2026 at 04:17:19PM +0800, Ziming Du wrote:
When VFs are created while the bus is being rescanned, the
resource files for the same pci_dev may be created twice.
Where are the two resource file creations? This will help review the
patch.
The process of creating VFs:
  sriov_numvfs_store
  hinic_pci_sriov_configure
  hinic_pci_sriov_enable
  pci_enable_sriov
  sriov_enable
  sriov_add_vfs
  pci_iov_add_virtfn
  pci_bus_add_device
  pci_create_sysfs_dev_files
  pci_create_resource_files
The process of rescanning VFs:
  rescan_store
  pci_rescan_bus
  pci_bus_add_devices
  pci_bus_add_device
  pci_create_sysfs_dev_files
  pci_create_resource_files
The second creation attempt fails, and the res_attr in pci_dev is freed with
kfree(), but the pointer is not set to NULL. This later leads to a NULL
pointer dereference through the stale pointer when the device is removed.
When we perform the following operation:
echo $sriov_totalvfs > /sys/class/net/"$pfname"/device/sriov_numvfs &
I think it would be more informative to include an actual sample here.
We can easily substitute the device names and numbers, given a
concrete example. It's a little bit harder to intuit what $pfname and
$sriov_totalvfs should be. E.g.,
$ cat /sys/bus/pci/devices/0000:02:00.0/sriov_totalvfs
128
$ echo 128 > /sys/bus/pci/devices/0000:02:00.0/sriov_numvfs &
Unless it's important to use /sys/class/net/..., use
/sys/bus/pci/devices/... both places to make it simpler.
sleep 0.5
echo 1 > /sys/bus/pci/rescan
pci_remove "$pfname"
These look like shell commands ... but what is "pci_remove"? I guess it
must be an echo into /sys/bus/pci/devices/.../remove; expanding it here
would be better.
Yes, you are right. Based on your comments, I have revised it as follows:
$ cat /sys/bus/pci/devices/0000:02:00.0/sriov_totalvfs
128
$ echo 128 > /sys/bus/pci/devices/0000:02:00.0/sriov_numvfs &
$ sleep 0.5
$ echo 1 > /sys/bus/pci/rescan
$ echo 1 > /sys/bus/pci/devices/0000:02:00.0/remove
The system will crash as follows:
Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000
Call trace:
__pi_strlen+0x14/0x150
kernfs_find_ns+0x54/0x120
kernfs_remove_by_name_ns+0x58/0xf0
sysfs_remove_bin_file+0x24/0x38
pci_remove_resource_files+0x44/0x90
pci_remove_sysfs_dev_files+0x28/0x40
pci_stop_bus_device+0xb8/0x118
pci_stop_and_remove_bus_device+0x20/0x40
pci_iov_remove_virtfn+0xb8/0x138
sriov_disable+0xbc/0x190
pci_disable_sriov+0x30/0x48
hinic_pci_sriov_disable+0x54/0x138 [hinic]
hinic_remove+0x140/0x290 [hinic]
pci_device_remove+0x4c/0xf8
device_remove+0x54/0x90
device_release_driver_internal+0x1d4/0x238
device_release_driver+0x20/0x38
pci_stop_bus_device+0xa8/0x118
pci_stop_and_remove_bus_device_locked+0x28/0x50
remove_store+0x128/0x208
Fix this by setting the pointer to NULL immediately after releasing 'res_attr'.
This *sounds* like it would still be racy unless there's a lock around
this. If there is a lock, please mention what it is and where it's
held.
The rescan process takes the lock via pci_lock_rescan_remove(), while the
VF creation process holds device_lock(&pdev->dev).
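To make the point about the two locks concrete: a path holding one lock is not kept out by a path holding a different one. The sketch below is only a userspace analogy using POSIX mutexes, not the kernel primitives; model_rescan_remove_lock, model_device_lock and model_can_enter() are hypothetical names introduced for illustration.

```c
#include <pthread.h>

/* Hypothetical stand-ins for the two locks named in the reply. */
static pthread_mutex_t model_rescan_remove_lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_mutex_t model_device_lock = PTHREAD_MUTEX_INITIALIZER;

/*
 * While 'held' is locked, try to take 'wanted' without blocking.
 * Returns 1 if 'wanted' could be acquired (no exclusion between the
 * two), 0 if it was busy (the two paths would be serialized).
 */
static int model_can_enter(pthread_mutex_t *held, pthread_mutex_t *wanted)
{
	int entered;

	pthread_mutex_lock(held);
	entered = (pthread_mutex_trylock(wanted) == 0);
	if (entered)
		pthread_mutex_unlock(wanted);
	pthread_mutex_unlock(held);

	return entered;
}
```

In this model, model_can_enter(&model_rescan_remove_lock, &model_device_lock) returns 1, whereas passing the same mutex for both arguments returns 0. Only a lock shared by both paths would serialize them.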
Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
Signed-off-by: Ziming Du <duziming2@xxxxxxxxxx>
---
drivers/pci/pci-sysfs.c | 2 ++
1 file changed, 2 insertions(+)
diff --git a/drivers/pci/pci-sysfs.c b/drivers/pci/pci-sysfs.c
index 18e5d4603b472..fbcbf39232732 100644
--- a/drivers/pci/pci-sysfs.c
+++ b/drivers/pci/pci-sysfs.c
@@ -1227,12 +1227,14 @@ static void pci_remove_resource_files(struct pci_dev *pdev)
 		if (res_attr) {
 			sysfs_remove_bin_file(&pdev->dev.kobj, res_attr);
 			kfree(res_attr);
+			pdev->res_attr[i] = NULL;
 		}
 
 		res_attr = pdev->res_attr_wc[i];
 		if (res_attr) {
 			sysfs_remove_bin_file(&pdev->dev.kobj, res_attr);
 			kfree(res_attr);
+			pdev->res_attr_wc[i] = NULL;
 		}
 	}
 }
--
2.43.0
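
The two added lines follow the usual free-then-clear idiom. Below is a minimal userspace sketch of that idiom, assuming nothing about the kernel beyond what the diff shows. model_dev, model_attr, model_create_attr() and model_remove_attrs() are hypothetical names, and calloc()/free() stand in for the kernel allocator.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

#define MODEL_NUM_BARS 6	/* hypothetical stand-in for the BAR count */

/* Hypothetical stand-in for a resource attribute; only the name matters. */
struct model_attr {
	char name[16];
};

/* Hypothetical stand-in for the res_attr[] array kept per device. */
struct model_dev {
	struct model_attr *res_attr[MODEL_NUM_BARS];
};

/* Allocate the attribute for slot i. Returns 0 on success, -1 on failure. */
static int model_create_attr(struct model_dev *dev, int i)
{
	struct model_attr *attr;

	assert(i >= 0 && i < MODEL_NUM_BARS);

	attr = calloc(1, sizeof(*attr));
	if (!attr)
		return -1;

	snprintf(attr->name, sizeof(attr->name), "resource%d", i);
	dev->res_attr[i] = attr;
	return 0;
}

/*
 * Same shape as the patched loop: free each populated slot, then clear
 * it so that a later pass skips it. Returns the number of slots freed.
 */
static int model_remove_attrs(struct model_dev *dev)
{
	int freed = 0;

	for (int i = 0; i < MODEL_NUM_BARS; i++) {
		struct model_attr *attr = dev->res_attr[i];

		if (attr) {
			free(attr);
			dev->res_attr[i] = NULL;	/* the step the patch adds */
			freed++;
		}
	}

	return freed;
}
```

With the slot cleared, a second call to model_remove_attrs() on the same device finds nothing to free and returns 0. Without that assignment, the second call would see the stale pointers and free them again.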