[PATCH v2] x86/PCI: Mark Power Control Unit as having non-compliant BARs

From: Xiaochun Lee
Date: Fri May 15 2020 - 06:24:25 EST


From: Xiaochun Lee <lixc17@xxxxxxxxxx>

The device [8086:a26c] is a Power Control Unit of
Intel Ice Lake Server Processor and devices [8086:a1ec,a1ed]
are the Power Control Unit of Intel Xeon Scalable Processor,
kernel treats their pci BARs as a base address register that
leading to a boot failure like:
"pci 0000:00:11.0: [Firmware Bug]: reg 0x30: invalid BAR (can't size)".

The symptoms in Ice Lake processor is:
"QU99 ICE LAKE ES1 HCC 24C 185W 3200 L-0"

The information of the device [8086:a26c] list as below:
00:11.0 Unassigned class [ff00]: Intel Corporation Device a26c (rev 03)
ÂÂÂÂÂÂÂ Subsystem: Lenovo Device 7811
ÂÂÂÂÂÂÂ Flags: fast devsel, NUMA node 0
ÂÂÂÂÂÂÂ Expansion ROM at <ignored> [disabled]
ÂÂÂÂÂÂÂ Capabilities: [80] Power Management version 3

The symptoms in Xeon Scalable Processor is:
"Intel(R) Xeon(R) Gold 5117 CPU @ 2.00GHz"
"Intel(R) Xeon(R) Gold 6252 CPU @ 2.00GHz"

The information of the Device [8086:a1ec] list as below:
00:11.0 Unassigned class [ff00]: Intel Corporation C620 Series Chipset Family MROMÂ0 [8086:a1ec] (rev 09)
ÂÂÂÂÂÂÂ Subsystem: Lenovo Device [17aa:7805]
ÂÂÂÂÂÂÂ Latency: 0, Cache Line Size: 64 bytes
ÂÂÂÂÂÂÂ NUMA node: 0
ÂÂÂÂÂÂÂ Expansion ROM at <ignored> [disabled]
ÂÂÂÂÂÂÂ Capabilities: [80] Power Management version 3

There are no other BARs on this devices, so mark the PCU as having
non-compliant BARs, therefore we don't try to probe any of them.

Signed-off-by: Xiaochun Lee <lixc17@xxxxxxxxxx>
---
arch/x86/pci/fixup.c | 6 ++++++
1 file changed, 6 insertions(+)

diff --git a/arch/x86/pci/fixup.c b/arch/x86/pci/fixup.c
index e723559..d9abc67 100644
--- a/arch/x86/pci/fixup.c
+++ b/arch/x86/pci/fixup.c
@@ -563,6 +563,9 @@ static void twinhead_reserve_killing_zone(struct pci_dev *dev)
* Erratum BDF2
* PCI BARs in the Home Agent Will Return Non-Zero Values During Enumeration
* http://www.intel.com/content/www/us/en/processors/xeon/xeon-e5-v4-spec-update.html
+ *
+ * Device [8086:a26c]
+ * Devices [8086:a1ec,a1ed]
*/
static void pci_invalid_bar(struct pci_dev *dev)
{
@@ -572,6 +575,9 @@ static void pci_invalid_bar(struct pci_dev *dev)
DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0x6f60, pci_invalid_bar);
DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0x6fa0, pci_invalid_bar);
DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0x6fc0, pci_invalid_bar);
+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0xa1ec, pci_invalid_bar);
+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0xa1ed, pci_invalid_bar);
+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0xa26c, pci_invalid_bar);

/*
* Device [1022:7808]
--
1.8.3.1