[PATCH v3 0/4] Address error and recovery for AER and DPC

From: Oza Pawandeep
Date: Mon Jan 08 2018 - 02:55:27 EST


This patch set brings in error handling support for DPC

The current implementation of AER and error message broadcasting to the
EP driver is tightly coupled and limited to AER service driver.
It is important to factor out broadcasting and other link handling
callbacks. So that not only when AER gets triggered, but also when DPC get
triggered (for e.g. ERR_FATAL), callbacks are handled appropriately.

DPC should enumerate the devices after recovering the link, which is
achieved by implementing error_resume callback.

Changes since v2:
Based on feedback from Keith:

"
When DPC is triggered due to receipt of an uncorrectable error Message,
the Requester ID from the Message is recorded in the DPC Error
Source ID register and that Message is discarded and not forwarded Upstream.
"

Removed the patch where AER checks if DPC service is active
Changes since v1:
Kbuild errors fixed:
> pci_find_dpc_dev made static
> ras_event.h updated
> pci_find_aer_service call with CONFIG check
> pci_find_dpc_service call with CONFIG check

Oza Pawandeep (4):
PCI/AER: factor out error reporting from AER
PCI/DPC/AER: Address Concurrency between AER and DPC
PCI/ERR: Do not do recovery if DPC service is active
PCI/DPC: Enumerate the devices after DPC trigger event

drivers/acpi/apei/ghes.c | 2 +-
drivers/pci/pcie/Makefile | 2 +-
drivers/pci/pcie/aer/aerdrv.h | 30 ---
drivers/pci/pcie/aer/aerdrv_core.c | 306 +------------------------
drivers/pci/pcie/aer/aerdrv_errprint.c | 27 ++-
drivers/pci/pcie/pcie-dpc.c | 127 ++++++++++-
drivers/pci/pcie/pcie-err.c | 399 +++++++++++++++++++++++++++++++++
drivers/pci/pcie/portdrv.h | 2 +
include/linux/aer.h | 4 -
include/linux/pci.h | 23 ++
include/ras/ras_event.h | 6 +-
11 files changed, 579 insertions(+), 349 deletions(-)
create mode 100644 drivers/pci/pcie/pcie-err.c

--
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.,
a Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.