[PATCH v6 0/5] x86/edac/amd64: Add heterogeneous node support

From: Naveen Krishna Chatradhi
Date: Thu Oct 28 2021 - 09:01:38 EST


On newer heterogeneous systems with AMD CPUs the data fabrics of GPUs
can be connected directly via custom links.

This series of patchset does the following
1. amd_nb.c:
a. Add support for northbridges on Aldebaran GPU nodes
b. export AMD node map details to be used by edac and mce modules

2. mce_amd module:
a. Identify the node ID where the error occurred and map the node
id to linux enumerated node id.

3. amd64_edac module
a. Add new family op routines
b. Enumerate UMCs and HBMs on the GPU nodes
c. Move fam_type structure into amd64_pvt struct

This patchset is rebased on top of
"
commit 07416cadfdfa38283b840e700427ae3782c76f6b
Author: Yazen Ghannam <yazen.ghannam@xxxxxxx>
Date: Tue Oct 5 15:44:19 2021 +0000

EDAC/amd64: Handle three rank interleaving mode
"

Muralidhara M K (3):
x86/amd_nb: Add support for northbridges on Aldebaran
EDAC/amd64: Extend family ops functions
EDAC/amd64: Move struct fam_type into amd64_pvt structure

Naveen Krishna Chatradhi (2):
EDAC/mce_amd: Extract node id from MCA_IPID
EDAC/amd64: Enumerate memory on Aldebaran GPU nodes

arch/x86/include/asm/amd_nb.h | 9 +
arch/x86/kernel/amd_nb.c | 146 ++++++--
drivers/edac/amd64_edac.c | 654 ++++++++++++++++++++++++----------
drivers/edac/amd64_edac.h | 39 +-
drivers/edac/mce_amd.c | 24 +-
include/linux/pci_ids.h | 1 +
6 files changed, 663 insertions(+), 210 deletions(-)

--
2.25.1