[PATCH V3 00/11] AMD XDNA driver

From: Lizhi Hou
Date: Wed Sep 11 2024 - 14:06:40 EST


This patchset introduces a new Linux Kernel Driver, amdxdna for AMD NPUs.
The driver is based on Linux accel subsystem.

NPU (Neural Processing Unit) is an AI inference accelerator integrated
into AMD client CPUs. NPU enables efficient execution of Machine Learning
applications like CNNs, LLMs, etc. NPU is based on AMD XDNA
architecture [1].

AMD NPU consists of the following components:

- Tiled array of AMD AI Engine processors.
- Micro Controller which runs the NPU Firmware responsible for
command processing, AIE array configuration, and execution management.
- PCI EP for host control of the NPU device.
- Interconnect for connecting the NPU components together.
- SRAM for use by the NPU Firmware.
- Address translation hardware for protected host memory access by the
NPU.

NPU supports multiple concurrent fully isolated contexts. Concurrent
contexts may be bound to AI Engine array spatially and or temporarily.

The driver is licensed under GPL-2.0 except for UAPI header which is
licensed GPL-2.0 WITH Linux-syscall-note.

User mode driver stack consists of XRT [2] and AMD AIE Plugin for IREE [3].

The firmware for the NPU is distributed as a closed source binary, and has
already been pushed to the DRM firmware repository [4].

[1] https://www.amd.com/en/technologies/xdna.html
[2] https://github.com/Xilinx/XRT
[3] https://github.com/nod-ai/iree-amd-aie
[4] https://gitlab.freedesktop.org/drm/firmware/-/tree/amd-ipu-staging/amdnpu

Changes since v2:
- Add document amdnpu.rst
- Change AIE2_DEVM_SIZE to 64M due to firmware change
- Changes based on code review comments

Changes since v1:
- Remove some inline defines
- Minor changes based on code review comments

Lizhi Hou (11):
accel/amdxdna: Add documentation for AMD NPU accelerator driver
accel/amdxdna: Add a new driver for AMD AI Engine
accel/amdxdna: Support hardware mailbox
accel/amdxdna: Add hardware resource solver
accel/amdxdna: Add hardware context
accel/amdxdna: Add GEM buffer object management
accel/amdxdna: Add command execution
accel/amdxdna: Add suspend and resume
accel/amdxdna: Add error handling
accel/amdxdna: Add query functions
accel/amdxdna: Add firmware debug buffer support

Documentation/accel/amdxdna/amdnpu.rst | 283 ++++++
Documentation/accel/amdxdna/index.rst | 11 +
Documentation/accel/index.rst | 1 +
MAINTAINERS | 10 +
drivers/accel/Kconfig | 1 +
drivers/accel/Makefile | 1 +
drivers/accel/amdxdna/Kconfig | 15 +
drivers/accel/amdxdna/Makefile | 21 +
drivers/accel/amdxdna/TODO | 5 +
drivers/accel/amdxdna/aie2_ctx.c | 960 ++++++++++++++++++
drivers/accel/amdxdna/aie2_error.c | 357 +++++++
drivers/accel/amdxdna/aie2_message.c | 790 ++++++++++++++
drivers/accel/amdxdna/aie2_msg_priv.h | 370 +++++++
drivers/accel/amdxdna/aie2_pci.c | 766 ++++++++++++++
drivers/accel/amdxdna/aie2_pci.h | 251 +++++
drivers/accel/amdxdna/aie2_psp.c | 146 +++
drivers/accel/amdxdna/aie2_smu.c | 121 +++
drivers/accel/amdxdna/aie2_solver.c | 330 ++++++
drivers/accel/amdxdna/aie2_solver.h | 154 +++
drivers/accel/amdxdna/amdxdna_ctx.c | 608 +++++++++++
drivers/accel/amdxdna/amdxdna_ctx.h | 161 +++
drivers/accel/amdxdna/amdxdna_gem.c | 709 +++++++++++++
drivers/accel/amdxdna/amdxdna_gem.h | 69 ++
drivers/accel/amdxdna/amdxdna_mailbox.c | 575 +++++++++++
drivers/accel/amdxdna/amdxdna_mailbox.h | 124 +++
.../accel/amdxdna/amdxdna_mailbox_helper.c | 61 ++
.../accel/amdxdna/amdxdna_mailbox_helper.h | 42 +
drivers/accel/amdxdna/amdxdna_pci_drv.c | 402 ++++++++
drivers/accel/amdxdna/amdxdna_pci_drv.h | 122 +++
drivers/accel/amdxdna/amdxdna_sysfs.c | 67 ++
drivers/accel/amdxdna/npu1_regs.c | 101 ++
drivers/accel/amdxdna/npu2_regs.c | 118 +++
drivers/accel/amdxdna/npu4_regs.c | 118 +++
drivers/accel/amdxdna/npu5_regs.c | 118 +++
include/trace/events/amdxdna.h | 101 ++
include/uapi/drm/amdxdna_accel.h | 461 +++++++++
36 files changed, 8550 insertions(+)
create mode 100644 Documentation/accel/amdxdna/amdnpu.rst
create mode 100644 Documentation/accel/amdxdna/index.rst
create mode 100644 drivers/accel/amdxdna/Kconfig
create mode 100644 drivers/accel/amdxdna/Makefile
create mode 100644 drivers/accel/amdxdna/TODO
create mode 100644 drivers/accel/amdxdna/aie2_ctx.c
create mode 100644 drivers/accel/amdxdna/aie2_error.c
create mode 100644 drivers/accel/amdxdna/aie2_message.c
create mode 100644 drivers/accel/amdxdna/aie2_msg_priv.h
create mode 100644 drivers/accel/amdxdna/aie2_pci.c
create mode 100644 drivers/accel/amdxdna/aie2_pci.h
create mode 100644 drivers/accel/amdxdna/aie2_psp.c
create mode 100644 drivers/accel/amdxdna/aie2_smu.c
create mode 100644 drivers/accel/amdxdna/aie2_solver.c
create mode 100644 drivers/accel/amdxdna/aie2_solver.h
create mode 100644 drivers/accel/amdxdna/amdxdna_ctx.c
create mode 100644 drivers/accel/amdxdna/amdxdna_ctx.h
create mode 100644 drivers/accel/amdxdna/amdxdna_gem.c
create mode 100644 drivers/accel/amdxdna/amdxdna_gem.h
create mode 100644 drivers/accel/amdxdna/amdxdna_mailbox.c
create mode 100644 drivers/accel/amdxdna/amdxdna_mailbox.h
create mode 100644 drivers/accel/amdxdna/amdxdna_mailbox_helper.c
create mode 100644 drivers/accel/amdxdna/amdxdna_mailbox_helper.h
create mode 100644 drivers/accel/amdxdna/amdxdna_pci_drv.c
create mode 100644 drivers/accel/amdxdna/amdxdna_pci_drv.h
create mode 100644 drivers/accel/amdxdna/amdxdna_sysfs.c
create mode 100644 drivers/accel/amdxdna/npu1_regs.c
create mode 100644 drivers/accel/amdxdna/npu2_regs.c
create mode 100644 drivers/accel/amdxdna/npu4_regs.c
create mode 100644 drivers/accel/amdxdna/npu5_regs.c
create mode 100644 include/trace/events/amdxdna.h
create mode 100644 include/uapi/drm/amdxdna_accel.h

--
2.34.1