[PATCH 0/9] Add support for SVM atomics in Nouveau

From: Alistair Popple
Date: Mon Feb 08 2021 - 20:10:21 EST


This series adds support to Nouveau for atomic memory operations on OpenCL
shared virtual memory (SVM). This is achieved using the atomic PTE bits on
the GPU to only permit atomic operations to system memory when a page is
not mapped in userspace on the CPU.

This is implemented by adding a mode to migrate_vma_pages() which unmaps
and isolates existing pages from the CPU and pins them. The original
userspace page table entries are migrated to point to device private pages
allocated by the driver. This allows the driver to enable GPU atomic access
to the page as it will receive a callback when CPU userspace needs to
access it.

In response to this callback the driver revokes the atomic access
permission from the GPU and migrates entries to point back to the original
page. The original page is unpinned as part of the migration operation
which also returns it to the LRU.

Patch 3 contains the bulk of the memory management changes to implement
unmap and pin.

Patches 6-9 extend Nouveau to use the new mode to allow system wide atomics
for OpenCL SVM to be implemented on Nouveau.

This has been tested using the latest upstream Mesa userspace with a simple
OpenCL test program which checks the results of atomic GPU operations on a
buffer whilst also writing to the same buffer from the CPU.

Problems yet to be addressed:

Recent changes to pin_user_pages() prevent the creation of pinned pages in
ZONE_MOVABLE. This series allows pinned pages to be created in ZONE_MOVABLE
as attempts to migrate may fail which would be fatal to userspace.

In this case migration of the pinned page is unnecessary as the page can be
unpinned at anytime by having the driver revoke atomic permission as it
does for the migrate_to_ram() callback. However a method of calling this
when memory needs to be moved has yet to be resolved so any discussion is
welcome.

Alistair Popple (9):
mm/migrate.c: Always allow device private pages to migrate
mm/migrate.c: Allow pfn flags to be passed to migrate_vma_setup()
mm/migrate: Add a unmap and pin migration mode
Documentation: Add unmap and pin to HMM
hmm-tests: Add test for unmap and pin
nouveau/dmem: Only map migrating pages
nouveau/svm: Refactor nouveau_range_fault
nouveau/dmem: Add support for multiple page types
nouveau/svm: Implement atomic SVM access

Documentation/vm/hmm.rst | 22 +-
arch/powerpc/kvm/book3s_hv_uvmem.c | 4 +-
drivers/gpu/drm/nouveau/include/nvif/if000c.h | 1 +
drivers/gpu/drm/nouveau/nouveau_dmem.c | 190 +++++++++++++++---
drivers/gpu/drm/nouveau/nouveau_dmem.h | 9 +
drivers/gpu/drm/nouveau/nouveau_svm.c | 148 +++++++++++---
drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmm.h | 1 +
.../drm/nouveau/nvkm/subdev/mmu/vmmgp100.c | 6 +
include/linux/migrate.h | 2 +
include/linux/migrate_mode.h | 1 +
lib/test_hmm.c | 109 ++++++++--
lib/test_hmm_uapi.h | 1 +
mm/migrate.c | 82 +++++---
tools/testing/selftests/vm/hmm-tests.c | 49 +++++
14 files changed, 524 insertions(+), 101 deletions(-)

--
2.20.1