[PATCH 0/2] drm/msm: Add tracking for faults associated with an address space

From: Rob Clark
Date: Thu Jan 27 2022 - 12:10:43 EST


From: Rob Clark <robdclark@xxxxxxxxxxxx>

Currently, for GL_EXT_robustness userspace uses the global and per-
submitqueue fault counters to determine GUILTY_CONTEXT_RESET_EXT vs
INNOCENT_CONTEXT_RESET_EXT. But that is a bit overly paranoid, in
that a fault in a different process's context (when it has it's own
isolated address space) should not hurt anything.

This is particularly annoying with CrOS and chrome's exit_on_context_lost quirk,
while running deqp in the android container, as the deqp-egl suite has
tests that intentionally trigger gpu hangs (for the purpose of testing
the robustness extension), which triggers chrome to restart, which
restarts the android container!

This new param gives userspace a way to ignore faults triggered by other
processes.

Applies on top of https://patchwork.freedesktop.org/series/98907/

Rob Clark (2):
drm/msm/gpu: Add ctx to get_param()
drm/msm/gpu: Add param to get address space faults

drivers/gpu/drm/msm/adreno/adreno_gpu.c | 6 +++++-
drivers/gpu/drm/msm/adreno/adreno_gpu.h | 3 ++-
drivers/gpu/drm/msm/msm_drv.c | 4 +++-
drivers/gpu/drm/msm/msm_gem.h | 3 +++
drivers/gpu/drm/msm/msm_gpu.c | 1 +
drivers/gpu/drm/msm/msm_gpu.h | 3 ++-
drivers/gpu/drm/msm/msm_rd.c | 6 ++++--
include/uapi/drm/msm_drm.h | 3 ++-
8 files changed, 22 insertions(+), 7 deletions(-)

--
2.34.1