Re: [PATCH] drm/msm/dpu: Force disabling commits to take non-async path

From: Rob Clark
Date: Thu Jan 09 2025 - 10:44:45 EST


On Wed, Jan 8, 2025 at 7:14 PM Abhinav Kumar <quic_abhinavk@xxxxxxxxxxx> wrote:
>
>
>
> On 1/8/2025 7:04 PM, Rob Clark wrote:
> > On Wed, Jan 8, 2025 at 6:22 PM Abhinav Kumar <quic_abhinavk@xxxxxxxxxxx> wrote:
> >>
> >>
> >>
> >> On 1/8/2025 6:14 PM, Dmitry Baryshkov wrote:
> >>> On Thu, 9 Jan 2025 at 03:45, Rob Clark <robdclark@xxxxxxxxx> wrote:
> >>>>
> >>>> On Wed, Jan 8, 2025 at 2:58 PM Jessica Zhang <quic_jesszhan@xxxxxxxxxxx> wrote:
> >>>>>
> >>>>> Force commit that are disabling a plane in the async_crtc to take the
> >>>>> non-async commit tail path.
> >>>>>
> >>>>> In cases where there are two consecutive async cursor updates (one
> >>>>> regular non-NULL update followed by a disabling NULL FB update), it is
> >>>>> possible for the second NULL update to not be queued (due to the
> >>>>> pending_crtc_mask check) or otherwise not be run before the cursor FB is
> >>>>> deallocated by drm_atomic_helper_cleanup_planes(). This would cause a
> >>>>> context fault since the hardware would try to fetch the old plane state
> >>>>> with the stale FB address.
> >>>>>
> >>>>> Avoid this issue by forcing cursor updates that will disable the cursor
> >>>>> plane to be blocking commits. This will ensure that hardware clears and
> >>>>> stops fetching the FB source address before the driver deallocates the FB
> >>>>>
> >>>>> Fixes: 2d99ced787e3 ("drm/msm: async commit support")
> >>>>> Signed-off-by: Jessica Zhang <quic_jesszhan@xxxxxxxxxxx>
> >>>>> ---
> >>>>> drivers/gpu/drm/msm/msm_atomic.c | 13 +++++++++++++
> >>>>> 1 file changed, 13 insertions(+)
> >>>>>
> >>>>> diff --git a/drivers/gpu/drm/msm/msm_atomic.c b/drivers/gpu/drm/msm/msm_atomic.c
> >>>>> index 9c45d641b5212c11078ab38c13a519663d85e10a..ddc74c68148c643d34ca631dd28d4cdc2b8c7dc0 100644
> >>>>> --- a/drivers/gpu/drm/msm/msm_atomic.c
> >>>>> +++ b/drivers/gpu/drm/msm/msm_atomic.c
> >>>>> @@ -142,6 +142,7 @@ static bool can_do_async(struct drm_atomic_state *state,
> >>>>> struct drm_connector_state *connector_state;
> >>>>> struct drm_connector *connector;
> >>>>> struct drm_crtc_state *crtc_state;
> >>>>> + struct drm_plane_state *plane_state;
> >>>>> struct drm_crtc *crtc;
> >>>>> int i, num_crtcs = 0;
> >>>>>
> >>>>> @@ -162,6 +163,18 @@ static bool can_do_async(struct drm_atomic_state *state,
> >>>>> *async_crtc = crtc;
> >>>>> }
> >>>>>
> >>>>> + /*
> >>>>> + * Force a blocking commit if the cursor is being disabled. This is to
> >>>>> + * ensure that the registers are cleared and hardware doesn't try to
> >>>>> + * fetch from a stale address.
> >>>>> + */
> >>>>> + if (*async_crtc) {
> >>>>> + plane_state = drm_atomic_get_new_plane_state(state,
> >>>>> + (*async_crtc)->cursor);
> >>>>> + if (plane_state && !plane_state->fb)
> >>>>> + return false;
> >>>>
> >>>> hmm, I suppose we want the same even if the fb changes? Or
> >>>> alternatively somewhere hold an extra ref to the backing obj until hw
> >>>> has finished scanout?
> >>>
> >>
> >> Hi Rob
> >>
> >> Do you mean we need to also check if old_plane_state->fb !=
> >> new_plane_state->fb, then use blocking commit?
> >
> > yeah, basically.. if we release any outgoing fb the backing bo could
> > be potentially freed+unmapped leading to the same problem.
> >
>
> Yeah true, this case also we can hit this. Will add it and check.
>
> > idk if this more conservative approach would cause fps issues..
> > holding an extra ref would avoid potential issues, but offhand I'm not
> > sure if it would be a perf problem in practice. Maybe with animated
> > cursors?
> >
>
> hmmm.... we did not see any significant lags or drops when we tested
> this (that was also our major concern to make sure we dont)

It's tricky because it is very much at the mercy of what userspace
does.. and different userspace does different things.

I remember issues back in the gtk2 days with something (gdm?) trying
to update a spinning cursor at 1000fps. When that was clamped at
60fps it would literally take 5min to login!

> If we do have to hold an extra ref, we will have to do something like below:
>
> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
> b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
> index 098abc2c0003..97d9e056038c 100644
> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
> @@ -649,8 +649,10 @@ static int dpu_plane_prepare_fb(struct drm_plane
> *plane,
> struct dpu_kms *kms = _dpu_plane_get_kms(&pdpu->base);
> int ret;
>
> - if (!new_state->fb)
> + if (!new_state->fb) {
> + refcount_inc(&msm_fb->dirtyfb);
> return 0;
> + }
>
> DPU_DEBUG_PLANE(pdpu, "FB[%u]\n", fb->base.id);
>
> @@ -682,8 +684,10 @@ static void dpu_plane_cleanup_fb(struct drm_plane
> *plane,
> struct dpu_plane *pdpu = to_dpu_plane(plane);
> struct dpu_plane_state *old_pstate;
>
> - if (!old_state || !old_state->fb)
> + if (!old_state || !old_state->fb) {
> + refcount_dec(&msm_fb->dirtyfb);
> return;
> + }
>
> old_pstate = to_dpu_plane_state(old_state);
>
> I dont know if this is clean though. WDYT?

The dirtyfb ref is not what you want. It is tracking if any attached
display changes need dirtyfb to flush changes to the panel. See [1]

The other issue is that cleanup_fb will be called when drm core
_thinks_ the atomic commit is completed, not when it _actually_ has.

Probably the dpu_plane should hold a ref of the scanout bo until
vblank, rather than rely on the plane state holding a ref to the fb
(which holds a ref to the bo)? drm_flip_work might be helpful for
this sort of thing.

BR,
-R

[1] https://lore.kernel.org/all/20220223191118.881321-1-robdclark@xxxxxxxxx/

>
>
> > BR,
> > -R
> >
> >> We can try that out.
> >>
> >> holding extra ref gets tricky IMO. In this way, the calls are balanced
> >> in places we know.
> >>
> >>> I think a more correct approach would be to run a worker, waiting for
> >>> the commit to happen and then freeing the FBs.
> >>>
> >>
> >> Hi Dmitry
> >>
> >> This option was tried . It gets very messy to handle it this way. Then
> >> we realized that, the worker is going to try to do the same thing a
> >> blocking commit does which is to wait for hw to finish scanout and then
> >> cleanup planes. Hence this was preferred and is better IMO.
> >>
> >>>>
> >>>> BR,
> >>>> -R
> >>>>
> >>>>> + }
> >>>>> +
> >>>>> return true;
> >>>>> }
> >>>>>
> >>>>>
> >>>>> ---
> >>>>> base-commit: 866e43b945bf98f8e807dfa45eca92f931f3a032
> >>>>> change-id: 20250108-async-disable-fix-cc1b9a1d5b19
> >>>>>
> >>>>> Best regards,
> >>>>> --
> >>>>> Jessica Zhang <quic_jesszhan@xxxxxxxxxxx>
> >>>>>
> >>>
> >>>
> >>>