Re: [PATCH 0/1] Fiji GPU audio register timeout when in BACO state
From: Takashi Iwai
Date: Thu Apr 30 2020 - 13:01:14 EST
On Thu, 30 Apr 2020 18:52:20 +0200,
Nicholas Johnson wrote:
>
> On Thu, Apr 30, 2020 at 05:14:56PM +0200, Takashi Iwai wrote:
> > On Wed, 29 Apr 2020 18:19:57 +0200,
> > Alex Deucher wrote:
> > >
> > > On Wed, Apr 29, 2020 at 12:05 PM Takashi Iwai <tiwai@xxxxxxx> wrote:
> > > > Well, but the code path there is the runtime PM resume of the audio
> > > > device and it means that GPU must have been runtime-resumed again
> > > > beforehand via the device link. So, it should have worked from the
> > > > beginning but in reality not -- that is, apparently some inconsistency
> > > > is found in the initial attempt of the runtime resume...
> > >
> > > Yeah, it should be covered, but I wonder if there is something in the
> > > ELD update sequence that needs to call pm_runtime_get_sync()? The ELD
> > > sequence on AMD GPUs doesn't work the same as on other vendors. The
> > > GPU driver has a backdoor into the HDA device's verbs to set update
> > > the audio state rather than doing it via an ELD buffer update. We
> > > still update the ELD buffer for consistency. Maybe when the GPU
> > > driver sets the audio state at monitor detection time that triggers an
> > > interrupt or something on the HDA side which races with the CPU and
> > > the power down of the GPU. That still seems unlikely though since the
> > > runtime pm on the GPU side defaults to a 5 second suspend timer.
> >
> > I'm not sure whether it's the race between runtime suspend of GPU vs
> > runtime resume of audio. My wild guess is rather that it's the timing
> > GPU notifies to the audio; then the audio driver notifies to
> > user-space and user-space opens the stream, which in turn invokes the
> > runtime resume of GPU. But in GPU side, it's still under processing,
> > so it proceeds before the GPU finishes its initialization job.
> >
> > Nicholas, could you try the patch below and see whether the problem
> > still appears? The patch artificially delays the notification and ELD
> > update for 300msec. If this works, it means the timing problem.
> The bug still occurred after applying the patch.
>
> But you were absolutely correct - it just needed to be increased to
> 3000ms - then the bug stopped.
Interesting. 3 seconds are too long, but I guess 1 second would work
as well?
In anyway, the success with a long delay means that the sound setup
after the full runtime resume of GPU seems working.
> Now the question is, what do we do now that we know this?
>
> Also, are you still interested in the contents of the ELD# files? I can
> dump them all into a file at some specific moment in time which you
> request, if needed.
Yes, please take the snapshot before plugging, right after plugging
and right after enabling. I'm not sure whether your monitor supports
the audio, and ELD contents should show that, at least.
thanks,
Takashi