Re: [PATCH -v2 1/9] rtmutex: Deboost before waking up the top waiter

From: Peter Zijlstra
Date: Mon Sep 26 2016 - 11:22:43 EST


On Mon, Sep 26, 2016 at 11:15:11AM -0400, Steven Rostedt wrote:
> On Mon, 26 Sep 2016 14:32:14 +0200
> Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
> >
> > --- a/kernel/futex.c
> > +++ b/kernel/futex.c
> > @@ -1374,9 +1374,8 @@ static int wake_futex_pi(u32 __user *uad
> > * scheduled away before the wake up can take place.
> > */
> > spin_unlock(&hb->lock);
> > - wake_up_q(&wake_q);
> > - if (deboost)
> > - rt_mutex_adjust_prio(current);
> > +
> > + rt_mutex_postunlock(&wake_q, deboost);
>
> Hmm...
>
> >
> > return 0;
> > }
> > --- a/kernel/locking/rtmutex.c
> > +++ b/kernel/locking/rtmutex.c
> > @@ -1325,6 +1308,16 @@ static bool __sched rt_mutex_slowunlock(
> > */
> > mark_wakeup_next_waiter(wake_q, lock);
> >
> > + /*
> > + * We should deboost before waking the top waiter task such that
> > + * we don't run two tasks with the 'same' priority. This however
> > + * can lead to prio-inversion if we would get preempted after
> > + * the deboost but before waking our high-prio task, hence the
> > + * preempt_disable before unlock. Pairs with preempt_enable() in
> > + * rt_mutex_postunlock();
>
> There's a preempt_enable() in rt_mutex_postunlock()? Does
> wake_futex_pi() know that?
>

Not sure I see your point. rt_mutex_futex_unlock() calls
rt_mutex_slowunlock() which does the preempt_disable(), we then pass the
return of that into deboost, which we pass into rt_mutex_postunlock()
and everything should be balanced.