Re: [PATCH] sched/core: fix pick_next_task 'max' tracking

From: Peter Zijlstra
Date: Tue Aug 24 2021 - 05:04:35 EST


On Mon, Aug 23, 2021 at 04:25:28PM -0400, Vineeth Pillai wrote:
> Hi Peter,
>
>
> > > Here, we should have instead updated 'max' when picking for SMT-1. Note
> > > that this code would eventually have righted itself, since the retry
> > > loop would re-pick p2, and update 'max' accordingly. However, this patch
> > > avoids the extra round-trip.
> >
> > Going with the observation Tao made; how about we rewrite the whole lot
> > to not be mind-bending complicated :-)
> >
> > How's this? It seems to build and pass the core-sched selftest thingy
> > (so it must be perfect, right? :-)
> >
> Nice, the code is much simpler now :-). A minor suggestion down..
>
> > - for_each_cpu(i, smt_mask) {
> > - struct rq *rq_i = cpu_rq(i);
> > -
> > + /*
> > + * For each thread: do the regular task pick and find the max prio task
> > + * amongst them.
> > + *
> > + * Tie-break prio towards the current CPU
> > + */
> > + for_each_cpu_wrap(i, smt_mask, cpu) {
> > + rq_i = cpu_rq(i);
> > rq_i->core_pick = NULL;
> >
> > if (i != cpu)
> > update_rq_clock(rq_i);
> > +
> > + for_each_class(class) {
> > + p = rq_i->core_temp = class->pick_task(rq_i);
> I think we can use core_pick to store the pick here and core_temp
> might not be required. What do you feel?

Indeed we can; makes the code a little less obvious but saves a few
bytes.

Let me go do that and also attempt a Changelog to go with it ;-)