Re: [PATCH 2/3] sched/fair: Prevent active LB from preempting higher sched classes

From: Valentin Schneider
Date: Thu Aug 08 2019 - 06:46:42 EST


On 08/08/2019 10:24, Qais Yousef wrote:
>> @@ -8834,6 +8834,10 @@ static inline enum alb_status active_load_balance(struct lb_env *env)
>>
>> raw_spin_lock_irqsave(&busiest->lock, flags);
>>
>> + /* Make sure we're not about to stop a task from a higher sched class */
>> + if (busiest->curr->sched_class != &fair_sched_class)
>> + goto unlock;
>> +
>
> This looks correct to me, but I wonder if this check is something that belongs
> to the CONFIG_PREEMPT_RT land. This will give a preference to not disrupt the
> RT/DL tasks which is certainly the desired behavior there, but maybe in none
> PREEMPT_RT world balancing CFS tasks is more important? Hmmm
>

My take on this is that if the running task isn't CFS, there is no point in
running the cpu_stopper there (PREEMPT_RT or not). We can still try other
things though.

It could be that the running task had been > CFS all along, so if we
failed to move any load then we just couldn't pull any CFS task and should
bail out of load balance at this point.

If the running task was CFS but got preempted by a > CFS task in the
meantime (e.g. after detach_tasks() failed to pull anything), the best we
could do is run detach_one_task() (locally - no need for any cpu_stopper)
to try and nab the now not-running CFS task. Otherwise we'll have to wait
for another round of load_balance().
Not sure how much we care about this case - I think it's extremely unlikely
to repeatedly want to pull a currently-running CFS task and have it
repeatedly preempted by a > CFS task whenever we get to active_load_balance().

Let me try and see if I can come up with something sensible with that
detach_one_task() thingy.

> --
> Qais Yousef
>
>> /*
>> * Don't kick the active_load_balance_cpu_stop, if the curr task on
>> * busiest CPU can't be moved to dst_cpu:
>> --
>> 2.22.0
>>