Re: [RFC PATCH 0/3] sched/deadline: cpuset: Rework DEADLINE bandwidth restoration

From: Juri Lelli
Date: Wed Mar 15 2023 - 13:11:55 EST


On 15/03/23 14:55, Qais Yousef wrote:
> On 03/15/23 12:18, Juri Lelli wrote:
> > Qais reported [1] that iterating over all tasks when rebuilding root
> > domains for finding out which ones are DEADLINE and need their bandwidth
> > correctly restored on such root domains can be a costly operation (10+
> > ms delays on suspend-resume). He proposed we skip rebuilding root
> > domains for certain operations, but that approach seemed arch specific
> > and possibly prone to errors, as paths that ultimately trigger a rebuild
> > might be quite convoluted (thanks Qais for spending time on this!).
>
> Thanks a lot for this! And sorry I couldn't provide something better.

Ah, no worries. Actually, I still have to convince myself that what I
have is actually better. :)

> >
> > To fix the problem I instead would propose we
> >
> > 1 - Bring back cpuset_mutex (so that we have write access to cpusets
> > from scheduler operations - and we also fix some problems
> > associated to percpu_cpuset_rwsem)
> > 2 - Keep track of the number of DEADLINE tasks belonging to each cpuset
> > 3 - Use this information to only perform the costly iteration if
> > DEADLINE tasks are actually present in the cpuset for which a
> > corresponding root domain is being rebuilt
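
FWIW, points 2 and 3 boil down to something like the userspace sketch
below. It is purely illustrative and not code from the series: struct
cpuset_model, cs_inc_dl_tasks(), cs_dec_dl_tasks() and
cs_restore_dl_bw() are made-up names, and locking (point 1) is left out
entirely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical userspace model; none of these names come from the patches. */
enum policy { POLICY_OTHER, POLICY_DEADLINE };

struct task {
	enum policy policy;
	unsigned long long dl_bw;	/* bandwidth to restore, arbitrary units */
};

struct cpuset_model {
	struct task *tasks;
	size_t nr_tasks;
	size_t nr_deadline_tasks;	/* point 2: per-cpuset DEADLINE counter */
};

/* Called when a task in this cpuset becomes DEADLINE (assumed hook). */
void cs_inc_dl_tasks(struct cpuset_model *cs)
{
	cs->nr_deadline_tasks++;
}

/* Called when a DEADLINE task leaves the cpuset or changes policy. */
void cs_dec_dl_tasks(struct cpuset_model *cs)
{
	assert(cs->nr_deadline_tasks > 0);	/* counter must never underflow */
	cs->nr_deadline_tasks--;
}

/*
 * Point 3: only walk the task list when the counter says there is
 * something to restore. Adds the bandwidth of every DEADLINE task to
 * *total_bw and returns how many tasks were visited.
 */
size_t cs_restore_dl_bw(const struct cpuset_model *cs,
			unsigned long long *total_bw)
{
	size_t visited = 0;

	if (cs->nr_deadline_tasks == 0)
		return 0;	/* fast path: skip the costly iteration */

	for (size_t i = 0; i < cs->nr_tasks; i++) {
		visited++;
		if (cs->tasks[i].policy == POLICY_DEADLINE)
			*total_bw += cs->tasks[i].dl_bw;
	}
	return visited;
}
```

With no DEADLINE tasks in the cpuset the restore step visits nothing,
which is the common case we want to make cheap on suspend-resume.
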
>
> nit:
>
> Would you consider adding another patch to rename the functions?
> rebuild_root_domains() and update_tasks_root_domain() are deadline accounting
> specific functions and don't actually rebuild root domains.

Yep, can do.

Thanks,
Juri