Re: [PATCH v5 10/14] sched/cpufreq: Refactor the utilization aggregation method

From: Quentin Perret
Date: Tue Jul 31 2018 - 04:00:01 EST


On Monday 30 Jul 2018 at 12:35:27 (-0700), skannan@xxxxxxxxxxxxxx wrote:
[...]
> If it's going to be a different aggregation from what's done for frequency
> guidance, I don't see the point of having this inside schedutil. Why not
> keep it inside the scheduler files?

This code basically results from a discussion we had with Peter on v4.
Keeping everything centralized can make sense from a maintenance
perspective, I think. That makes it easy to see the impact of any change
to utilization signals for both EAS and schedutil.


> Also, it seems weird to use a governor's
> code when it might not actually be in use. What if someone is using
> ondemand, conservative, performance, etc?

Yeah I thought about that too ... I would say that even if you don't
use schedutil, it is probably a fair assumption from the scheduler's
standpoint to assume that somewhat OPPs follow utilization (in a very
loose way). So yes, if you use ondemand with EAS you won't have a
perfectly consistent match between the frequency requests and what EAS
predicts, and that might result in sub-optimal decisions in some cases,
but I'm not sure if we can do anything better at this stage.

Also, if you do use schedutil, EAS will accurately predict what will be
the frequency _request_, but that gives you no guarantee whatsoever that
you'll actually get it for real (because you're throttled, or because of
thermal capping, or simply because the HW refuses it for some reason ...).

There will be inconsistencies between EAS' predictions and the actual
frequencies, and we have to live with that. The best we can do is make
sure we're at least internally consistent from the scheduler's
standpoint, and that's why I think it can make sense to factorize as
many things as possible with schedutil where applicable.

> > + if (type == frequency_util) {
> > + /*
> > + * Bandwidth required by DEADLINE must always be granted
> > + * while, for FAIR and RT, we use blocked utilization of
> > + * IDLE CPUs as a mechanism to gracefully reduce the
> > + * frequency when no tasks show up for longer periods of
> > + * time.
> > + *
> > + * Ideally we would like to set bw_dl as min/guaranteed
> > + * freq and util + bw_dl as requested freq. However,
> > + * cpufreq is not yet ready for such an interface. So,
> > + * we only do the latter for now.
> > + */
> > + util += cpu_bw_dl(rq);
> > + }
>
> Instead of all this indentation, can't you just return early without doing
> the code inside the if?

But then I'll need to duplicate the 'min' below, so not sure if it's
worth it ?

> > +enum schedutil_type {
> > + frequency_util,
> > + energy_util,
> > +};
>
> Please don't use lower case for enums. It's extremely confusing.

Ok, I'll change that in v6.

Thanks !
Quentin