Re: [PATCH v9 12/16] sched/core: uclamp: Extend CPU's cgroup controller

From: Patrick Bellasi
Date: Wed Jun 05 2019 - 11:10:38 EST


On 05-Jun 07:09, Tejun Heo wrote:
> Hello,

Hi,

> On Mon, Jun 03, 2019 at 01:29:29PM +0100, Patrick Bellasi wrote:
> > On 31-May 08:35, Tejun Heo wrote:
> > > Hello, Patrick.
> > >
> > > On Wed, May 15, 2019 at 10:44:55AM +0100, Patrick Bellasi wrote:
> >
> > [...]
> >
> > > For proportions (as opposed to weights), we use percentage rational
> > > numbers - e.g. 38.44 for 38.44%. I have parser and doc update commits
> > > pending. I'll put them on cgroup/for-5.3.
> >
> > That's a point worth discussing with Peter, we already changed one
> > time from percentages to 1024 scale.
>
> cgroup tries to uss uniform units for its interface files as much as
> possible even when that deviates from non-cgroup interface. We can
> bikeshed the pros and cons for that design choice for sure but I don't
> think it makes sense to deviate from that at this point unless there
> are really strong reasons to do so.

that makes sense to me, having a uniform interface has certainly a
value.

The only additional point I can think about as a (slightly) stronger
reason is that I guess we would like to have the same API for cgroups
as well as for the task specific and the system wide settings.

The task specific values comes in via the sched_setattr() syscall:

[PATCH v9 06/16] sched/core: uclamp: Extend sched_setattr() to support utilization clamping
https://lore.kernel.org/lkml/20190515094459.10317-7-patrick.bellasi@xxxxxxx/

where we need to encode each clamp into a __u32 value.

System wide settings are expose similarly to these:

grep '' /proc/sys/kernel/sched_*

where we have always integer numbers.

AFAIU your proposal will require to use a "scaled percentage" - e.g.
3844 for 38.44% which however it's still not quite the same as writing
the string "38.44".

Not sure that's a strong enough argument, is it?

> > Utilization clamps are expressed as percentages by definition,
> > they are just expressed in a convenient 1024 scale which should not be
> > alien to people using those knobs.
> >
> > If we wanna use a "more specific" name like uclamp.{min,max} then we
> > should probably also accept to use a "more specific" metric, don't we?
>
> Heh, this actually made me chuckle.

:)

> It's an interesting bargaining take but I don't think that same word
> being in two different places makes them tradable entities.

Sure, that was not my intention.

I was just trying to see if the need to be more specific could
be an argument for having also a more specific value.

> We can go into the weeds with the semantics but how about us using
> an alternative adjective "misleading" for the cpu.util.min/max names
> to short-circuit that?

Not quite sure to get what you mean here. Are you pointing out that
with clamps we don't strictly enforce a bandwidth but we just set a
bias?

Cheers,
Patrick

--
#include <best/regards.h>

Patrick Bellasi