Re: [RESEND v12 0/6] cgroup-aware OOM killer

From: Michal Hocko
Date: Mon Oct 23 2017 - 07:49:58 EST


On Sun 22-10-17 17:24:51, David Rientjes wrote:
> On Thu, 19 Oct 2017, Johannes Weiner wrote:
>
> > David would have really liked for this patchset to include knobs to
> > influence how the algorithm picks cgroup victims. The rest of us
> > agreed that this is beyond the scope of these patches, that the
> > patches don't need it to be useful, and that there is nothing
> > preventing anyone from adding configurability later on. David
> > subsequently nacked the series as he considers it incomplete. Neither
> > Michal nor I see technical merit in David's nack.
> >
>
> The nack is for three reasons:
>
> (1) unfair comparison of root mem cgroup usage to bias against that mem
> cgroup from oom kill in system oom conditions,

Most users who are going to use this feature right now will have
most of the userspace in their containers rather than in the root
memcg. The root memcg will always be special and as such there will
never be a universal best way to handle it. We should to satisfy most of
usecases. I would consider this something that is an open for a further
discussion but nothing that should stand in the way.

> (2) the ability of users to completely evade the oom killer by attaching
> all processes to child cgroups either purposefully or unpurposefully,
> and

This doesn't differ from the current state where a task can purposefully
or unpurposefully hide itself from the global memory killer by spawning
new processes.

> (3) the inability of userspace to effectively control oom victim
> selection.

this is not requested by the current usecase and it has been pointed out
that this will be possible to implement on top of the foundation of this
patchset.

So again, nothing to nack the work as is.
--
Michal Hocko
SUSE Labs