Re: [PATCH v4 0/2] cgroup: allow management of subtrees by new cgroup namespaces

From: Tejun Heo
Date: Tue May 31 2016 - 14:18:14 EST

In reply to: Aleksa Sarai: "Re: [PATCH v4 0/2] cgroup: allow management of subtrees by new cgroup namespaces"

Hello, Aleksa.

On Sat, May 28, 2016 at 03:12:16PM +1000, Aleksa Sarai wrote:
> I see two options here, which would allow us to have subtree
> delegation without losing hierarchical structure:
>
> 1. Don't enable subtree delegation on v1 hierarchies. This would be
> the simplest solution, and would cut out most people from using this
> feature today -- but it would mean less work around trying to figure
> out which hierarchies are safe to delegate (we make it explicit that
> when you enable a cgroup on v2 that it must be safe to delegate by an
> unprivileged user). We also get the benefit of having the more strict
> cgroup.procs write rules.
> 2. Don't do subtree delegation on hierarchies that aren't
> hierarchical. This would have to be done in collaboration with the
> controllers (since cgroup core doesn't know which is hierarchical),
> and would allow all users of cgroups today to get subtree delegation.

Regardless of the above two, the proposed solution is too hacky and
doesn't fit with the overall design. At the moment, I'm not sure the
problem is worth solving.

> >> > Why does an unpriv NS need to have cgroup delegated to it without
> >> > cooperation from cgroup manager?
> >>
> >> There are actually many answers to this. The one I'm interested in is
> >> the ability for applications to make use of container features without
> >> having to ask permission from some orchestration engine. The problem
> >
> > What's "container features"? Do you mean resource control by that?
>
> Yes. Also the device cgroup. And ignoring the container usecase, it
> would be useful to regular programs if they could use cgroup resource
> accounting as part of their regular operation. Regular processes can
> use rlimits -- why can't they use cgroups without needing cooperation
> from an admin process (which makes for security and administration
> issues)?

The current VFS-based interface simply isn't conducive to such usage.
For example, what if someone else relocates the process while it is
trying to access its interface files? What if the permission or
ownership changes beneath the process?
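
A minimal sketch of the first race, for illustration only. It assumes the
cgroup2 hierarchy is mounted at /sys/fs/cgroup and that membership is read
from /proc/self/cgroup, whose lines have the form
"hierarchy-ID:controller-list:path". The helper names (parse_cgroup_path,
read_own_file) and the injected reader callables are hypothetical. The
before/after comparison can only notice a relocation after the fact; it
cannot prevent one, and it misses a move away and back.

```python
import os
from typing import Callable, Optional, Tuple


def parse_cgroup_path(proc_cgroup_text: str, controller: str = "") -> Optional[str]:
    """Return the cgroup path for `controller` from /proc/<pid>/cgroup text.

    The v2 unified hierarchy has an empty controller list ("0::/path"),
    which is what the default controller="" matches.
    """
    for line in proc_cgroup_text.splitlines():
        parts = line.split(":", 2)
        if len(parts) != 3:
            continue  # skip malformed lines
        controllers = parts[1].split(",") if parts[1] else [""]
        if controller in controllers:
            return parts[2]
    return None


def read_own_file(
    read_membership: Callable[[], str],
    read_file: Callable[[str], str],
    mount: str = "/sys/fs/cgroup",      # assumed mount point
    name: str = "cgroup.controllers",   # any interface file would do
) -> Tuple[str, bool]:
    """Read an interface file of "our" cgroup; report whether we stayed put."""
    before = parse_cgroup_path(read_membership())
    if before is None:
        raise RuntimeError("no matching hierarchy in membership data")
    # Window: another process may migrate us between the lookup above and
    # the path-based access below, so the path may no longer be ours.
    data = read_file(os.path.join(mount, before.lstrip("/"), name))
    after = parse_cgroup_path(read_membership())
    return data, before == after
```

On a real system the two callables would read /proc/self/cgroup and open
the computed path; they are injected here so the window between the two
membership reads is explicit.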

> If I understand correctly, the security issue James is referring to
> is that the cgroup manager could have a bug in it (and because the
> cgroup interface is the filesystem, it would probably be some kind of
> write-to-any-path bug). This is an intrinsic part of the model of "you
> need to have cooperation with an admin process in order to use
> resource limiting for your own processes".

Sure, that's the inherent characteristic of the interface that cgroup
ended up with. I'm not a big fan of it either, but 1. it is still a
workable model, and 2. adding mismatching hacks on top is highly likely
to lead to an interface disaster in the long term.

Thanks.

--
tejun
