Re: [PATCH 1/3] capabilities: user namespace capabilities

From: Jonathan Calmels
Date: Fri May 17 2024 - 13:58:38 EST


> > On Fri, May 17, 2024 at 06:32:46AM GMT, Eric W. Biederman wrote:
> As I read your introduction you were justifying the introduction
> of a new security mechanism with the observation that distributions
> were carrying distribution specific patches.
>
> To the best of my knowledge distribution specific patches and
> distributions disabling user namespaces have been gone for quite a
> while. So if that has changed recently I would like to know.

On the top of my head:

- RHEL based:
namespace.unpriv_enable
user_namespace.enable

- Arch/Debian based:
kernel.unprivileged_userns_clone

- Ubuntu based:
kernel.apparmor_restrict_unprivileged_userns

I'm not sure which exact version those apply to, but it's definitely
still out there.

The observation is that while you can disable namespaces today, in
practice it breaks userspace in various ways. Hence, being able to
control capabilities is a better way to approach it.

For example, today's big hammer to prevent CAP_NET_ADMIN in userns:

# sysctl -qw user.max_net_namespaces=0

$ unshare -U -r -n ip tuntap add mode tap tap0 && echo OK
unshare: unshare failed: No space left on device

With patch, this becomes manageable:

# capsh --drop=cap_net_admin --secbits=$((1 << 8)) --user=$USER -- \
-c 'unshare -U -r -n ip tuntap add mode tap tap0 && echo OK'
ioctl(TUNSETIFF): Operation not permitted