Adding the containers list to the discussion so more interested people
have a chance of seeing this.
Benjamin Coddington <bcodding@xxxxxxxxxx> writes:
A persistent unsolved problem exists: how can the kernel find and/or create
the appropriate "container" within which to execute a userspace program to
construct keys or satisfy users of call_usermodehelper()?
I believe the latest serious attempt to solve this problem was David's "Make
containers kernel objects":
Over in NFS' space, we've most recently pondered this issue while looking at
ways to pass a kernel socket to userspace in order to handle TLS events:
The problem is that containers are not kernel objects, rather a collection
of namespaces, cgroups, etc. Attempts at making the kernel aware of
containers have been mired in discussion and problems. It has been
suggested that the best representation of a "container" from the kernel's
perspective is a process.
Keyagents are processes represented by a key. If a keyagent's key is linked
to a session_keyring, it can be sent a realtime signal when a calling
process requests a matching key_type. That signal will dispatch the process
to construct the desired key within the keyagent process context. Keyagents
are similar to ssh-agents. To use a keyagent, one must execute a keyagent
process in the desired context, and then link the keyagent's key onto other
This method of linking keyagent keys to session_keyrings can be used to
construct the various mappings of callers to keyagents that containers may
need. A single keyagent process can answer request-key upcalls across
container boundaries, or upcalls can be restricted to specific containers.
I'm aware that building on realtime signals may not be a popular choice, but
using realtime signals makes this work simple and ensures delivery. Realtime
signals are able to convey everything needed to construct keys in userspace:
the under-construction key's serial number.
This work is not complete; it has security implications, it needs
documentation, it has not been reviewed by anyone. Thanks for reading this
RFC. I wish to collect criticism and validate this approach.
At a high level I do agree that we need to send a message to a userspace
process and that message should contain enough information to start the
user mode helper.
Then a daemon or possibly the container init can receive the message
and dispatch the user mode helper.
Fundamentally that design solves all of the container issues, and I
think solves a few of the user mode helper issues as well.
The challenge with this design is that it requires someone standing up a
daemon to receive the messages and call a user mode helper to retain
compatibility with current systems.
I would prefer to see a file descriptor rather than a signal used to
deliver the message. Signals suck for many many reasons and a file
descriptor based notification potentially can be much simpler.
One of those many reasons is that by not following the common pattern
for filling in kernel_siginfo you have left uninitialized padding in
your structure that will be copied to userspace thus creating a kernel
information leak. Similarly your code doesn't fill in about half the
fields that are present in the siginfo union for the _rt case.
I think a file descriptor based design could additionally address the
back and forth your design needs with keys to figure out what event has
happened and what user mode helper to invoke.
Ideally I would also like to see a design less tied to keys. So that we
could use this for the other user mode helper cases as well. That said
solving request_key appears to be the truly important part, there aren't
many other user mode helpers. Still it would be nice if in theory the
design could be used to dispatch the coredump helper as well.