Re: [RFC][PATCH 00/10] Mount, FS, Block and Keyrings notifications [ver #3]

From: David Howells
Date: Thu Jun 06 2019 - 09:21:06 EST


Stephen Smalley <sds@xxxxxxxxxxxxx> wrote:

This might be easier to discuss if you can reply to:

https://lore.kernel.org/lkml/5393.1559768763@xxxxxxxxxxxxxxxxxxxxxx/

which is on the ver #2 posting of this patchset.

> > LSM support is included, but controversial:
> >
> > (1) The creds of the process that did the fput() that reduced the refcount
> > to zero are cached in the file struct.
> >
> > (2) __fput() overrides the current creds with the creds from (1) whilst
> > doing the cleanup, thereby making sure that the creds seen by the
> > destruction notification generated by mntput() appears to come from
> > the last fputter.
> >
> > (3) security_post_notification() is called for each queue that we might
> > want to post a notification into, thereby allowing the LSM to prevent
> > covert communications.
> >
> > (?) Do I need to add security_set_watch(), say, to rule on whether a watch
> > may be set in the first place? I might need to add a variant per
> > watch-type.
> >
> > (?) Do I really need to keep track of the process creds in which an
> > implicit object destruction happened? For example, imagine you create
> > an fd with fsopen()/fsmount(). It is marked to dissolve the mount it
> > refers to on close unless move_mount() clears that flag. Now, imagine
> > someone looking at that fd through procfs at the same time as you exit
> > due to an error. The LSM sees the destruction notification come from
> > the looker if they happen to do their fput() after yours.
>
>
> I'm not in favor of this approach.

Which bit? The last point, ie. keeping track of the process creds after an
implicit object destruction?

> Can we check permission to the object being watched when a watch is set
> (read-like access),

Yes, and I need to do that. I think it's likely to require an extra hook for
each entry point added because the objects are different:

int security_watch_key(struct watch *watch, struct key *key);
int security_watch_sb(struct watch *watch, struct path *path);
int security_watch_mount(struct watch *watch, struct path *path);
int security_watch_devices(struct watch *watch);
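
As a rough illustration of the read-like check at watch-set time, here is a
userspace sketch shaped like the security_watch_key() prototype above. The
sketch_* types, the label bitmask and the policy are all assumptions made up
for the example; they stand in for whatever the LSM really keeps on the watch
and the key.

```c
#include <errno.h>

/* Hypothetical stand-ins for struct watch and struct key; not the kernel's. */
struct sketch_watch {
	unsigned int watcher_label;	/* single bit identifying the watcher's label */
};

struct sketch_key {
	unsigned int readers;		/* bitmask of labels granted read-like access */
};

/*
 * Shaped like the proposed security_watch_key(): returning 0 lets the watch
 * be set, returning -EACCES refuses it.
 */
static int sketch_watch_key(const struct sketch_watch *watch,
			    const struct sketch_key *key)
{
	if (key->readers & watch->watcher_label)
		return 0;
	return -EACCES;
}
```

The other three hooks would presumably look the same apart from the object
being checked, which is why one hook per entry point seems likely.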

> make sure every access that can trigger a notification requires a
> (write-like) permission to the accessed object,

"write-like permission" for whom? The triggerer or the watcher?

There are various 'classes' of events:

(1) System events (eg. hardware I/O errors, automount points expiring).

(2) Direct events (eg. automounts, manual mounts, EDQUOT, key linkage).

(3) Indirect events (eg. exit/close doing the last fput and causing an
unmount).

Class (1) events aren't caused by a process, so I use init_cred for them. One
could argue that the automount point expiry should perhaps take place under
the creds of whoever triggered it in the first place, but we need to be
careful about long-term cred pinning.

For class (2), the causing process must've had permission to cause them -
otherwise we wouldn't have got the event.

Class (3) is interesting since it's currently entirely cleanup events and the
process may have the right to do them (close, dup2, exit, but also execve)
whether the LSM thinks it should be able to cause the object to be destroyed
or not.
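
To make the three classes concrete, here is a small userspace model of which
creds a notification ends up attributed to. It is only a sketch: model_cred,
event_class and event_cred() are hypothetical names, and attributing class (2)
to the caller's own creds is my assumption for the example rather than
something the patches spell out.

```c
#include <stddef.h>

/* Hypothetical userspace model; none of these names are kernel API. */
struct model_cred {
	const char *label;
};

enum event_class {
	EVENT_SYSTEM,	/* class (1): no causing process */
	EVENT_DIRECT,	/* class (2): caused directly by a permitted action */
	EVENT_INDIRECT,	/* class (3): side effect of cleanup, eg. the last fput */
};

/* Stand-in for the kernel's init_cred. */
static const struct model_cred model_init_cred = { "init" };

/*
 * Pick the creds a notification is attributed to.  current_cred is the
 * caller's creds; last_fputter is whatever was cached when the refcount hit
 * zero, or NULL if nothing was cached.
 */
static const struct model_cred *
event_cred(enum event_class cls, const struct model_cred *current_cred,
	   const struct model_cred *last_fputter)
{
	switch (cls) {
	case EVENT_SYSTEM:
		return &model_init_cred;
	case EVENT_DIRECT:
		return current_cred;
	case EVENT_INDIRECT:
		return last_fputter ? last_fputter : current_cred;
	}
	return &model_init_cred;
}
```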

It gets more complicated than that, though: multiple processes with different
security attributes can all have fds pointing to a common file object - and
the last one to close carries the can as far as the LSM is concerned.

And yet more complicated when you throw in unix sockets with partially passed
fds still in their queues. That's what patch 01 is designed to try and cope
with.
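
The "last one to close carries the can" problem can be shown with a toy
refcounted object. This is a userspace sketch, not the code in the patches:
toy_file, toy_cred, toy_fget() and toy_fput() are hypothetical names, and a
reference sitting in a unix socket queue is modelled as just one more
refcount.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical userspace model of a shared file object; not kernel code. */
struct toy_cred {
	const char *label;
};

struct toy_file {
	int refcount;				/* number of references held */
	const struct toy_cred *last_cred;	/* creds of whoever dropped the last ref */
};

/* Take another reference, as dup() or passing the fd over a socket would. */
static void toy_fget(struct toy_file *f)
{
	f->refcount++;
}

/*
 * Drop a reference.  Returns 1 if this call released the last reference, in
 * which case the caller's creds are recorded as the ones the destruction
 * notification would appear to come from.  Returns 0 otherwise.
 */
static int toy_fput(struct toy_file *f, const struct toy_cred *cred)
{
	assert(f->refcount > 0);
	if (--f->refcount > 0)
		return 0;
	f->last_cred = cred;
	return 1;
}
```

If the owner puts first and a second holder puts afterwards, last_cred ends up
pointing at the second holder's creds, even though that holder never asked for
the object to go away.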

> and make sure there is some sane way to control the relationship between the
> accessed object and the watched object (write-like)?

This is the trick. Keys and superblocks have object labels of their own and
don't - for now - propagate their watches. With these, the watch is on the
object you initially assign it to and it goes no further than that.

mount_notify() is the interesting case since we want to be able to detect
mount topology change events from within the vfs subtree rooted at the watched
directory without having to manually put a watch on every directory in that
subtree - or even just every mount object.

Or, maybe, that's what I'll have to do: make it so that mount_notify() can
only apply to the subtree within its superblock, and the caller must call
mount_notify() for every mount object it wants to monitor. That would at
least ensure that the caller can, at that point, reach all those mount points.

> For cases where we have no object per se or at least no security
> structure/label associated with it, we may have to fall back to a
> coarse-grained "Can the watcher get this kind of notification in general?".

Agreed - and we should probably have that anyway.
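
A coarse-grained check of that sort might be no more than a per-watcher mask
of notification types. The sketch below is userspace C with made-up names
(demo_notify_type, demo_watcher, demo_may_receive()); how an LSM would really
store or derive such a mask is left open.

```c
#include <errno.h>

/* Hypothetical notification types and watcher policy; not kernel API. */
enum demo_notify_type {
	DEMO_NOTIFY_KEY    = 1 << 0,
	DEMO_NOTIFY_SB     = 1 << 1,
	DEMO_NOTIFY_MOUNT  = 1 << 2,
	DEMO_NOTIFY_DEVICE = 1 << 3,
};

struct demo_watcher {
	unsigned int allowed_types;	/* bitmask of demo_notify_type values */
};

/* Return 0 if the watcher may receive this type at all, -EACCES if not. */
static int demo_may_receive(const struct demo_watcher *w,
			    enum demo_notify_type type)
{
	return (w->allowed_types & type) ? 0 : -EACCES;
}
```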

David