Re: Upcoming: Notifications, FS notifications and fsinfo()

From: Miklos Szeredi
Date: Fri Apr 03 2020 - 07:49:06 EST


On Fri, Apr 3, 2020 at 1:08 PM Lennart Poettering <mzxreary@xxxxxxxxxxx> wrote:
>
> On Do, 02.04.20 19:20, Miklos Szeredi (miklos@xxxxxxxxxx) wrote:
>
> > On Thu, Apr 2, 2020 at 5:50 PM Lennart Poettering <mzxreary@xxxxxxxxxxx> wrote:
> > >
> > > On Do, 02.04.20 17:35, Miklos Szeredi (miklos@xxxxxxxxxx) wrote:
> > >
> > > > > systemd cares about all mount points in PID1's mount namespace.
> > > > >
> > > > > The fact that mount tables can grow large is why we want something
> > > > > better than constantly reparsing the whole /proc/self/mountinfo. But
> > > > > filtering subsets of that is something we don't really care about.
> > > >
> > > > I can accept that, but you haven't given a reason why that's so.
> > > >
> > > > What does it do with the fact that an automount point was crossed, for
> > > > example? How does that affect the operation of systemd?
> > >
> > > We don't care how a mount point came to be. If it's autofs or
> > > something else, we don't care. We don't access these mount points
> > > ourselves ever, we just watch their existance.
> > >
> > > I mean, it's not just about startup it's also about shutdown. At
> > > shutdown we need to unmount everything from the leaves towards the
> > > root so that all file systems are in a clean state.
> >
> > Unfortunately that's not guaranteed by umounting all filesystems from
> > the init namespace. A filesystem is shut down when all references to
> > it are gone. Perhaps you instead want to lazy unmount root (yeah,
> > that may not actually be allowed, but anyway, lazy unmounting the top
> > level ones should do) and watch for super block shutdown events
> > instead.
> >
> > Does that make any sense?
>
> When all mounts in the init mount namespace are unmounted and all
> remaining processes killed we switch root back to the initrd, so that
> even the root fs can be unmounted, and then we disassemble any backing
> complex storage if there is, i.e. lvm, luks, raid, â

I think it could be done the other way round, much simpler:

- switch back to initrd
- umount root, keeping the tree intact (UMOUNT_DETACHED)
- kill all remaining processes, wait for all to exit

I think that should guarantee that all super blocks have been shut down. Al?

The advantage would be that there's no need to walk the mount tree
unmounting individual leafs, since it's all done automagically.

Thanks,
Miklos