Re: [GIT PULL] please pull file-locking related changes for v3.20

From: Jeff Layton
Date: Tue Feb 17 2015 - 15:12:43 EST


On Tue, 17 Feb 2015 11:41:40 -0800
Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> wrote:

> On Tue, Feb 17, 2015 at 11:27 AM, Jeff Layton <jlayton@xxxxxxxxxxxxxxx> wrote:
> >
> > What about this instead then?
>
> No. Really.
>
> > - leave the "drop the spinlock" thing in place in flock_lock_file for
> > v3.20
>
> No. The whole concept of "drop the lock in the middle" is *BROKEN*.
> It's seriously crap. It's not just a bug, it's a really fundamentally
> wrong thing to do.
>
> > - change locks_remove_flock to just walk the list and delete any locks
> > associated with the filp being closed
>
> No. That's still wrong. You can have two people holding a write-lock.
> Seriously. That's *shit*.
>
> The "drop the spinlock in the middle" must go. There's not even any
> reason for it. Just get rid of it. There can be no deadlock if you get
> rid of it, because
>
> - we hold the flc_lock over the whole event, so we can never see any
> half-way state
>
> - if we actually decide to sleep (due to conflicting locks) and
> return FILE_LOCK_DEFERRED, we will drop the lock before actually
> sleeping, so nobody else will be deadlocking on this file lock. So any
> *other* person who tries to do an upgrade will not sleep, because the
> pending upgrade will have moved to the blocking list (that whole
> "locks_insert_block" part.
>
> Ergo, either we'll upgrade the lock (atomically, within flc_lock), or
> we will drop the lock (possibly moving it to the blocking list). I
> don't see a deadlock.
>
> I think your (and mine - but mine had the more fundamental problem of
> never setting "old_fl" correctly at all) patch had a deadlock because
> you didn't actually remove the old lock when you returned
> FILE_LOCK_DEFERRED.
>
> But I think the correct minimal patch is actually to just remove the
> "if (found)" statement.
>
> Linus

I agree that there's no deadlock. I also agree that allowing two
LOCK_EX's (or a LOCK_SH + LOCK_EX) on the file is broken. I'm just
leery on making a user-visible change at this point. I'd prefer to let
something like that soak in linux-next for a while.

Another possibility is to keep dropping the spinlock, but check to see
if someone set a new lock on the same filp in the loop after that. If
they have, then we could just remove that lock before adding the new
one.

I don't think that would violate anything since there are no atomicity
guarantees here. If you're setting locks on the same filp from multiple
tasks then you're simply asking for trouble.

I don't expect that most apps do that though, but rather work on their
own set of open file descriptions. Those might get bitten however if we
stop dropping the spinlock there since we'll be changing how flock's
fairness works.

See the attached (untested) patch for what I'm thinking. If you still
think that removing the "if (found)" clause is the right thing to do,
I'll go with that, but I do worry that we might break some (fragile)
app that might rely on the way that flock works today.

--
Jeff Layton <jlayton@xxxxxxxxxxxxxxx>