Re: [PATCH 0/5 - V2] locks: avoid thundering-herd wake-ups

From: J. Bruce Fields
Date: Sat Aug 11 2018 - 08:35:31 EST

On Sat, Aug 11, 2018 at 07:56:25AM -0400, Jeff Layton wrote:
> FWIW, I did a bit of testing with lockperf tests that I had written on
> an earlier rework of this code:
> The posix01 and flock01 tests in there show about a 10x speedup with
> this set in place.
> I think something closer to Neil's design will end up being what we want
> here. Consider the relatively common case where you have a whole-file
> POSIX write lock held with a bunch of different waiters blocked on it
> (all whole file write locks with different owners):
> With Neil's patches, you will just wake up a single waiter when the
> blocked lock is released, as they would all be in a long chain of
> waiters.

Right, but you still need to walk the whole tree to make sure that it's
the only one you need to wake. The tree structure means that you know
all the other locks have non-overlapping ranges, but it doesn't tell you
the lock owners.

Maybe there's some reasonable way to rule out the shared-lockowner case
more quickly too. I haven't thought about that much.

> If you keep all the locks in a single list, you'll either have to:
> a) wake up all the waiters on the list when the lock comes free: no lock
> is held at that point so none of them will conflict.
> ...or...
> b) keep track of what waiters have already been awoken, and compare any
> further candidate for waking against the current set of held locks and
> any lock requests by waiters that you just woke.

Instead of keeping track of *every* waiter that you've woken, you could
keep track of some subset. Worst case that just means waking more
processes than you need to, which is wasteful but correct.

In the common case that you give, that subset could just be "the first
waiter you wake". You'd get the same result.

The every-waiter-a-whole-file-write-lock case is pretty easy. To
benefit from the tree you need a case where some of the waiters overlap
and some don't.

Might be worth it, sure.


> b seems more expensive as you have to walk over a larger set of locks
> on every change.
> --
> Jeff Layton <jlayton@xxxxxxxxxx>