Re: [PATCH RFC] futex: avoid false sharing between hb->chain and the bucket lock

Next message: Sean Christopherson: "Re: [PATCH 2/3] KVM: MMU: unconditionally clear MMIO cache on root rebuild"
Previous message: Stuart Hayes: "[PATCH v2] scsi: target: Allow FUA if no write cache enabled"
In reply to: Thomas Gleixner: "Re: [PATCH RFC] futex: avoid false sharing between hb-&gt;chain and the bucket lock"
Next in thread: Peter Zijlstra: "Re: [PATCH RFC] futex: avoid false sharing between hb-&gt;chain and the bucket lock"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Peter Zijlstra

Date: Tue Jun 09 2026 - 16:24:43 EST

On Tue, Jun 09, 2026 at 10:16:31PM +0200, Thomas Gleixner wrote:
> Breno!
>
> On Tue, Jun 09 2026 at 08:28, Breno Leitao wrote:
> > On Tue, Jun 09, 2026 at 12:46:03PM +0200, Peter Zijlstra wrote:
> >> On Fri, Jun 05, 2026 at 09:53:12AM -0700, Breno Leitao wrote:
> >> perf bench futex hash 192479 195523 +1.5%
> >> perf bench futex hash -b 256 3453734 3987880 +15.5%
> >>
> >> And then I do see the improvement from your patch, but I really cannot
> >> make sense of your reasoning for it.
> >
> > So, let me rephrase it. The bucket cacheline takes hits from four access
> > patterns - the three I listed (waiters_pending readers, lock spinners,
> > lock-holder chain writes) plus the lockless `fph = hb->priv` load on the
> > futex_hash() fast path, which is what c2c surfaced. That priv load is the
> > dominant HITM source on baseline, not the chain writes I emphasized.
>
> Ok. That makes a lot more sense now.
>
> >> > Cost: one extra cacheline (56 B padding) per bucket. Would it be
> >> > acceptable?
> >>
> >> I'm really not sure, it *doubles* the futex memory cost.
> >
> > I think it's worth the trade. The global hash scales linearly with
> > num_possible_cpus(), so the extra bytes track the same curve as the machines
> > that actually need the fix
> >
> > in simpler words, a box big enough to feel this contention has plenty of RAM
> > headroom to absorb it.
>
> Well, it's not only about the global hash. The per process private hash
> is affected too.
>
> Can you try the completely untested below?

This moves the access to futex_hash_put() :-)