Re: [patch 19/21] introduce L1_CACHE_SHIFT_MAX

From: Andrew Morton (akpm@zip.com.au)
Date: Sun Aug 11 2002 - 04:34:05 EST

Next message: Daniel Phillips: "Re: [patch 21/21] writeback correctness and peformance fixes"
Previous message: Andi Kleen: "Re: [patch 13/21] deferred and batched addition of faulted-in pages to the LRU"
In reply to: Daniel Phillips: "Re: [patch 19/21] introduce L1_CACHE_SHIFT_MAX"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Daniel Phillips wrote:
>
> On Sunday 11 August 2002 09:40, Andrew Morton wrote:
> > zone->lock and zone->lru_lock are two of the hottest locks in the
> > kernel. Their usage patterns are quite independent. And they have
> > just been put into the same structure. It is essential that they not
> > fall into the same cacheline.
> >
> > That could be fixed by padding with L1_CACHE_BYTES. But the problem
> > with this is that a kernel which was configured for (say) a PIII will
> > perform poorly on SMP PIV. This will cause problems for kernel
> > vendors. For example, RH currently ship PII and Athlon binaries. To
> > get best SMP performance they will end up needing to ship a lot of
> > differently configured kernels.
> >
> > To solve this we need to know, at compile time, the maximum L1 size
> > which this kernel will ever run on.
> >
> > This patch adds L1_CACHE_SHIFT_MAX to every architecture's cache.h.
> >
> > Of course it'll break when newer chips come out with increased
> > cacheline sizes. Better suggestions are welcome.
>
> I think you're being too paranoid.

Staring at too many horrific profile outputs does that to one.

These are *the* two big locks.

> You pushed the performance degradation
> from the PIV to the PIII (because it will tend to hit more cachelines than it
> should)

The buddy info is all in one cacheline and the LRU info is in another.
So there's no loss to PIII here. But those two things are soooo hot
that paranoia is warranted.

> and you won't be able to build a kernel that is optimal for the PIII
> any more. I'd say that is PIII kernel is *supposed* to suck to some degree
> when run on a PIV, otherwise why bother having the PIV option?
>
> I expect the performance difference you're talking about is marginal anyway.
> Maybe you've measured it?

No, I haven't. NUMA boxes don't need it if the node-local allocation is
working right. But if they go cross-node much, it'll help. On high-performance
UMA SMP, allowing those two particular locks to fall into the same cacheline
is a big goofup.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Daniel Phillips: "Re: [patch 21/21] writeback correctness and peformance fixes"
Previous message: Andi Kleen: "Re: [patch 13/21] deferred and batched addition of faulted-in pages to the LRU"
In reply to: Daniel Phillips: "Re: [patch 19/21] introduce L1_CACHE_SHIFT_MAX"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Thu Aug 15 2002 - 22:00:24 EST