Re: [PATCH] x86: Use clflush() instead of wbinvd() whenever possiblewhen changing mapping

From: Thomas Hellstrom
Date: Fri Jul 24 2009 - 07:14:17 EST

Next message: Frederik Himpe: "2.6.31 - snd_intel8x0: fails to initialize NVidia CK804 sound chip"
Previous message: Peter Zijlstra: "Re: [PATCH v3 3/4] ftrace: add tracepoint for hrtimer"
In reply to: Andi Kleen: "Re: [PATCH] x86: Use clflush() instead of wbinvd() wheneverpossible when changing mapping"
Next in thread: Andi Kleen: "Re: [PATCH] x86: Use clflush() instead of wbinvd() wheneverpossible when changing mapping"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Andi Kleen wrote:

On Fri, Jul 24, 2009 at 12:21:50PM +0200, Thomas Hellstrom wrote:

Andi Kleen wrote:

Thomas Hellstrom <thellstrom@xxxxxxxxxx> writes:

The current code uses wbinvd() when the area to flush is > 4MB. Although this
may be faster than using clflush() the effect of wbinvd() on irq latencies
may be catastrophical on systems with large caches. Therefore use clflush()

may be? You seem to miss some hard data here.

Admittedly.

So was it motivated by a real problem?

No. It was motivated by the assumption that wbinvd() is just bad:
Qoute:

WBINVD is a very nasty operation. I was talking to some CPU people and they really recommended to get rid of it as far as possible. Stopping the CPU for msecs is just wrong and there are apparently even some theoretical live lock situations. - It is not interruptible in earlier VT versions and messes up real time in the hypervisor. Some people were doing KVM on rt kernels and had latency spikes from that.

/Qoute
(I believe you wrote that ?)

However, the concept of flushing and invalidating the caches completely on systems with many
processors and huge caches when we intend to only flush only small piece of the cache also sounds like a big overkill.

The other CPUs will not block (just flush their caches in the background or
in parallel), so the latency shouldn't scale with the number of sockets.
Also number of cores also shouldn't impact it because these tend
to have shared cache hierarchies.

That's just a theory, but not necessarily a worse one than yours :-)

Furthermore, since the wbinvd() has been introduced as an optimization of the general clflush() case, did somebody ever check the effects on systems with many processors and huge caches?

Typically systems with large caches flush faster too.

OK. We should really test this at some point. I currently don't have the hardware to do so.

-Andi

/Thomas

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Frederik Himpe: "2.6.31 - snd_intel8x0: fails to initialize NVidia CK804 sound chip"
Previous message: Peter Zijlstra: "Re: [PATCH v3 3/4] ftrace: add tracepoint for hrtimer"
In reply to: Andi Kleen: "Re: [PATCH] x86: Use clflush() instead of wbinvd() wheneverpossible when changing mapping"
Next in thread: Andi Kleen: "Re: [PATCH] x86: Use clflush() instead of wbinvd() wheneverpossible when changing mapping"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]