Re: Do we need to do anything about "dead µops?"

From: Andrew Cooper
Date: Sat May 01 2021 - 13:54:50 EST

Next message: Christophe Leroy: "Re: [PATCH] Raise the minimum GCC version to 5.2"
Previous message: kernel test robot: "[rcu:rcu/next] BUILD SUCCESS ad1f357157aa20466b590638f8a6d252d3cf33bc"
In reply to: Andy Lutomirski: "Do we need to do anything about "dead µops?""
Next in thread: Josh Poimboeuf: "Re: Do we need to do anything about "dead µops?""
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 01/05/2021 17:26, Andy Lutomirski wrote:
> Hi all-
>
> The "I See Dead µops" paper that is all over the Internet right now is
> interesting, and I think we should discuss the extent to which we
> should do anything about it. I think there are two separate issues:
>
> First, should we (try to) flush the µop cache across privilege
> boundaries? I suspect we could find ways to do this, but I don't
> really see the point. A sufficiently capable attacker (i.e. one who
> can execute their own code in the dangerous speculative window or one
> who can find a capable enough string of gadgets) can put secrets into
> the TLB, various cache levels, etc. The µop cache is a nice piece of
> analysis, but I don't think it's qualitatively different from anything
> else that we don't flush. Am I wrong?
>
> Second, the authors claim that their attack works across LFENCE. I
> think that this is what's happening:
>
> load secret into %rax
> lfence
> call/jmp *%rax
>
> As I understand it, on CPUs from all major vendors, the call/jmp will
> gladly fetch before lfence retires, but the address from which it
> fetches will come from the branch predictors, not from the
> speculatively computed value in %rax.

The vendor-provided literature on pipelines (primarily, the optimisation
guides) has the register file down by the execute units, and not near
the frontend. Letting the frontend have access to the register value is
distinctly non-trivial.

> So this is nothing new. If the
> kernel leaks a secret into the branch predictors, we have already
> mostly lost, although we have a degree of protection from the various
> flushes we do. In other words, if we do:
>
> load secret into %rax
> <-- non-speculative control flow actually gets here
> lfence
> call/jmp *%rax
>
> then we will train our secret into the predictors but also leak it
> into icache, TLB, etc, and we already lose. We shouldn't be doing
> this in a way that matters. But, if we do:
>
> mispredicted branch
> load secret into %rax
> <-- this won't retire because the branch was mispredicted
> lfence
> <-- here we're fetching but not dispatching
> call/jmp *%rax
>
> then the leak does not actually occur unless we already did the
> problematic scenario above.
>
> So my tentative analysis is that no action on Linux's part is required.
>
> What do you all think?

Everything here seems to boil down to managing to encode the secret in
the branch predictor state, then managing to recover it via the uop
cache sidechannel.

It is well known and generally understood that once your secret is in
the branch predictor, YHAL. Code with that property was broken before
this paper, is still broken after this paper, and needs fixing irrespective.

Viewed in these terms, I don't see what security improvement is gained
from trying to flush the uop cache.

I tentatively agree with your conclusion, that no specific action
concerning the uop cache is required.

~Andrew

Next message: Christophe Leroy: "Re: [PATCH] Raise the minimum GCC version to 5.2"
Previous message: kernel test robot: "[rcu:rcu/next] BUILD SUCCESS ad1f357157aa20466b590638f8a6d252d3cf33bc"
In reply to: Andy Lutomirski: "Do we need to do anything about "dead µops?""
Next in thread: Josh Poimboeuf: "Re: Do we need to do anything about "dead µops?""
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]