Re: [PATCH] powerpc/lib/xor_vmx: Relax frame size for clang
From: Segher Boessenkool
Date: Thu Sep 08 2022 - 18:52:11 EST
Hi!
On Thu, Sep 08, 2022 at 05:07:24PM +0200, Arnd Bergmann wrote:
> - if the XOR code has its frame size explode like this, it's
> probably an indication of the compiler doing something wrong,
> not the kernel code.
On the contrary, it is most likely an indication that the kernel code
wants something unreasonable. Like, having 20 variables live at the
same time, but still wanting nicely scheduled machine code generated.
But I suspect GCC unrolled the loops here, even? Best way to prevent
that here is to put an option in the Makefile, for these files. We
don't want any of this unrolled after all? Or, alternatively, remove
all the manual unrolling from this code, let GCC do its thing, without
painting it in a corner.
> The result is likely that the "optimized"
> XOR implementation is slower than the default version as a
> result, and the kernel will pick the other one at boot time.
Yes. So it's self-healing even, of a sort :-)
Segher