Re: [PATCH RFC] x86: avoid atomic operation intest_and_set_bit_lock if possible

From: Peter Zijlstra
Date: Fri Mar 25 2011 - 06:19:20 EST


On Fri, 2011-03-25 at 10:22 +0100, Ingo Molnar wrote:
> * Andi Kleen <andi@xxxxxxxxxxxxxx> wrote:
>
> > > never EVER seen any good explanation of why that particular sh*t
> > > argument would b true. It seems to be purely about politics, where
> > > some idiotic vendor (namely HP) has convinced Intel that they really
> > > need it. To the point where some engineers seem to have bought into
> > > the whole thing and actually believe that fairy tale ("firmware can do
> > > better" - hah! They must be feeding people some bad drugs at the
> > > cafeteria)
> >
> > For the record I don't think it's a good idea for the BIOS to do
> > this (and I'm not aware of any engineer who does),
>
> There's really just two sane options:
>
> - complain about the BIOS corrupting CPU state and refusing to use the PMU
> - complain about the BIOS corrupting CPU state and using the PMU against the BIOS
>
> We went for the first one but i'll be more than glad to implement Linus's much
> more aggressive second option.
>
> Btw., for the record, the thing you have been advocating in the past was a
> third option: for the kernel to step aside quietly and to let the BIOS corrupt
> a counter or two. You even sent us some sort of BIOS specification about how to
> implement that. That's pretty much the worst solution imaginable.

Also seriously complicated by the kexec case where the previous kernel
didn't clean up PMU state. There is simply no sane way to detect if its
actually used and by whoem.

The whole PMU 'sharing' concept championed by Andi is utter crap.

As for simply using it despite the BIOS corrupting it, that might not
always work, the BIOS might simply over-write your state because it
one-sidedly declares to own the MSRs (observed behaviour).

Its all a big clusterfuck and really the best way (IMO) is what we have
now to put pressure on and force the BIOS vendors to play nice.

I assume both HP and DELL will be seriously unhappy with the kernel
spewing FIRMWARE BUG messages on boot on their boxen, the question is,
will they be unhappy enough to fix it..

Now Ingo's patch keeps the warning and lets you take the PMU back and
live with whatever consequences that brings (incorrect counts etc), that
might also work but puts less pressure on the vendors because things
appear to work.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/