Re: [PATCH RFC] x86: avoid atomic operation intest_and_set_bit_lock if possible

From: Robert Richter
Date: Fri Mar 25 2011 - 12:09:08 EST


On 25.03.11 06:21:16, Peter Zijlstra wrote:
> On Fri, 2011-03-25 at 10:22 +0100, Ingo Molnar wrote:
> > * Andi Kleen <andi@xxxxxxxxxxxxxx> wrote:

> > > For the record I don't think it's a good idea for the BIOS to do
> > > this (and I'm not aware of any engineer who does),
> >
> > There's really just two sane options:
> >
> > - complain about the BIOS corrupting CPU state and refusing to use the PMU
> > - complain about the BIOS corrupting CPU state and using the PMU against the BIOS
> >
> > We went for the first one but i'll be more than glad to implement Linus's much
> > more aggressive second option.
> >
> > Btw., for the record, the thing you have been advocating in the past was a
> > third option: for the kernel to step aside quietly and to let the BIOS corrupt
> > a counter or two. You even sent us some sort of BIOS specification about how to
> > implement that. That's pretty much the worst solution imaginable.

Option 2 wont work, I have seen BIOSes that block access to the
counter registers, then there is no way for the OS to take over
control.

So, if you want to use perf anyway on such systems, you will have to
implement option 3 to mark the counter as "reserved" ...


> Also seriously complicated by the kexec case where the previous kernel
> didn't clean up PMU state. There is simply no sane way to detect if its
> actually used and by whoem.
>
> The whole PMU 'sharing' concept championed by Andi is utter crap.

... but this seems not to be an option.

>
> As for simply using it despite the BIOS corrupting it, that might not
> always work, the BIOS might simply over-write your state because it
> one-sidedly declares to own the MSRs (observed behaviour).
>
> Its all a big clusterfuck and really the best way (IMO) is what we have
> now to put pressure on and force the BIOS vendors to play nice.
>
> I assume both HP and DELL will be seriously unhappy with the kernel
> spewing FIRMWARE BUG messages on boot on their boxen, the question is,
> will they be unhappy enough to fix it..

So, we better stick then with option 1. My experience is that new
system's bioses try not to claim perfctrs (affected systems I have
seen are about 2-3 years old), but I am not really sure here.

> Now Ingo's patch keeps the warning and lets you take the PMU back and
> live with whatever consequences that brings (incorrect counts etc), that
> might also work but puts less pressure on the vendors because things
> appear to work.

And yes, using the counter anyway may corrupt counter values.

-Robert


--
Advanced Micro Devices, Inc.
Operating System Research Center

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/