Re: [PATCH v3 6/9] perf, x86: handle multiple records in PEBS buffer

From: Peter Zijlstra
Date: Fri Jul 25 2014 - 10:07:17 EST


On Fri, Jul 25, 2014 at 04:34:44PM +0800, Yan, Zheng wrote:
> On 07/25/2014 04:10 PM, Peter Zijlstra wrote:
> > On Tue, Jul 22, 2014 at 04:09:59PM +0800, Yan, Zheng wrote:
> >> One corner case needs to mention is that the PEBS hardware doesn't
> >> deal well with collisions, when PEBS events happen near to each
> >> other. The records for the events can be collapsed into a single
> >> one. However in practice collisions are extremely rare, as long as
> >> different events are used. The periods are typically very large,
> >> so any collision is unlikely. When collision happens, we can either
> >> drop the PEBS record or use the record to serve multiple events.
> >> This patch chooses the later approach.
> >
> > You can't.. the events might have different security context.
> >
> > Remember, the overflow bit is set from the overflow until the PEBS
> > event is generated, this is quite a long time. So if another PEBS event
> > gets generated while the other is still pending it will have both bits
> > set. Even though the second bit is for another (unrelated) counter.
> >
> > The unrelated counter might not have privilege to observe the data of
> > the generated event.
> >
> > I think you can unwind and fully correct this trainwreck.
>
> could you give more information how to do this.

We went over that already:

lkml.kernel.org/r/20140528170531.GT30445@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Now ignore the patch there, its nonsense.

But the idea is that the bit gets cleared upon writing the PEBS record.
So look to the next record and see which bit got cleared.

Furthermore, we know that all bits set at PMI time are in-progress and
can therefore be cleared from the last record.

This should allow us to iterate the entire thing backwards and provide
a unique event for each record.

So take this series of 2 records and a PMI:

C0 C1 C3 C4
---------------
O
| O
| |
| A < R1
| O
A | < R2
---------+----- < PMI

O - overflow
A - assist

So at PMI time we have C3 set in the overflow mask, our last even R2
will have both C0 and C3 set, we clear C3 because we know it cannot have
been that. Then for R1 we have C0 and C1 set, but because R2 was C0 we
can clear C0 from R1, finding it was indeed C1.

So typically we'd have one event set and no problem, but in case there's
more we can reconstruct with such a backwards pass from a known good
state.

But when in doubt, we should drop the record, its the safest choice.

Attachment: pgpWJdPy5XKRO.pgp
Description: PGP signature