Re: perf overlapping maps...
From: David Miller
Date: Wed Oct 24 2018 - 17:30:48 EST
From: Jiri Olsa <jolsa@xxxxxxxxxx>
Date: Wed, 24 Oct 2018 13:34:16 +0200
> On Tue, Oct 23, 2018 at 11:15:03AM -0700, David Miller wrote:
>> From: Arnaldo Carvalho de Melo <acme@xxxxxxxxxx>
>> Date: Tue, 23 Oct 2018 15:05:03 -0300
>>
>> > IIRC this was first done for 'perf record', where we have to stash those
>> > events in the perf.data file, to then, later, 'perf report' to process
>> > those, so when working on 'perf top', it just reuses that machinery.
>> >
>> > Sure, with some love and care 'perf top' could do better and update all
>> > the data structures directly :-)
>>
>> Thanks for the history, it is useful information :)
>>
>> > Anyway, have you guys considered tweaking using event->header.misc |=
>> > PERF_RECORD_MISC_USER? The kernel leaves that as zero for the
>> > PERF_RECORD_FORK it emits:
>>
>> I really would like to steer the approach away from using UAPI
>> perf_event fields in an internal way.
>>
>> I am really very sorry for suggesting such a scheme myself in the
>> first place. It really was a bad idea upon much consideration.
>>
>> The synthetic fork is not really a fork, it's more like a "create".
>>
>> And this fundamental semantic difference is why we have all of these
>> issues wrt. handling COMM and parent map inheritance.
>>
>> There is also a bunch of non-trivial code to deal with whether we
>> synthetically create the child or the parent first, wrt. finding
>> thread leaders and parent threads.
>>
>> What I'm trying to say is that there is a clean design based solution
>> hiding somewhere in here and I'd like to find it :-)
>
> how about adding a data file marker/event when the synthesized
> portion of data is over
>
> attached patch adds an 'SYNTHESIZE_END' event and prevents
> parent's maps cloning on fork until that event is found
>
> we would need more code to stay backward compatible, which
> I did not include.. just to cleanly outline the solution
I appreciate everyone reiterating the ABI issue. Wherein, perf record
stashes these events into a file and later analysis tools need to
interpret them.
Considering this specifically, it is very tempting to make a new event
representing this thread "CREATE" operation.
But, if we make a new event then existing tools will break because
they will see this unrecognizable event and not create the threads.
>From this perspective setting a misc flag in the FORK event might in
fact be the best compromise.
Existing tools will still create the thread, in exactly the same
existing way. This keeps things working as well as they do currently.
So my vote returns to emitting FORK with the special COMM flag set in
the misc flags. And we document this properly of course.
Any objections? I'll cook up a formal patch meanwhile in case we do
agree.