Re: [PATCH v6 1/7] perf/core: Define the common branch type classification
From: Michael Ellerman
Date: Mon Jul 10 2017 - 22:13:53 EST
Segher Boessenkool <segher@xxxxxxxxxxxxxxxxxxx> writes:
> Hi!
>
> On Mon, Jul 10, 2017 at 07:46:17PM +0800, Jin, Yao wrote:
>> 1. We all agree these definitions:
>>
>> + PERF_BR_COND = 1, /* conditional */
>> + PERF_BR_UNCOND = 2, /* unconditional */
>> + PERF_BR_IND = 3, /* indirect */
>> + PERF_BR_CALL = 4, /* call */
>> + PERF_BR_IND_CALL = 5, /* indirect call */
>> + PERF_BR_RET = 6, /* return */
>> + PERF_BR_SYSCALL = 7, /* syscall */
>> + PERF_BR_SYSRET = 8, /* syscall return */
>> + PERF_BR_IRET = 11, /* return from interrupt */
>
> Do we? It does not map very well to PowerPC branch types.
I think they map well enough to the types of branches that are actually
used in practice.
To represent the full range of possibilities we'd need to switch to a
bitmap of flags, ie. COND, IND, CALL, RET, SYSCALL, INT, etc. But it
would need more than 4 bits and I don't think there's that much added
value in being able to represent all the bizarre combinations.
But maybe that is the best option as it makes the API more flexible and
means we don't have to get the list of branches correct up front?
I ran some quick numbers on a kernel I had here (powernv w/gcc 7):
Type Percent
-----------------
cond 40.92% beq (79166) bne (57379) ble (10411) bgt (9587) blt (6248) bge (3704) bdnz (1251) bdz (353) bns (30) bdnzf (2) bdnzt (1)
uncond 14.89% b (61182)
indirect 0.10% bctr (418)
call 33.33% bl (136926)
ind call 1.44% bctrl (5912)
return 9.23% blr (37943)
= 99.91%
If we add cond call/return that covers another 0.08% taking us to 99.99%
of branches.
I know future compilers and or different code might use a different
distribution, but I doubt it will change all that much.
Maybe cond could be broken down further, but the only really meaningful
sub category I can think of is the decrementing type, and those are
quite rare.
cheers