Re: [PATCH 1/8] perf, tools: Support handling complete branch stacks as histograms

From: Namhyung Kim
Date: Mon Nov 17 2014 - 01:23:21 EST


Hi Andi,

On Wed, 12 Nov 2014 00:31:53 +0100, Andi Kleen wrote:
> Sorry for the long delay. Just revisiting that.
>
> On Wed, Oct 22, 2014 at 10:03:51AM +0900, Namhyung Kim wrote:
>> > | | f2 tcall.c:5
>> > | | f1 tcall.c:12
>> > | | f1 tcall.c:12
>> > | | f2 tcall.c:7
>> > | | f2 tcall.c:5
>> > | | f1 tcall.c:11
>>
>> I think it'd be better if it just prints function names as normal
>> callchain does (and optionally srcline with a switch) and duplicates
>> removed like below:
>>
>> 54.91% tcall.c:6 [.] f2 tcall
>> |
>> |--65.53%-- f2 tcall.c:5
>> | |
>> | |--70.83%-- f1
>> | | main
>> | | f1
>> | | f2
>> | | f1
>> | | f2
>
> I considered this. For this example it doesn't make much difference
> because the functions are so small.
>
> But for anything larger I really need the line numbers to make
> sense of it.
>
> So I prefer to keep them. I'll look into some easy switch
> to turn them off though.

Oh, I'm not just removing line numbers - it also removed duplicates (f1
and f2). But having both from/to entries, I'm not sure it's worth tho..


>
>
>> > + if (sort__has_parent && !*parent &&
>> > + symbol__match_regex(al.sym, &parent_regex))
>> > + *parent = al.sym;
>> > + else if (have_ignore_callees && root_al &&
>> > + symbol__match_regex(al.sym, &ignore_callees_regex)) {
>> > + /* Treat this symbol as the root,
>> > + forgetting its callees. */
>> > + *root_al = al;
>> > + callchain_cursor_reset(&callchain_cursor);
>> > + }
>> > + if (!symbol_conf.use_callchain)
>> > + return -EINVAL;
>>
>> This check already went away.
>>
>> And, to remove duplicates, I think we need to check last callchain
>> cursor node wrt the callchain_param.key here.
>
> I don't understand the comment. I'm not modifying anything
> that has been already added to the callchain. Just things
> to be added in the future. So why would I need to check
> or change the cursor?

But didn't you already do it (with ips[first_call]) to remove overlaps
between LBR and normal callchain?


>
>>
>> Also, by comparing 'from' address, I'd expect you add the from address
>> alone but you add both of 'from' and 'to'. Do we really need to do
>> that?
>
> Adding from and to makes it much clearer to the user what happens,
> especially with conditional branches, so they can follow the
> control flow.

But it could be confusing too - esp. when it moves from LBR to normal
callchains? Hmm.. maybe we can print them bit differently.


>
>
>> And the first address saved in normal callchain is address of the
>> function itself so it might be 'to' you need to check if sampled before
>> any branch in a function.
>
> I'm checking against the CALL, not the target.

Yeah, but I'm afraid that it'd always fail to find a match.


>
>>
>> > + } else
>> > + be[i] = branch->entries[branch->nr - i - 1];
>> > + }
>> > +
>> > + nr = remove_loops(be, nr);
>> > +
>> > + for (i = 0; i < nr; i++) {
>> > + err = add_callchain_ip(machine, thread, parent,
>> > + root_al,
>> > + -1, be[i].to);
>> > + if (!err)
>> > + err = add_callchain_ip(machine, thread,
>> > + parent, root_al,
>> > + -1, be[i].from);
>> > + if (err == -EINVAL)
>> > + break;
>> > + if (err)
>> > + return err;
>> > + }
>> > + chain_nr -= nr;
>>
>> I'm not sure this line is needed.
>
> Without that i could exceed the limit.

What limit?

Let's say there's a callchains and LBR records below..

callchain:
f1 <- f2 <- f3 <- f4 <- f5

LBR
(f1<-f1) <- (f1<-f2)

So two entries are matched, we have nr = 2, first_call = 2 and chain_nr
= 5 right? So IIUC above code will print callchains like this:

- f1
- f1
- f1
- f2
- f3

while I expect below (with duplicates for now):

- f1
- f1
- f1
- f2
- f3
- f4
- f5

Do I miss something?

Thanks,
Namhyung
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/