On 2024-08-22 11:10 a.m., Ian Rogers wrote:
On Thu, Aug 22, 2024 at 7:32 AM Liang, Kan <kan.liang@xxxxxxxxxxxxxxx> wrote:
On 2024-08-22 9:24 a.m., James Clark wrote:
I rebased this one and made some other fixes so that I could test it,
so I thought I'd repost it here in case it's helpful. I also added a
new test.
But for the testing it all looks ok.
There is one small difference where it now shows "stalled-cycles-..."
as <not supported> events, when before it just didn't show them at all when
they weren't supported:
$ perf stat -- true
Performance counter stats for 'true':
0.66 msec task-clock # 0.384 CPUs utilized
0 context-switches # 0.000 /sec
0 cpu-migrations # 0.000 /sec
52 page-faults # 78.999 K/sec
<not counted> cpu_atom/instructions/ (0.00%)
978,399 cpu_core/instructions/ # 1.02 insn per cycle
<not counted> cpu_atom/cycles/ (0.00%)
959,722 cpu_core/cycles/ # 1.458 GHz
<not supported> cpu_atom/stalled-cycles-frontend/
<not supported> cpu_core/stalled-cycles-frontend/
Intel didn't support the events for a very long time. It would impact
many existing generations and all future generations.
The current method is to hide the non-exist events. The TopdownL1 is an
example. If it doesn't exist in the json file, perf stat will not
display it.
I don't think it's a good idea to disclose non-exist events in the perf
stat default.
The <not supported> doesn't help here, since there could be many reasons
that the perf tool fails to open a counter. It just provides a
misleading message for an event that never existed.
The list of "default" events, not metrics, similarly has "<not
supported>" in many configurations with "-dd" or "-ddd" on AMD. I'm
not sure the set of default events, at different detail levels, is
necessarily the best. The default events can also be a source of
multiplexing, for example, showing branch miss rate alongside topdown
metrics. Anyway, for the "<not supported>" we should probably be able
to tweak should_skip_zero_counter that is in stat-display.c and tag
these default events as "skippable".
The "skippable" should be fine as long as it's completely hidden.
BTW: The stalled-cycles-backend should be similar to the
stalled-cycles-frontend, but it isn't shown in the example. Is the
stalled-cycles-backend event missed?
Thanks,
Kan