[PATCH] perf stat: Fix uncore event mixed metric with workload error issue

From: Jin Yao
Date: Mon Apr 27 2020 - 10:43:34 EST


The metric mixed with uncore event is failed with workload.

root@lkp-csl-2sp5 ~# perf stat -M DRAM_BW_Use -- sleep 1
Error:
The sys_perf_event_open() syscall returned with 22 (Invalid argument) for event (uncore_imc/cas_count_read/).
/bin/dmesg | grep -i perf may provide additional information.

But it works without workload.

root@lkp-csl-2sp5 ~# perf stat -M DRAM_BW_Use
^C
Performance counter stats for 'system wide':

5.38 MiB uncore_imc/cas_count_read/
3.98 MiB uncore_imc/cas_count_write/
1550285007 ns duration_time

1.550285007 seconds time elapsed

The uncore events (e.g. uncore_imc/cas_count_read/ and
uncore_imc/cas_count_write/) have been forced to system wide but duration_time
is not.

Currently, the target system wide is set according to:

1. there's no workload specified
2. there is workload specified but all requested events are system wide events.

For DRAM_BW_Use, the target is not set to system wide since 2 is not
met. Next, the counter creation on CPU will run into some issues.

It should make sense to set the target system wide if there is workload
specified and at least one required event is system wide event.

With this patch,

root@lkp-csl-2sp5 ~# perf stat -M DRAM_BW_Use -- sleep 1

Performance counter stats for 'system wide':

4.81 MiB uncore_imc/cas_count_read/
2.89 MiB uncore_imc/cas_count_write/
1002019036 ns duration_time

1.002019036 seconds time elapsed

Fixes: e3ba76deef23 ("perf tools: Force uncore events to system wide monitoring")
Signed-off-by: Jin Yao <yao.jin@xxxxxxxxxxxxxxx>
---
tools/perf/builtin-stat.c | 13 ++++++++-----
1 file changed, 8 insertions(+), 5 deletions(-)

diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c
index 9207b6c45475..b01ee06b1965 100644
--- a/tools/perf/builtin-stat.c
+++ b/tools/perf/builtin-stat.c
@@ -1841,8 +1841,8 @@ static void setup_system_wide(int forks)
* conditions is met:
*
* - there's no workload specified
- * - there is workload specified but all requested
- * events are system wide events
+ * - there is workload specified but at least one requested
+ * event is system wide event
*/
if (!target__none(&target))
return;
@@ -1851,13 +1851,16 @@ static void setup_system_wide(int forks)
target.system_wide = true;
else {
struct evsel *counter;
+ bool system_wide = false;

evlist__for_each_entry(evsel_list, counter) {
- if (!counter->core.system_wide)
- return;
+ if (counter->core.system_wide) {
+ system_wide = true;
+ break;
+ }
}

- if (evsel_list->core.nr_entries)
+ if (evsel_list->core.nr_entries && system_wide)
target.system_wide = true;
}
}
--
2.17.1