[PATCHv2 00/30] perf stat: Introduce --per-task option

From: Jiri Olsa
Date: Sun Jun 14 2015 - 04:20:09 EST


hi,
adding the possibility to display stat data per thread.

Allowing following commands and output:

$ perf stat -e cycles,instructions --per-task -p 25388,25442 -a
^C
Performance counter stats for process id '25388,25442':

cat-25388 100,122 cycles
vim-25442 4,167,876 cycles
cat-25388 20,080 instructions
vim-25442 3,232,735 instructions

6.057130572 seconds time elapsed


Also works under interval mode:

$ perf stat -e cycles,instructions --per-task -p 25388,25442 -a -I 1000
# time task counts unit events
1.000190002 cat-25388 54,196 cycles
1.000190002 vim-25442 0 cycles
1.000190002 cat-25388 11,660 instructions
1.000190002 vim-25442 0 instructions
2.000742579 cat-25388 55,247 cycles
2.000742579 vim-25442 1,752,951 cycles
2.000742579 cat-25388 11,872 instructions
2.000742579 vim-25442 1,238,512 instructions
3.001057871 cat-25388 0 cycles
3.001057871 vim-25442 0 cycles
3.001057871 cat-25388 0 instructions
3.001057871 vim-25442 0 instructions
^C 3.242519732 cat-25388 0 cycles
3.242519732 vim-25442 0 cycles
3.242519732 cat-25388 0 instructions
3.242519732 vim-25442 0 instructions


v2 changes:
- renamed xyarray__zero to xyarray__reset
- using atomic_t for cpu_map/thread_map refcnt
- moving all stat functions into stat object
- using init_stats to initialize 'struct stats*'
- change TASK-$pid output into $comm-$pid
- updated doc with --per-task option

thanks for comments,
jirka


---
Jiri Olsa (30):
perf tools: Introduce xyarray__reset function
perf tools: Add thread_map__(alloc|realloc) helpers
perf tools: Change thread_map::map into struct
perf tools: Add comm string into struct thread_map
perf tests: Add thread_map object tests
perf tools: Add reference counting for cpu_map object
perf tools: Add reference counting for thread_map object
perf tools: Propagate cpu maps through the evlist
perf tools: Propagate thread maps through the evlist
perf tools: Make perf_evsel__(nr_)cpus generic
perf tools: Move perf_evsel__(alloc|free|reset)_counts into stat object
perf stat: Introduce perf_counts__(alloc|free|reset) functions
perf stat: Introduce perf_counts function
perf stat: Use xyarray for cpu evsel counts
perf stat: Make stats work over the thread dimension
perf stat: Rename struct perf_counts::cpu member to values
perf stat: Move perf_evsel__(alloc|free|reset)_stat_priv into stat object
perf stat: Move perf_evsel__(alloc|free)_prev_raw_counts into stat object
perf stat: Move perf_evlist__(alloc|free)_stats into evlist object
perf stat: Introduce perf_evsel__alloc_stats function
perf stat: Introduce perf_evsel__read function
perf stat: Introduce read_counters function
perf stat: Separate counters reading and processing
perf stat: Move zero_per_pkg into counter process code
perf stat: Move perf_stat initialization counter process code
perf stat: Remove perf_evsel__read_cb function
perf stat: Rename print_interval to process_interval
perf stat: Using init_stats instead of memset
perf stat: Introduce print_counters function
perf stat: Introduce --per-task option

tools/perf/Documentation/perf-stat.txt | 3 +
tools/perf/builtin-stat.c | 398 ++++++++++++++--------------
tools/perf/builtin-trace.c | 4 +-
tools/perf/tests/Build | 1 +
tools/perf/tests/builtin-test.c | 4 +
tools/perf/tests/code-reading.c | 4 +-
tools/perf/tests/keep-tracking.c | 4 +-
tools/perf/tests/mmap-basic.c | 4 +-
tools/perf/tests/mmap-thread-lookup.c | 2 +-
tools/perf/tests/openat-syscall-all-cpus.c | 9 +-
tools/perf/tests/openat-syscall-tp-fields.c | 2 +-
tools/perf/tests/openat-syscall.c | 6 +-
tools/perf/tests/switch-tracking.c | 4 +-
tools/perf/tests/tests.h | 1 +
tools/perf/tests/thread-map.c | 28 ++
tools/perf/util/auxtrace.c | 4 +-
tools/perf/util/cpumap.c | 25 +-
tools/perf/util/cpumap.h | 6 +-
tools/perf/util/event.c | 6 +-
tools/perf/util/evlist.c | 38 ++-
tools/perf/util/evlist.h | 1 -
tools/perf/util/evsel.c | 47 ++--
tools/perf/util/evsel.h | 43 ++-
tools/perf/util/parse-events.c | 5 +-
tools/perf/util/python-ext-sources | 2 +
tools/perf/util/python.c | 4 +-
tools/perf/util/record.c | 4 +-
tools/perf/util/session.c | 2 +-
tools/perf/util/stat.c | 134 ++++++++++
tools/perf/util/stat.h | 46 ++++
tools/perf/util/svghelper.c | 2 +-
tools/perf/util/thread_map.c | 131 +++++++--
tools/perf/util/thread_map.h | 17 +-
tools/perf/util/xyarray.c | 8 +
tools/perf/util/xyarray.h | 2 +
35 files changed, 672 insertions(+), 329 deletions(-)
create mode 100644 tools/perf/tests/thread-map.c
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/