[PATCH v7 0/2] perf/core: addressing 4x slowdown during per-process profiling of STREAM benchmark on Intel Xeon Phi
From: Alexey Budankov
Date: Fri Aug 18 2017 - 01:17:36 EST
Hi,
This patch set v7 moves event groups into rb trees and implements
skipping to the current CPU's list on hrtimer interrupt.
Events allocated for the same CPU are still kept in a linked list
hanging off the event that is directly attached to the tree, because
it is unclear how to implement fast iteration through events allocated
for the same CPU when they are all attached to a tree that employs
an additional 64-bit index as a secondary tree key.
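
For readers who want a picture of the layout described above, here is a
minimal userspace sketch. It is not the code from the patches: struct event,
groups_insert(), groups_first() and count_events_on_cpu() are hypothetical
names, and a plain unbalanced binary search tree stands in for the kernel
rb tree to keep the example short. One event per CPU is attached to the
tree, and further events for that CPU hang off it in a linked list.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for an event group; not the kernel's struct perf_event. */
struct event {
	int cpu;                      /* tree key: CPU the event is allocated for */
	int id;
	struct event *left, *right;   /* tree links, used by the tree-attached event */
	struct event *next_same_cpu;  /* further events allocated for the same CPU */
};

/* Attach ev to the tree, or append it to the list of its CPU's tree node. */
static void groups_insert(struct event **root, struct event *ev)
{
	assert(root && ev);
	ev->left = ev->right = ev->next_same_cpu = NULL;

	while (*root) {
		struct event *node = *root;

		if (ev->cpu < node->cpu) {
			root = &node->left;
		} else if (ev->cpu > node->cpu) {
			root = &node->right;
		} else {
			/* Same CPU: keep the tree untouched, extend the list. */
			while (node->next_same_cpu)
				node = node->next_same_cpu;
			node->next_same_cpu = ev;
			return;
		}
	}
	*root = ev;
}

/* Skip straight to the first event of the given CPU, or NULL if none. */
static struct event *groups_first(struct event *root, int cpu)
{
	while (root) {
		if (cpu < root->cpu)
			root = root->left;
		else if (cpu > root->cpu)
			root = root->right;
		else
			return root;
	}
	return NULL;
}

/* Visit only the events of one CPU instead of walking every event. */
static int count_events_on_cpu(struct event *root, int cpu)
{
	int n = 0;

	for (struct event *ev = groups_first(root, cpu); ev; ev = ev->next_same_cpu)
		n++;
	return n;
}
```

With this shape, a lookup by CPU descends the tree once and then walks only
that CPU's list, which is the kind of skipping the multiplexing interrupt
path is meant to benefit from.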
The patch set addresses feedback captured previously. Specifically,
the API with a callback in its signature is replaced by a macro, which
reduced the size of the adapting changes.
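
To illustrate the difference between the two API styles in general terms,
here is a small generic sketch. It is not taken from the patches: struct item,
item_list_walk() and item_list_for_each() are hypothetical names. With
a callback, each loop body has to be moved into a separate function and its
state passed through a void pointer. With an iteration macro, the body stays
inline at the call site, which tends to keep the changes to existing callers
smaller.

```c
#include <stddef.h>

/* Hypothetical list element used only for this illustration. */
struct item {
	int value;
	struct item *next;
};

/* Callback style: the loop lives in the helper, the body in a function. */
typedef void (*item_fn)(struct item *it, void *data);

static void item_list_walk(struct item *head, item_fn fn, void *data)
{
	for (struct item *it = head; it; it = it->next)
		fn(it, data);
}

static void add_value(struct item *it, void *data)
{
	*(int *)data += it->value;
}

static int sum_with_callback(struct item *head)
{
	int sum = 0;

	item_list_walk(head, add_value, &sum);
	return sum;
}

/* Macro style: the caller keeps its loop body and local variables inline. */
#define item_list_for_each(pos, head) \
	for ((pos) = (head); (pos) != NULL; (pos) = (pos)->next)

static int sum_with_macro(struct item *head)
{
	struct item *it;
	int sum = 0;

	item_list_for_each(it, head)
		sum += it->value;
	return sum;
}
```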
The patches in the set are expected to be applied one after another in
the order listed below. They are logically split into two parts
to simplify the review process.
For more background details and feedback on the patch set, please
refer to v6 and earlier versions.
Thanks,
Alexey
---
Alexey Budankov (2):
perf/core: use rb trees for pinned/flexible groups
perf/core: add mux switch to skip to the current CPU's events list on mux interrupt
include/linux/perf_event.h | 19 +-
kernel/events/core.c | 463 ++++++++++++++++++++++++++++++++++-----------
2 files changed, 364 insertions(+), 118 deletions(-)