Re: [RFCv1 4/4] perf: arm_spe: Dynamically switch PID tracing to contextidr

From: Leo Yan
Date: Sun Oct 24 2021 - 06:33:06 EST


Hi James,

On Fri, Oct 22, 2021 at 05:23:23PM +0100, James Clark wrote:
> On 22/10/2021 16:36, James Clark wrote:
> >
> >
> > On 21/10/2021 14:45, Leo Yan wrote:
> >> Now Arm64 provides API for enabling and disable PID tracing, Arm SPE
> >> driver invokes these functions to dynamically enable it during
> >> profiling when the program runs in root PID name space, and disable PID
> >> tracing when the perf event is stopped.
> >>
> >> Device drivers should not depend on CONFIG_PID_IN_CONTEXTIDR for PID
> >> tracing, so this patch uses the consistent condition for setting bit
> >> EL1_CX for PMSCR.
> >
> > Hi Leo,
> >
> > I've been testing this change, but I'm seeing something strange. Not sure
> > if it's a problem on my side or not yet. With this command:
> >
> > sudo ./perf record -vvv -e arm_spe//u -- taskset --cpu-list 1 bash -c ls
> >
> > I'm only seeing 0 values for context:
> >
> > sudo ./perf report -D | grep CONTEXT
> >
> > . 00038dce: 65 00 00 00 00 CONTEXT 0x0 el2
> > . 00038e0e: 65 00 00 00 00 CONTEXT 0x0 el2

Good catch! I reproduced this issue at my side and looked into the
flow, the root cause is relevant with timing.

When perf launches the program 'taskset --cpu-list 1 bash -c ls', it
forks a new process and 'ls' program is scheduled in, then function
arm_spe_pmu_start() invokes contextidr_enable() to enable the PID
tracing in contextidr. Since 'ls' program executes very short and it
simply runs to the end (so in the middle of 'ls' there have no any
context switching on the CPU), there have no any new PID is written
into contextidr and CPU's contextidr keeps zero. This is the reason
we see the context packets contain zeros for PID.

To fix this issue, we should enable PID tracing when setup AUX ring
buffer, at this phase, the profiled program has not been started yet.
So when the profiled program is scheduled in at the first time, PID
traing is getting ready and we can see the expected context packet in
Arm SPE trace data. So this patch should be updated as below, I will
apply it in next spin if no objection.

diff --git a/drivers/perf/arm_spe_pmu.c b/drivers/perf/arm_spe_pmu.c
index c21cf1385cc0..85aa2eab0c2e 100644
--- a/drivers/perf/arm_spe_pmu.c
+++ b/drivers/perf/arm_spe_pmu.c
@@ -876,6 +867,13 @@ static void *arm_spe_pmu_setup_aux(struct perf_event *event, void **pages,
buf->nr_pages = nr_pages;
buf->snapshot = snapshot;

+ /*
+ * Enable tracing PID to contextidr if profiling program runs in
+ * root PID namespace.
+ */
+ if (perfmon_capable() && (task_active_pid_ns(current) == &init_pid_ns))
+ contextidr_enable();
+
kfree(pglist);
return buf;

@@ -890,6 +888,9 @@ static void arm_spe_pmu_free_aux(void *aux)
{
struct arm_spe_pmu_buf *buf = aux;

+ if (perfmon_capable() && (task_active_pid_ns(current) == &init_pid_ns))
+ contextidr_disable();
+
vunmap(buf->base);
kfree(buf);
}

Thanks a lot for detailed testing!

Leo