Overhead of ring buffer in Ftrace

From: Fang Zhou
Date: Fri Aug 02 2019 - 01:42:16 EST

Next message: Viresh Kumar: "[PATCH V3 1/2] cpufreq: schedutil: Don't skip freq update when limits change"
Previous message: Chester Lin: "[PATCH] efi/arm: fix allocation failure when reserving the kernel base"
Next in thread: Fang Zhou: "Re: Overhead of ring buffer in Ftrace"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hi all,

Iâm currently using Ftrace with tracepoints to trace several events in
kernel. But I found the tracing overhead is a little high.

I found the major overhead comes from
âlocal_dec(&cpu_buffer->committing);â in rb_end_commit() function.
local_dec() will invoke atomic_long_dec(), which finally performs
LOCK_PREFIX plus "DECQ" on this variable.

I'm a little confused. cpu_buffer is a per-cpu buffer. Therefore, I
cannot come up with a scenario that two core runs INC or DEC on the
same per-cpu value at the same time.
So, why do we use such heavy-overhead operation here? Can we just
simply use "DECQ" without LOCK_PREFIX?

Thanks,
Tim

Next message: Viresh Kumar: "[PATCH V3 1/2] cpufreq: schedutil: Don't skip freq update when limits change"
Previous message: Chester Lin: "[PATCH] efi/arm: fix allocation failure when reserving the kernel base"
Next in thread: Fang Zhou: "Re: Overhead of ring buffer in Ftrace"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]