[RFC][PATCH -tip 0/5 V2] tracing: kprobe-based event tracer
From: Masami Hiramatsu
Date: Fri Mar 20 2009 - 21:32:36 EST
Hi,
This is a series of patches which introduce a proof-of concept of
kprobe-based event tracer for x86. I decided to start it from
minimum support, so I shrank down the patchset to support only
x86(-32/-64). (If someone is interested in porting this to other
architectures, I'd happy to help :))
This can be applied on the linux-2.6-tip tree.
This patchset includes following changes:
- Add kprobe-tracer plugin [1/5, 3/5, 5/5]
- Fix kernel_trap_sp() on x86 according to systemtap runtime. [2/5]
- Add module_*probe api for repawning/removing kprobes when target
module is coming/going. [4/5]
Future items:
- Enhance module_*probe for batch unregistering support.
- Add kernel_trap_sp() and fetch_*() on other archs.
- Support name-based register fetching (ax, bx, and so on)
- Support symbol-based memory fetching (for global variables)
- Support primitive types(long, ulong, int, uint, etc) for args.
- Support indirect memory fetch from registers etc.
- Check insertion point safety by using instruction decoder.
We may need to separate above arch-dependent fetching infrastructure.
kprobe-based event tracer
---------------------------
This tracer is similar to the events tracer which is based on Tracepoint
infrastructure. Instead of Tracepoint, this tracer is based on kprobes(kprobe
and kretprobe). It probes anywhere where kprobes can probe(this means, all
functions body except for __kprobes functions).
Unlike the function tracer, this tracer can probe instructions inside of
kernel functions. It allows you to check which instruction has been executed.
Unlike the Tracepoint based events tracer, this tracer can add new probe points
on the fly.
Similar to the events tracer, this tracer doesn't need to be activated via
current_tracer, instead of that, just set probe points via
/debug/tracing/kprobe_probes.
Synopsis of kprobe_probes:
p SYMBOL[+offs|-offs]|MEMADDR [FETCHARGS] : set a probe
r SYMBOL[+0] [FETCHARGS] : set a return probe
FETCHARGS:
rN : Fetch Nth register (N >= 0)
sN : Fetch Nth entry of stack (N >= 0)
mADDR : Fetch memory at ADDR (ADDR should be in kernel)
aN : Fetch function argument. (N >= 1)(*)
rv : Fetch return value.(**)
rp : Fetch return address.(**)
(*) aN may not correct on asmlinkaged functions and at function body.
(**) only for return probe.
E.g.
echo p do_sys_open a1 a2 a3 a4 > /debug/tracing/kprobe_probes
This sets a kprobe on the top of do_sys_open() function with recording
1st to 4th arguments.
echo r do_sys_open rv rp >> /debug/tracing/kprobe_probes
This sets a kretprobe on the return point of do_sys_open() function with
recording return value and return address.
echo > /debug/tracing/kprobe_probes
This clears all probe points. and you can see the traced information via
/debug/tracing/trace.
echo /debug/tracing/trace
# tracer: nop
#
# TASK-PID CPU# TIMESTAMP FUNCTION
# | | | | |
<...>-2376 [001] 262.389131: do_sys_open: @do_sys_open+0 0xffffff9c 0x98db83e 0x8880 0x0
<...>-2376 [001] 262.391166: sys_open: <-do_sys_open+0 0x5 0xc06e8ebb
<...>-2376 [001] 264.384876: do_sys_open: @do_sys_open+0 0xffffff9c 0x98db83e 0x8880 0x0
<...>-2376 [001] 264.386880: sys_open: <-do_sys_open+0 0x5 0xc06e8ebb
<...>-2084 [001] 265.380330: do_sys_open: @do_sys_open+0 0xffffff9c 0x804be3e 0x0 0x1b6
<...>-2084 [001] 265.380399: sys_open: <-do_sys_open+0 0x3 0xc06e8ebb
@SYMBOL means that kernel hits a probe, and <-SYMBOL means kernel returns
from SYMBOL(e.g. "sys_open: <-do_sys_open+0" means kernel returns from
do_sys_open to sys_open).
Documentation/ftrace.txt | 66 ++++
arch/x86/include/asm/ptrace.h | 4 +-
include/linux/kprobes.h | 44 +++
kernel/kprobes.c | 252 +++++++++++++++
kernel/trace/Kconfig | 9 +
kernel/trace/Makefile | 1 +
kernel/trace/trace_kprobe.c | 693 +++++++++++++++++++++++++++++++++++++++++
7 files changed, 1067 insertions(+), 2 deletions(-)
Thank you,
--
Masami Hiramatsu
Software Engineer
Hitachi Computer Products (America) Inc.
Software Solutions Division
e-mail: mhiramat@xxxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/