Re: [PATCH 0/11] LTTng-core (basic tracing infrastructure) 0.5.108

From: Martin Bligh
Date: Thu Sep 14 2006 - 16:56:17 EST


Ingo Molnar wrote:
* Martin Bligh <mbligh@xxxxxxxxxx> wrote:


if there are lots of tracepoints (and the union of _all_ useful tracepoints that i ever encountered in my life goes into the thousands) then the overhead is not zero at all.

also, the other disadvantages i listed very much count too. Static tracepoints are fundamentally limited because:

- they can only be added at the source code level

- modifying them requires a reboot which is not practical in a
production environment

- there can only be a limited set of them, while many problems need
finegrained tracepoints tailored to the problem at hand

- conditional tracepoints are typically either nonexistent or very
limited.

for me these are all _independent_ grounds for rejection, as a generic kernel infrastructure.

I don't think anyone is saying that static tracepoints do not have their limitations, or that dynamic tracepointing is useless. But that's not the point ... why can't we have one infrastructure that supports both? Preferably in a fairly simple, consistent way.


primarily because i fail to see any property of static tracers that are not met by dynamic tracers. So to me dynamic tracers like SystemTap are a superset of static tracers.

1. They're harder to maintain out of tree.
2. they're written in some jibberish awk crap
3. They're slower. If you're doing thousands of tracepoints a second,
into a circular 8GB log buffer, that *does* matter. You want
to peturb what you're measuring as little as possible.

If you're running across thousands of systems, in live production, in
order to catch a rare race condition, the performance does matter.

So my position is that what we should concentrate on is to make the life of dynamic tracers easier (be that a handful of generic, parametric hooks that gather debuginfo information and add NOPs for easy patching), while realizing that static tracers have no advantage over dynamic tracers.

I'm confused. You're saying that the dynamic tracers need help by
adding some static data to the kernel, and yet at the same time
rejecting static additions to the kernel on the grounds they have
no value???

Perhaps we're just meaning different things by static tracing. To me,
what is important is that there is a well-defined place in the source
code where the data needed to be logged, and the exact place to log
it at, is defined. If all that macro does to the compilation is add
a couple of nops, and make an entry in a symbol data, or other debug
data, for something to hook into later that's *fine*. The point is
to maintain the location and intelligence about *what* to trace.

Perhaps I'm calling that static, and you're calling it dynamic? Would
explain why we're disagreeing ;-) Seems to be exactly what you're
suggesting above?

If we want it to be superfast, we could compile with a different config option to insert some tracing statically in there or something, but I
agree it should not be the default.

i.e. why add infrastructure for the sake of something that is clearly inferior? I have no problem with adding infrastructure for SystemTap, but i am asking the question: is it worth adding a static tracer?

Yes ;-) Realise that your usage model is not exactly the same as
everyone else's, and I don't give a damn if I have to recompile. I
realise other people do, but ....

I would of course accept static tracers too if someone proved it that they offer something that dynamic tracers cannot do.

Can you *really* trace *any* variable (stack variables, etc) at *any*
point within *any* function with kprobes? It didn't do that before,
and I find it hard to see how it could, given compiler optimizations,
etc.

(Just like i would accept the reintroduction of the Big Kernel Lock too, if someone proved it that it's the right thing to do.)

Surely it's still there at the moment? ;-)

M.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/