Re: [PATCH 4/4] tracing, page-allocator: Add a postprocessingscript for page-allocator-related ftrace events

From: Andrew Morton
Date: Tue Aug 04 2009 - 15:15:32 EST


On Tue, 04 Aug 2009 14:27:16 -0400
Rik van Riel <riel@xxxxxxxxxx> wrote:

> Andrew Morton wrote:
> > On Tue, 4 Aug 2009 19:12:26 +0100 Mel Gorman <mel@xxxxxxxxx> wrote:
> >
> >> This patch adds a simple post-processing script for the page-allocator-related
> >> trace events. It can be used to give an indication of who the most
> >> allocator-intensive processes are and how often the zone lock was taken
> >> during the tracing period. Example output looks like
> >>
> >> find-2840
> >> o pages allocd = 1877
> >> o pages allocd under lock = 1817
> >> o pages freed directly = 9
> >> o pcpu refills = 1078
> >> o migrate fallbacks = 48
> >> - fragmentation causing = 48
> >> - severe = 46
> >> - moderate = 2
> >> - changed migratetype = 7
> >
> > The usual way of accumulating and presenting such measurements is via
> > /proc/vmstat. How do we justify adding a completely new and different
> > way of doing something which we already do?
>
> Mel's tracing is more akin to BSD process accounting,
> where these statistics are kept on a per-process basis.

Is that useful? Any time I've wanted to find out things like this, I
just don't run other stuff on the machine at the same time.

Maybe there are some scenarios where it's useful to filter out other
processes, but are those scenarios sufficiently important to warrant
creation of separate machinery like this?

> Nothing in /proc allows us to see statistics on a per
> process basis on process exit.

Can this script be used to monitor the process while it's still running?



Also, we have a counter for "moderate fragmentation causing migrate
fallbacks". There must be hundreds of MM statistics which can be
accumulated once we get down to this level of detail. Why choose these
nine?


Is there a plan to add the rest later on?


Or are these nine more a proof-of-concept demonstration-code thing? If
so, is it expected that developers will do an ad-hoc copy-n-paste to
solve a particular short-term problem and will then toss the tracepoint
away? I guess that could be useful, although you can do the same with
vmstat.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/