Re: [RFC][PATCH 1/3] trace_seq: Move the trace_seq code to lib/

From: Steven Rostedt
Date: Fri Jun 20 2014 - 12:58:31 EST


On Thu, 19 Jun 2014 22:06:07 -0700
Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote:

> On Thu, 19 Jun 2014 17:33:30 -0400 Steven Rostedt <rostedt@xxxxxxxxxxx> wrote:
>
> > From: "Steven Rostedt (Red Hat)" <rostedt@xxxxxxxxxxx>
> >
> > The trace_seq functions are rather useful outside of tracing. Instead
> > of having it be dependent on CONFIG_TRACING, move the code into lib/
> > and allow other users to have access to it even when tracing is not
> > configured.
>
> What LT said. It's pileon time!
>
> > Signed-off-by: Steven Rostedt <rostedt@xxxxxxxxxxx>
> > ---
> > include/linux/trace_seq.h | 68 ++--------
> > kernel/trace/trace.c | 24 ----
> > kernel/trace/trace_output.c | 268 ---------------------------------------
> > kernel/trace/trace_output.h | 16 ---
> > lib/Makefile | 2 +-
> > lib/trace_seq.c | 303 ++++++++++++++++++++++++++++++++++++++++++++
>
> Putting it in there makes me look at it ;)

Damn it! If I had known this, I would have moved it there on day one.

You can't say you didn't know about this. I cc you on all my patches :-)

So next time, to get you to review my code, I just need to threaten to
add it to lib?

>
> > --- a/include/linux/trace_seq.h
> > +++ b/include/linux/trace_seq.h
> >
> > ...
> >
> > +#define SEQ_PUT_FIELD_RET(s, x) \
> > +do { \
> > + if (!trace_seq_putmem(s, &(x), sizeof(x))) \
>
> hm, does sizeof(x++) increment x? I guess it does.

I hate this macro. I moved it just to make the kernel build. I would
love to change it. It is more to do with tracing than trace_seq anyway.

>
> > + return TRACE_TYPE_PARTIAL_LINE; \
> > +} while (0)
> >
> >
> > ...
> >
> > +#define SEQ_PUT_HEX_FIELD_RET(s, x) \
> > +do { \
> > + BUILD_BUG_ON(sizeof(x) > MAX_MEMHEX_BYTES); \
> > + if (!trace_seq_putmem_hex(s, &(x), sizeof(x))) \
> > + return TRACE_TYPE_PARTIAL_LINE; \
> > +} while (0)
>
> Also has side-effects.

Yep, and it shouldn't be in the lib anyway. Again, HATE IT!

>
> > #endif /* _LINUX_TRACE_SEQ_H */
> >
> > ...
> >
> > --- /dev/null
> > +++ b/lib/trace_seq.c
> > @@ -0,0 +1,303 @@
> > +/*
> > + * trace_seq.c
> > + *
> > + * Copyright (C) 2008-2014 Red Hat Inc, Steven Rostedt <srostedt@xxxxxxxxxx>
> > + *
> > + */
> > +#include <linux/uaccess.h>
> > +#include <linux/seq_file.h>
> > +#include <linux/trace_seq.h>
> > +
> > +int trace_print_seq(struct seq_file *m, struct trace_seq *s)
>
> -ENODOC

Heh, I noticed this too, and almost sent out a 2/4 patch that added the
docs. I should add it first before moving it. But I was lazy and sent
this out because I was more interested in the NMI code feedback than
this.


>
> > +{
> > + int len = s->len >= PAGE_SIZE ? PAGE_SIZE - 1 : s->len;
>
> int = uint >= ulong ? ulog : uint
>
> that's spastic. Can we choose a type and stick to it?

Ah, yeah that should be fixed.

>
> Also, min().

Right.

>
> > + int ret;
> > +
> > + ret = seq_write(m, s->buffer, len);
> > +
> > + /*
> > + * Only reset this buffer if we successfully wrote to the
> > + * seq_file buffer.
>
> why?

Because we might want to try again. I'll update the comment.

>
> > + */
> > + if (!ret)
> > + trace_seq_init(s);
> > +
> > + return ret;
> > +}
> > +
> > +/**
> > + * trace_seq_printf - sequence printing of trace information
> > + * @s: trace sequence descriptor
> > + * @fmt: printf format string
> > + *
> > + * It returns 0 if the trace oversizes the buffer's free
> > + * space, 1 otherwise.
>
> s/oversizes/would overrun/?

It made sense when I first read it (Jiri wrote it). How about:

Returns 0 if the recorded print is bigger than the free space
of the buffer. Returns 1 otherwise.

>
> > + * The tracer may use either sequence operations or its own
> > + * copy to user routines. To simplify formating of a trace
> > + * trace_seq_printf is used to store strings into a special
> > + * buffer (@s). Then the output may be either used by
> > + * the sequencer or pulled into another buffer.
> > + */
> > +int
> > +trace_seq_printf(struct trace_seq *s, const char *fmt, ...)
>
> unneeded newline

OK.

>
> > +{
> > + int len = (PAGE_SIZE - 1) - s->len;
>
> int = ulong - uint;

Maybe it should just be initialized via a macro too?

>
> > + va_list ap;
> > + int ret;
> > +
> > + if (s->full || !len)
> > + return 0;
> > +
> > + va_start(ap, fmt);
> > + ret = vsnprintf(s->buffer + s->len, len, fmt, ap);
> > + va_end(ap);
> > +
> > + /* If we can't write it all, don't bother writing anything */
>
> This is somewhat unusual behavior for a write()-style thing. Comment
> should explain "why", not "what".

Yeah. It's like that because this was driven by the code that calls it,
and is pretty engrained in the process. But taking it out of it's
natural habitat, we need to explain these strange rituals that it does.

The why is, this is a buffer write, and users of it expect it to be all
or nothing, because a partial write is usually solved by flushing and
trying again with the same write arguments. If we did partial writes,
then the user would need another buffer to store the full write to be
able to just write out the rest of the line. That basically destroys
the point of trace_seq as it is suppose to be that extra buffer.

>
> > + if (ret >= len) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + s->len += ret;
> > +
> > + return 1;
> > +}
> > +EXPORT_SYMBOL_GPL(trace_seq_printf);
> > +
> > +/**
> > + * trace_seq_bitmask - put a list of longs as a bitmask print output
>
> Is that grammatical?

No but it's almost a haiku:

trace_seq_bitmask -
put a list of longs as a
bitmask print output


>
> > + * @s: trace sequence descriptor
> > + * @maskp: points to an array of unsigned longs that represent a bitmask
> > + * @nmaskbits: The number of bits that are valid in @maskp
> > + *
> > + * It returns 0 if the trace oversizes the buffer's free
> > + * space, 1 otherwise.
>
> Ditto

The beauty of cut-and-paste.

>
> > + * Writes a ASCII representation of a bitmask string into @s.
> > + */
> > +int
> > +trace_seq_bitmask(struct trace_seq *s, const unsigned long *maskp,
> > + int nmaskbits)
> > +{
> > + int len = (PAGE_SIZE - 1) - s->len;
> > + int ret;
> > +
> > + if (s->full || !len)
> > + return 0;
> > +
> > + ret = bitmap_scnprintf(s->buffer, len, maskp, nmaskbits);
> > + s->len += ret;
> > +
> > + return 1;
> > +}
> > +EXPORT_SYMBOL_GPL(trace_seq_bitmask);
>
> More dittos.

Confused. What dittos is this dittoing?

>
> > +/**
> > + * trace_seq_vprintf - sequence printing of trace information
> > + * @s: trace sequence descriptor
> > + * @fmt: printf format string
> > + *
> > + * The tracer may use either sequence operations or its own
> > + * copy to user routines. To simplify formating of a trace
> > + * trace_seq_printf is used to store strings into a special
>
> "trace_seq_printf()". Apparently it makes the kerneldoc output come
> out right.

OK.

>
> > + * buffer (@s). Then the output may be either used by
> > + * the sequencer or pulled into another buffer.
> > + */
> > +int
> > +trace_seq_vprintf(struct trace_seq *s, const char *fmt, va_list args)
> > +{
> > + int len = (PAGE_SIZE - 1) - s->len;
> > + int ret;
> > +
> > + if (s->full || !len)
> > + return 0;
> > +
> > + ret = vsnprintf(s->buffer + s->len, len, fmt, args);
> > +
> > + /* If we can't write it all, don't bother writing anything */
> > + if (ret >= len) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + s->len += ret;
> > +
> > + return len;
> > +}
> > +EXPORT_SYMBOL_GPL(trace_seq_vprintf);
>
> Several dittos.

Oh, just on the function. You're not dittoing a comment about the
EXPORT_SYMBOL_GPL() that you forgot to add, are you?

>
> > +int trace_seq_bprintf(struct trace_seq *s, const char *fmt, const u32 *binary)
>
> -ENODOC

Ditto

>
> > +{
> > + int len = (PAGE_SIZE - 1) - s->len;
> > + int ret;
> > +
> > + if (s->full || !len)
> > + return 0;
> > +
> > + ret = bstr_printf(s->buffer + s->len, len, fmt, binary);
> > +
> > + /* If we can't write it all, don't bother writing anything */
> > + if (ret >= len) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + s->len += ret;
> > +
> > + return len;
> > +}
>
> Dittos.

Dittos

>
> > +/**
> > + * trace_seq_puts - trace sequence printing of simple string
> > + * @s: trace sequence descriptor
> > + * @str: simple string to record
> > + *
> > + * The tracer may use either the sequence operations or its own
> > + * copy to user routines. This function records a simple string
> > + * into a special buffer (@s) for later retrieval by a sequencer
> > + * or other mechanism.
> > + */
> > +int trace_seq_puts(struct trace_seq *s, const char *str)
> > +{
> > + int len = strlen(str);
> > +
> > + if (s->full)
> > + return 0;
> > +
> > + if (len > ((PAGE_SIZE - 1) - s->len)) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + memcpy(s->buffer + s->len, str, len);
> > + s->len += len;
> > +
> > + return len;
> > +}
>
> Missing EXPORT_SYMBOL?

The others were used by the tracepoint infrastructure. But I'm not even
sure they are needed anymore because the ftrace.h file now calls
another function that does more work which is exported and then calls
these.

But if I'm going to add this to the library, they probably all should
be exported.

>
> > +int trace_seq_putc(struct trace_seq *s, unsigned char c)
> > +{
> > + if (s->full)
> > + return 0;
> > +
> > + if (s->len >= (PAGE_SIZE - 1)) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + s->buffer[s->len++] = c;
> > +
> > + return 1;
> > +}
> > +EXPORT_SYMBOL(trace_seq_putc);
>
> Mix of EXPORT_SYMBOL() and EXPORT_SYMBOL_GPL()

That's a mistake. They all should be EXPORT_SYMBOL_GPL(). Hmm, I caught
this once before and made a patch to fix it. That got lost somehow :-/


>
> > +int trace_seq_putmem(struct trace_seq *s, const void *mem, size_t len)
> > +{
> > + if (s->full)
> > + return 0;
> > +
> > + if (len > ((PAGE_SIZE - 1) - s->len)) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + memcpy(s->buffer + s->len, mem, len);
> > + s->len += len;
> > +
> > + return len;
> > +}
> > +
>
> Lotsa dittos

Ditto back at'cha

>
> > +#define HEX_CHARS (MAX_MEMHEX_BYTES*2 + 1)
> > +
> > +int trace_seq_putmem_hex(struct trace_seq *s, const void *mem, size_t len)
> > +{
> > + unsigned char hex[HEX_CHARS];
> > + const unsigned char *data = mem;
> > + int i, j;
> > +
> > + if (s->full)
> > + return 0;
>
> What's this ->full thing all about anyway? Some central comment which
> explains the design is needed.

Comment? What? Git blame isn't good enough for ya? ;-)

>
> Is this test really needed? trace_seq_putmem() will handle this.

It was added as an optimization, because once it filled up, you could
still have multiple calls to the trace_seq() functions that would waste
time trying to write the buffer.

It seemed like a good idea at the time. I Cc'd Johannes Berg as he's
the one that implemented.

Johannes, is this really needed, should we bother keeping it?

>
> > +#ifdef __BIG_ENDIAN
> > + for (i = 0, j = 0; i < len; i++) {
> > +#else
> > + for (i = len-1, j = 0; i >= 0; i--) {
> > +#endif
> > + hex[j++] = hex_asc_hi(data[i]);
> > + hex[j++] = hex_asc_lo(data[i]);
> > + }
> > + hex[j++] = ' ';
> > +
> > + return trace_seq_putmem(s, hex, j);
> > +}
>
> -ENODOC, missing EXPORT_SYMBOL.

Ditto.

>
> > +void *trace_seq_reserve(struct trace_seq *s, size_t len)
>
> `len' is a size_t here, a uint in trace_seq and an int when it's a local.

We like to be diverse.

>
> > +{
> > + void *ret;
> > +
> > + if (s->full)
> > + return NULL;
> > +
> > + if (len > ((PAGE_SIZE - 1) - s->len)) {
> > + s->full = 1;
> > + return NULL;
> > + }
> > +
> > + ret = s->buffer + s->len;
> > + s->len += len;
> > +
> > + return ret;
> > +}
>
> Dittos

Return dittos

>
> > +int trace_seq_path(struct trace_seq *s, const struct path *path)
> > +{
> > + unsigned char *p;
> > +
> > + if (s->full)
> > + return 0;
> > +
> > + if (s->len >= (PAGE_SIZE - 1)) {
> > + s->full = 1;
> > + return 0;
> > + }
> > +
> > + p = d_path(path, s->buffer + s->len, PAGE_SIZE - s->len);
> > + if (!IS_ERR(p)) {
> > + p = mangle_path(s->buffer + s->len, p, "\n");
> > + if (p) {
> > + s->len = p - s->buffer;
> > + return 1;
> > + }
> > + } else {
> > + s->buffer[s->len++] = '?';
> > + return 1;
> > + }
> > +
> > + s->full = 1;
> > + return 0;
> > +}
>
> Dittos

sottiD

>
> > +ssize_t trace_seq_to_user(struct trace_seq *s, char __user *ubuf, size_t cnt)
> > +{
> > + int len;
> > + int ret;
> > +
> > + if (!cnt)
> > + return 0;
> > +
> > + if (s->len <= s->readpos)
> > + return -EBUSY;
> > +
> > + len = s->len - s->readpos;
> > + if (cnt > len)
> > + cnt = len;
> > + ret = copy_to_user(ubuf, s->buffer + s->readpos, cnt);
> > + if (ret == cnt)
> > + return -EFAULT;
> > +
> > + cnt -= ret;
> > +
> > + s->readpos += cnt;
> > + return cnt;
> > +}
>
> Dittos

Tripple ditto!


Hey! Thanks for the review. Much appreciated. And maybe you should read
those messages in your /dev/null folder that I cc you with. :-)

-- Steve
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/