Re: [PATCH v3 0/1] Compactly make code examples into literal blocks

From: Peter Lister
Date: Mon Mar 30 2020 - 18:29:39 EST


On 27/03/2020 16:41, Jonathan Corbet wrote:

On Fri, 27 Mar 2020 13:28:54 +0200
Jani Nikula <jani.nikula@xxxxxxxxxxxxxxx> wrote:

IMHO the real problem is kernel-doc doing too much preprocessing on the
input, preventing us from doing what would be the sensible thing in
rst. The more we try to fix the problem by adding more kernel-doc
processing, the further we dig ourselves into this hole.

If kernel-doc didn't have its own notion of section headers, such as
"example:", we wouldn't have this problem to begin with. We could just
use the usual rst construct; "example::" followed by an indented block.

I'm not going to stand in the way of the patch, but I'm telling you,
this is going to get harder, not easier, on this path.
I agree with you in principle. The problem, of course, is that this is a
legacy gift from before the RST days and it will be hard to change.

A quick grep shows that the pattern:

* Example:

appears nearly 100 times in current kernels. It is not inconceivable to
make a push to get rid of all of those, turning them into ordinary RST
syntax - especially since not all of those are actually kerneldoc
comments.

The same quick grep says that "returns?:" appears about 10,000 times.
*That* will be painful to change, and I can only imagine that some
resistance would have to be overcome at some point.

So what do folks think we should do? :)

I want to ponder on this for a bit. Peter, that may mean that I hold this
patch past the 5.7 merge window, which perhaps makes sense at this point
anyway, sorry. But I really would like to push things into a direction
that moves us away from gnarly perl hacks and toward something more
maintainable in the long term.

I would have been surprised if it had been accepted as is.

Matthew and Greg, thanks for reviewing - I have a feeling you might need to do this a few times more.

Over the past few days, I too have been pondering, certain thatthis patch, a mini tweak of the existing kernel-doc, is not theright answer.Equally, I'm unconvinced that the "right" answer is a wholesale move to ReST, so where's the happy medium?

<alert: long email, tldr: "Finding the happy medium on kerneldoc layout in C source comments">

A week or two back, I tried to fix doc build "indentation" warnings due to return value listsin sfb-bus.c. Russell King didn't like my patch saying "I think it's more important that the documentation interferes to a minimal degree with the code in the file".[ Mauro's patch to sfb-bus.c is now in linux-next. He fixed the problem with a bullet list - thriftier with the line breaks andthe official fix in the kerneldoc notes - but I'll argue that a simple list or definition list might be more appropriate. ]

Russell and Matthew argue that the primary purpose of source annotation is to aid developers and that any significant detraction (verbosity, whitespace) is not excused by prettier docs. FWIW, my background is sysadmin (much perl) and system/kernel programming (mostly C) and I agree with them.

Jani, if you see risks of complexity and maintenance problems, then so do I. But the point of kernel-doc is surely to be a specific semantic markup which works for developers and maintainers and allows doc authors access to their annotation. The format clearly needs thought,perl is less fashionable than it was and kernel-doc - er - needs work, but I don't see the *idea* as inherently broken. I suggest that there would be good developer buy-in to searchable pretty docs if a few compact idioms reliably did The Right Thing and didn't spit out doc build warnings.

The build warnings I currently observe do not tell me "the author got it wrong", but rather "kernel-doc didn't understand the author's intention".

Consider this excerpt from the kerneldoc comment for bitmap_cut(), recently added to lib/bitmap.c. (NB - this is an example in the description section - the parameters, including src, have already been described).

Â* In pictures, example for a big-endian 32-bit architecture:
Â*
Â* @src:
Â* 31ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ 63
Â* |ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ |
Â* 10000000 11000001 11110010 00010101Â 10000000 11000001 01110010 00010101
Â*ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ | |ÂÂÂÂÂÂÂÂÂÂÂÂÂ |ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ |
Â*ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ 16 14ÂÂÂÂÂÂÂÂÂÂÂÂ 0ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ 32
Â*

This fails because 1) the diagram isn't made literal (and so generates a indentation build warning) and 2) "@src:" is interpreted as an extra definition of src which scrunges the first one. I find it hard to assert that the author's intentions were wrong; it's the kind of good annotation we should hope for in a newly added function. If "* @src:" gained an double colon and we guaranteed that, after a description section header, references to a parameter didn't overwrite the original definition, this would work fine.

My instinct is to fix doc build issues with minimal changes: not actual ReST, but clear idioms reliably generating good ReST. This should be accompanied by tests for developers and reviewers so that we can have a fair stab at getting it right first time and (of course) documentation.

Could I ask anyone who disagrees to suggest their preferred way to lay out the comment for bitmap_cut()?

One head-on approachis to literalise *all* kerneldoc comments for functions and structures. The kerneldoc keywords then serve only to generate links; the ReST output is minimal but guaranteed validand warning free. Would any readers of API docs be inconvenienced? The target readership are presumably programmers, and the searchability of the sphinx RTD is more useful to me than the formatting.

All the best,
Peter