Re: XArray documentation

From: Matthew Wilcox
Date: Fri Nov 24 2017 - 12:03:13 EST


On Fri, Nov 24, 2017 at 05:50:41PM +0100, Martin Steigerwald wrote:
> Matthew Wilcox - 24.11.17, 02:16:
> > ======
> > XArray
> > ======
> >
> > Overview
> > ========
> >
> > The XArray is an array of ULONG_MAX entries. Each entry can be either
> > a pointer, or an encoded value between 0 and LONG_MAX. It is efficient
> > when the indices used are densely clustered; hashing the object and
> > using the hash as the index will not perform well. A freshly-initialised
> > XArray contains a NULL pointer at every index. There is no difference
> > between an entry which has never been stored to and an entry which has most
> > recently had NULL stored to it.
>
> I am no kernel developer (just provided a tiny bit of documentation a long
> time ago)â but on reading into this, I missed:
>
> What is it about? And what is it used for?
>
> "Overview" appears to be already a description of the actual implementation
> specifics, instead ofâ well an overview.
>
> Of course, I am sure you all know what it is forâ but someone who wants to
> learn about the kernel is likely to be confused by such a start.

Hi Martin,

Thank you for your comment. I'm clearly too close to it because even
after reading your useful critique, I'm not sure what to change. Please
help me!

Maybe it's that I've described the abstraction as if it's the
implementation and put too much detail into the overview. This might
be clearer?

The XArray is an abstract data type which behaves like an infinitely
large array of pointers. The index into the array is an unsigned long.
A freshly-initialised XArray contains a NULL pointer at every index.

----
and then move all this information into later paragraphs:

There is no difference between an entry which has never been stored to
and an entry which has most recently had NULL stored to it.
Each entry in the array can be either a pointer, or an
encoded value between 0 and LONG_MAX.
While you can use any index, the implementation is efficient when the
indices used are densely clustered; hashing the object and using the
hash as the index will not perform well.