Re: XArray documentation
From: Andreas Dilger
Date: Thu Nov 23 2017 - 23:30:39 EST
On Nov 23, 2017, at 6:16 PM, Matthew Wilcox <willy@xxxxxxxxxxxxx> wrote:
>
> Here's the current state of the documentation for the XArray. Suggestions
> for improvement gratefully received.
>
> ======
> XArray
> ======
>
> Overview
> ========
>
> The XArray is an array of ULONG_MAX entries. Each entry can be either
> a pointer, or an encoded value between 0 and LONG_MAX. It is efficient
> when the indices used are densely clustered; hashing the object and
> using the hash as the index will not perform well. A freshly-initialised
> XArray contains a NULL pointer at every index. There is no difference
> between an entry which has never been stored to and an entry which has most
> recently had NULL stored to it.
>
> Pointers to be stored in the XArray must have the bottom two bits clear
> (ie must point to something which is 4-byte aligned). This includes all
> objects allocated by calling :c:func:`kmalloc` and :c:func:`alloc_page`,
> but you cannot store pointers to arbitrary offsets within an object.
> The XArray does not support storing :c:func:`IS_ERR` pointers; some
> conflict with data values and others conflict with entries the XArray
> uses for its own purposes. If you need to store special values which
> cannot be confused with real kernel pointers, the values 4, 8, ... 4092
> are available.
Thought - if storing error values into the XArray in addition to regular
pointers is important for some use case, it would be easy to make
"ERR_PTR_XA()", "PTR_ERR_XA()", and "IS_ERR_XA()" macros that just shift
the error values up and down by two bits to avoid the conflict. That
would still allow error values up (down) to -1023 to be stored without
any chance of a pointer conflict, which should be enough.
> Each non-NULL entry in the array has three bits associated with it called
> tags. Each tag may be flipped on or off independently of the others.
> You can search for entries with a given tag set.
How can it be 3 tag bits, if the pointers only need to be 4-byte aligned?
> An unusual feature of the XArray is the ability to tie multiple entries
> together. Once stored to, looking up any entry in the range will give
> the same result as looking up any other entry in the range. Setting a
> tag on one entry will set it on all of them. Multiple entries can be
> explicitly split into smaller entries, or storing NULL into any entry
> will cause the XArray to forget about the tie.
>
> Normal API
> ==========
>
> Start by initialising an XArray, either with :c:func:`DEFINE_XARRAY`
> for statically allocated XArrays or :c:func:`xa_init` for dynamically
> allocated ones.
>
> You can then set entries using :c:func:`xa_store` and get entries using
> :c:func:`xa_load`. xa_store will overwrite a non-NULL entry with the
> new entry. It returns the previous entry stored at that index. You can
> conditionally replace an entry at an index by using :c:func:`xa_cmpxchg`.
> Like :c:func:`cmpxchg`, it will only succeed if the entry at that
> index has the 'old' value. It also returns the entry which was at
> that index; if it returns the same entry which was passed as 'old',
> then :c:func:`xa_cmpxchg` succeeded.
>
> If you want to store a pointer, you can do that directly. If you want
> to store an integer between 0 and LONG_MAX, you must first encode it
> using :c:func:`xa_mk_value`. When you retrieve an entry from the XArray,
> you can check whether it is a data value by calling :c:func:`xa_is_value`,
> and convert it back to an integer by calling :c:func:`xa_to_value`.
>
> You can enquire whether a tag is set on an entry by using
> :c:func:`xa_get_tag`. If the entry is not NULL, you can set a tag on
> it by using :c:func:`xa_set_tag` and remove the tag from an entry by
> calling :c:func:`xa_clear_tag`. You can ask whether any entry in the
> XArray has a particular tag set by calling :c:func:`xa_tagged`.
>
> You can copy entries out of the XArray into a plain array by
> calling :c:func:`xa_get_entries` and copy tagged entries by calling
> :c:func:`xa_get_tagged`. Or you can iterate over the non-NULL entries
> in place in the XArray by calling :c:func:`xa_for_each`. You may prefer
> to use :c:func:`xa_find` or :c:func:`xa_next` to move to the next present
> entry in the XArray.
>
> Finally, you can remove all entries from an XArray by calling
> :c:func:`xa_destroy`. If the XArray entries are pointers, you may wish
> to free the entries first. You can do this by iterating over all non-NULL
> entries in
... the XArray using xa_for_each() ?
> When using the Normal API, you do not have to worry about locking.
> The XArray uses RCU and an irq-safe spinlock to synchronise access to
> the XArray:
>
> No lock needed:
> * :c:func:`xa_empty`
> * :c:func:`xa_tagged`
>
> Takes RCU read lock:
> * :c:func:`xa_load`
> * :c:func:`xa_for_each`
> * :c:func:`xa_find`
> * :c:func:`xa_next`
> * :c:func:`xa_get_entries`
> * :c:func:`xa_get_tagged`
> * :c:func:`xa_get_tag`
>
> Takes xa_lock internally:
> * :c:func:`xa_store`
> * :c:func:`xa_cmpxchg`
> * :c:func:`xa_destroy`
> * :c:func:`xa_set_tag`
> * :c:func:`xa_clear_tag`
>
> The :c:func:`xa_store` and :c:func:`xa_cmpxchg` functions take a gfp_t
> parameter in case the XArray needs to allocate memory to store this entry.
> If the entry being stored is NULL, no memory allocation needs to be
> performed, and the GFP flags specified here will be ignored.
>
> Advanced API
> ============
>
> The advanced API offers more flexibility and better performance at the
> cost of an interface which can be harder to use and has fewer safeguards.
> No locking is done for you by the advanced API, and you are required to
> use the xa_lock while modifying the array. You can choose whether to use
> the xa_lock or the RCU lock while doing read-only operations on the array.
>
> The advanced API is based around the xa_state. This is an opaque data
> structure which you declare on the stack using the :c:func:`XA_STATE`
> macro. This macro initialises the xa_state ready to start walking
> around the XArray. It is used as a cursor to maintain the position
> in the XArray and let you compose various operations together without
> having to restart from the top every time.
>
> The xa_state is also used to store errors. If an operation fails, it
> calls :c:func:`xas_set_err` to note the error. All operations check
> whether the xa_state is in an error state before proceeding, so there's
> no need for you to check for an error after each call; you can make
> multiple calls in succession and only check at a convenient point.
>
> The only error currently generated by the xarray code itself is
> ENOMEM, but it supports arbitrary errors in case you want to call
> :c:func:`xas_set_err` yourself. If the xa_state is holding an ENOMEM
> error, :c:func:`xas_nomem` will attempt to allocate a single xa_node using
.. calling :c:func:`xas_nomem` ... ?
> the specified gfp flags and store it in the xa_state for the next attempt.
> The idea is that you take the xa_lock, attempt the operation and drop
> the lock. Then you allocate memory if there was a memory allocation
... then you try to allocate ...
> failure and retry the operation. You must call :c:func:`xas_destroy`
> if you call :c:func:`xas_nomem` in case it's not necessary to use the
> memory that was allocated.
This last sentence is not totally clear. How about:
If you called :c:func:`xas_nomem` to allocate memory, but didn't need
to use the memory for some reason, you need to call :c:func:`xas_destroy`
to free the allocated memory.
The question is where the "allocated memory" is stored, if it isn't in
the XArray? Is it in the XA_STATE? How do you know if the allocated
memory was needed, or is it always safe to call xas_destroy? Is the
allocated memory always consumed if xa_store/xa_cmpxchg are called?
What if there was another process that also added the same entry while
the xa_lock was dropped?
> When using the advanced API, it's possible to see internal entries
> in the XArray. You should never see an :c:func:`xa_is_node` entry,
> but you may see other internal entries, including sibling entries,
> skip entries and retry entries. The :c:func:`xas_retry` function is a
> useful helper function for handling internal entries, particularly in
> the middle of iterations.
How do you know if a returned value is an "internal entry"? Is there
some "xas_is_internal()" macro/function that tells you this?
> Functions
> =========
>
> .. kernel-doc:: include/linux/xarray.h
> .. kernel-doc:: lib/xarray.c
Cheers, Andreas
Attachment:
signature.asc
Description: Message signed with OpenPGP