On 2018-07-26 12:15, Wei Wang wrote:
On 07/26/2018 05:37 PM, Yury Norov wrote:Absolutely not! That would access bitmap[lim] (the final value of the k
On Thu, Jul 26, 2018 at 04:07:51PM +0800, Wei Wang wrote:Yes, I saw that. But it seems confusing for the corner case that nbits=0
The existing BITMAP_LAST_WORD_MASK macro returns 0xffffffff if nbits isI think this is intentional behavour. Previous version did return ~0UL
0. This patch changes the macro to return 0 when there is no bit
needs to
be masked.
explicitly in this case. See patch 89c1e79eb3023 (linux/bitmap.h: improve
BITMAP_{LAST,FIRST}_WORD_MASK) from Rasmus.
(no bits to mask), the macro returns with all the bits set.
Introducing conditional branch would affect performance. All existingI think that didn't save the conditional branch essentially, because
code checks nbits for 0 before handling last word where needed
explicitly. So I think we'd better change nothing here.
it's just moved from inside this macro to the caller as you mentioned.
If callers missed the check for some reason and passed 0 to the macro,
they will get something unexpected.
Current callers like __bitmap_weight, __bitmap_equal, and others, they have
if (bits % BITS_PER_LONG)
w += hweight_long(bitmap[k] & BITMAP_LAST_WORD_MASK(bits));
we could remove the "if" check by "w += hweight_long(bitmap[k] &
BITMAP_LAST_WORD_MASK(bits % BITS_PER_LONG));" the branch is the same.
variable) despite that word not being part of the bitmap.
More generally, look at the name of the macro: last_word_mask. It's a
mask to apply to the last word of a bitmap. If the bitmap happens to
consist of a multiple of BITS_PER_LONG bits, than that mask is and must
be ~0UL. So for nbits=64, 128, etc., that is what we want.
OTOH, for nbits=0, there _is_ no last word (since there are no words at
all), so by the time you want to apply the result of
BITMAP_LAST_WORD_MASK(0) to anything, you already have a bug, probably
either having read or being about to write into bitmap[0], which you
cannot do. Please check that user-space port and see if there are bugs
of that kind.
So no, the existing users of BITMAP_LAST_WORD_MASK do not check for
nbits being zero, they check for whether there is a partial last word,
which is something different.
And they mostly (those in lib/bitmap.c) do
that because they've already handled _all_ the full words. Then there
are some users in include/linux/bitmap.h, that check for
small_const_nbits(nbits), and in those cases, we really want ~0UL when
nbits is BITS_PER_LONG, because small_const_nbits implies there is
exactly one word. Yeah, there's an implicit assumption that the bitmap
routines are never called with a compile-time constant nbits==0 (see the
unconditional accesses to *src and *dst), but changing the semantics of
BITMAP_LAST_WORD_MASK and making it return different values for nbits=0
vs nbits=64 wouldn't fix that latent bug.