Re: [PATCH v2] x86: page: get_order() optimization

From: H. Peter Anvin
Date: Mon Mar 28 2011 - 15:48:12 EST


On 03/27/2011 01:45 AM, Maksym Planeta wrote:
> For x86 architecture get_order function can be optimized due to
> assembler instruction bsr.
>
> This is second version of patch where for constants gcc precompute the
> result.
>
> Signed-off-by: Maksym Planeta <mcsim.planeta@xxxxxxxxx>

gcc 4.x has an intrinsic, __builtin_clz(), which does the opposite of
the bsr instruction; specifically:

__builtin_clz(x) ^ 31

... generates a bsrl instruction if x is variable. This tends to
generate much better code than any assembly hacks.

-hpa
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/