Re: [PATCH] LZ4: compression/decompression signedness mismatch

From: Sergey Senozhatsky
Date: Fri Jul 12 2013 - 05:49:06 EST


On (07/12/13 11:28), Yann Collet wrote:
> The reference implementation, hosted at :ï
> [1]https://code.google.com/p/lz4/
> only proposes char* (signed) types as part of the interface contract.
> I would recommend to keep it that way, to remain consistent.
> Regards

Crypto lz4 accepts u8 * for both compression and decompression:

lz4_compress_crypto(struct crypto_tfm *tfm, const u8 *src,
unsigned int slen, u8 *dst, unsigned int *dlen)

lz4_decompress_crypto(struct crypto_tfm *tfm, const u8 *src,
unsigned int slen, u8 *dst, unsigned int *dlen)


Internally LZ4 may cast unsigned char* to signed char*, the same way you
already do with compression:

int lz4_compress(const unsigned char *src, size_t src_len,
unsigned char *dst, size_t *dst_len, void *wrkmem)

calls:
lz4_compressctx(void *ctx,
const char *source, char *dest,
int isize, int maxoutputsize)



At the moment API is a bit misaligned: unsiged char* for compression and signed char* for
decompression.


My 'real word' use case is, suppose:

struct foo {
[..]
int (*compress)(const unsigned char *src, size_t src_len,
unsigned char *dst, size_t *dst_len, void *wrkmem);
int (*decompress)(const unsigned char *src, size_t src_len,
unsigned char *dst, size_t *dst_len);
};


and (for example) module also provides sysfs attribute, so user can switch select
LZO or LZ4 compressions depending of his needs:

->compress = lzo1x_1_compress;
->decompress = lzo1x_decompress_safe;

to
->compress = lz4_compress;
->decompress = lz4_decompress_unknownoutputsize;


the last one produces unneccessary compilation warning.


-ss

> 2013/7/12 Sergey Senozhatsky <[2]sergey.senozhatsky@xxxxxxxxx>
>
> LZ4 compression and decompression functions require different
> in signedness input/output parameters: unsigned char for
> compression and signed char for decompression.
>
> Change decompression API to require unsigned char.
>
> Signed-off-by: Sergey Senozhatsky <[3]sergey.senozhatsky@xxxxxxxxx>
>
> ---
>
> ïinclude/linux/lz4.h ï ï ï| 8 ++++----
> ïlib/lz4/lz4_decompress.c | 8 ++++----
> ï2 files changed, 8 insertions(+), 8 deletions(-)
>
> diff --git a/include/linux/lz4.h b/include/linux/lz4.h
> index d21c13f..c13f0bc 100644
> --- a/include/linux/lz4.h
> +++ b/include/linux/lz4.h
> @@ -67,8 +67,8 @@ int lz4hc_compress(const unsigned char *src, size_t
> src_len,
> ï * ï ï note : ïDestination buffer must be already allocated.
> ï * ï ï ï ï ï ï slightly faster than lz4_decompress_unknownoutputsize()
> ï */
> -int lz4_decompress(const char *src, size_t *src_len, char *dest,
> - ï ï ï ï ï ï ï size_t actual_dest_len);
> +int lz4_decompress(unsigned const char *src, size_t *src_len,
> + ï ï ï ï ï ï ï unsigned char *dest, size_t actual_dest_len);
>
> ï/*
> ï * lz4_decompress_unknownoutputsize()
> @@ -82,6 +82,6 @@ int lz4_decompress(const char *src, size_t *src_len,
> char *dest,
> ï * ï ï ï ï ï ï ï Error if return (< 0)
> ï * ï ï note : ïDestination buffer must be already allocated.
> ï */
> -int lz4_decompress_unknownoutputsize(const char *src, size_t src_len,
> - ï ï ï ï ï ï ï char *dest, size_t *dest_len);
> +int lz4_decompress_unknownoutputsize(unsigned const char *src, size_t
> src_len,
> + ï ï ï ï ï ï ï unsigned char *dest, size_t *dest_len);
> ï#endif
> diff --git a/lib/lz4/lz4_decompress.c b/lib/lz4/lz4_decompress.c
> index d3414ea..7ceda1f 100644
> --- a/lib/lz4/lz4_decompress.c
> +++ b/lib/lz4/lz4_decompress.c
> @@ -283,8 +283,8 @@ _output_error:
> ï ï ï ï return (int) (-(((char *) ip) - source));
> ï}
>
> -int lz4_decompress(const char *src, size_t *src_len, char *dest,
> - ï ï ï ï ï ï ï size_t actual_dest_len)
> +int lz4_decompress(unsigned const char *src, size_t *src_len,
> + ï ï ï ï ï ï ï unsigned char *dest, size_t actual_dest_len)
> ï{
> ï ï ï ï int ret = -1;
> ï ï ï ï int input_len = 0;
> @@ -302,8 +302,8 @@ exit_0:
> ïEXPORT_SYMBOL_GPL(lz4_decompress);
> ï#endif
>
> -int lz4_decompress_unknownoutputsize(const char *src, size_t src_len,
> - ï ï ï ï ï ï ï char *dest, size_t *dest_len)
> +int lz4_decompress_unknownoutputsize(unsigned const char *src, size_t
> src_len,
> + ï ï ï ï ï ï ï unsigned char *dest, size_t *dest_len)
> ï{
> ï ï ï ï int ret = -1;
> ï ï ï ï int out_len = 0;
>
> References
>
> Visible links
> 1. https://code.google.com/p/lz4/
> 2. mailto:sergey.senozhatsky@xxxxxxxxx
> 3. mailto:sergey.senozhatsky@xxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/