Re: Question about extensions to lib/lz4

From: Andrew Morton
Date: Wed Nov 27 2024 - 21:12:42 EST


On Tue, 19 Nov 2024 15:58:39 -0800 Chris Goldsworthy <quic_cgoldswo@xxxxxxxxxxx> wrote:

> Hi Folks,
>
> Qualcomm is designing a LZ4 compression / decompression engine, with the goal of
> being able to do single-pass operations (i.e. we only read input from DDR once
> for compression and decompression). This is achieved by using buffers internal
> to the engine that store:
> - For compression, the running literal we've encountered, which is used as a
> search buffer
> - For decompression, the last part of the running decompressed output we've
> produced
>
> The outcome of using internal (and obviously fixed-size) buffers for the above,
> whilst not making any changes to the LZ4 format, are as follows:
> - For compression, if we fail to produce a match after running out of input
> buffer space, compression will fail.
> - For decompression, if the copy offset for a given block extends beyond
> what we're holding in our buffer, decompression will fail
>
> We don't want to constrain our HW as such whilst maintaining compatibility with
> SW, and allow it to compress and decompress streams of arbitrary lengths.
> Focusing on decompression for now, we've proposed an extension to LZ4 that would
> allow SW to decompress streams compressed by HW like ours, which is described in
> more detail here [1] in a Github discussion on the lz4 repository owned by Yann
> Collet. The changes we've proposed are as follows, though we would want to add
> a static branch check as well to remove overhead for those who do not want to
> use this extension:
>
> diff -rupN "torvalds linux master lib-lz4/lz4_decompress.c" lib-lz4-patched/lz4_decompress.c
> --- "torvalds linux master lib-lz4/lz4_decompress.c"  2024-08-11 09:51:42.000000000 -0700
> +++ lib-lz4-patched/lz4_decompress.c  2024-08-12 06:26:33.986693000 -0700
> @@ -166,6 +166,7 @@ static FORCE_INLINE int LZ4_decompress_g
>       ip += 2;
>       match = op - offset;
>       assert(match <= op); /* check overflow */
> +      if (unlikely(!offset)) continue; /* skip copy with zero offset */
>
>       /* Do not deal with overlapping matches. */
>       if ((length != ML_MASK) &&
> @@ -289,6 +290,7 @@ static FORCE_INLINE int LZ4_decompress_g
>     offset = LZ4_readLE16(ip);
>     ip += 2;
>     match = op - offset;
> +    if (unlikely(!offset)) continue; /* skip copy with zero offset */
>
>     /* get matchlength */
>     length = token & ML_MASK;
>
> Yann Collet has indicated that these changes could be acceptable and implemented as
> part of a LZ4 v2 block format, which would include several other changes as well
> that have been proposed over the years. The timeline for making / socializing
> this would be on the order of years though, if it does go through [1].
>
> So our question is as follows: as part of submitting our driver, would it be
> acceptable to take the above changes?

It's unclear (to me) what are the expected effects upon existing users
of this library code. I'm assuming "none"? If so, please send out a
suitably changelogged standalone patch asap and I can add it to
linux-next for some testing.

When are you expecting to send this new driver out for
review/test/merge, btw?