Re: [f2fs-dev] [PATCH v6] f2fs: compress: support compress level
From: Gao Xiang
Date: Fri Dec 04 2020 - 04:12:41 EST
On Fri, Dec 04, 2020 at 04:50:14PM +0800, Chao Yu wrote:
...
>
> >
> > About the speed, I think which is also limited to storage device and other
> > conditions (I mean the CPU loading during the writeback might be different
> > between lz4 and lz4hc-9 due to many other bounds, e.g. UFS 3.0 seq
> > write is somewhat higher vs VM. lz4 may have higher bandwidth on high
>
> Yeah, I guess my VM have been limited on its storage bandwidth, and its back-end
> could be low-end rotating disk...
Yeah, anyway that's in IO writeback path (no matter the time was working
on IO or CPU calcualation...)
>
> > level devices since it seems some IO bound here... I guess but not sure,
> > since pure in-memory lz4 is fast according to lzbench / lz4 homepage.)
> >
> > Anyway, it's up to f2fs folks if it's useful :) (the CR number is what
> > I expect though... I'm a bit of afraid the CPU runtime loading.)
>
> I just have a glance at CPU usage numbers (my VM has 16 cores):
> lz4hc takes 11% in first half and downgrade to 6% at second half.
> lz4 takes 6% in whole process.
>
> But that's not accruate...
There is some userspace lzbench [1] to benchmark lz4/lz4hc completely
in memory. So it's expected that lz4bench will consume all 100% CPU
with maximum bandwidth (but in-kernel lz4 version is lower though):
Intel Core i7-8700K
Compression Decompression C/R
memcpy 10362 MB/s 10790 MB/s 100.00
lz4 1.9.2 737 MB/s 4448 MB/s 47.60
lz4hc 1.9.2 -9 33 MB/s 4378 MB/s 36.75
So adding more IO time (due to storage device difference) could make
CPU loading lower (also could make the whole process IO bound) but
the overall write bandwidth will be lower as well.
[1] https://github.com/inikep/lzbench
Thanks,
Gao Xiang
>
> Thanks,