Re: [PATCH v4 00/22] slab: replace cpu (partial) slabs with sheaves
From: Vlastimil Babka
Date: Thu Jan 29 2026 - 10:30:14 EST
On 1/29/26 16:18, Hao Li wrote:
> Hi Vlastimil,
>
> I conducted a detailed performance evaluation of the each patch on my setup.
Thanks! What was the benchmark(s) used? Importantly, does it rely on
vma/maple_node objects? So previously those would become kind of double
cached by both sheaves and cpu (partial) slabs (and thus hopefully benefited
more than they should) since sheaves introduction in 6.18, and now they are
not double cached anymore?
> During my tests, I observed two points in the series where performance
> regressions occurred:
>
> Patch 10: I noticed a ~16% regression in my environment. My hypothesis is
> that with this patch, the allocation fast path bypasses the percpu partial
> list, leading to increased contention on the node list.
That makes sense.
> Patch 12: This patch seems to introduce an additional ~9.7% regression. I
> suspect this might be because the free path also loses buffering from the
> percpu partial list, further exacerbating node list contention.
Hmm yeah... we did put the previously full slabs there, avoiding the lock.
> These are the only two patches in the series where I observed noticeable
> regressions. The rest of the patches did not show significant performance
> changes in my tests.
>
> I hope these test results are helpful.
They are, thanks. I'd however hope it's just some particular test that has
these regressions, which can be explained by the loss of double caching.