Re: [PATCH v8 0/3] sched/fair: Optimize cfs_rq and sched_entity allocation for better data locality
From: Zecheng Li
Date: Thu Feb 12 2026 - 14:20:37 EST
Hi Josh,
> > Other workloads without CPU share limits, while also running in a cgroup
> > hierarchy with O(1000) instances, show no obvious regression:
> >
> > sysbench, hackbench - lower is better; ebizzy - higher is better.
> >
> > workload | base | opt | metric
> > ----------+-----------------------+-----------------------+------------
> > sysbench | 63.55, [63.04, 64.05] | 64.36, [62.97, 65.75] | avg latency
> > hackbench | 36.95, [35.45, 38.45] | 37.12, [35.81, 38.44] | time
> > ebizzy | 610.7, [569.8, 651.6] | 613.5, [592.1, 635.0] | record/s
>
> Zecheng, am I reading those benchmark stats wrong, or is the 'opt'
> version slightly worse than 'base'?
The differences are within the error bounds shown in the brackets.
Since the ranges overlap significantly, the delta is noise and
confirms no obvious regression. A different set of runs could reverse
the averages.