Re: [PATCH 0/4] sched/fair: SMT-aware asymmetric CPU capacity
From: Shrikanth Hegde
Date: Sat Mar 28 2026 - 02:52:28 EST
How is the performance measured here? Which benchmark?
I've used an internal NVIDIA suite (based on NVBLAS), I also tried Linpack
and got similar results. I'm planning to repeat the tests using public
benchmarks and share the results as soon as I can.
By any chance you are running number_running_task <= (nr_cpus / smt_threads_per_core),
so it is all fitting nicely?
That's the case that gives me the optimal results.
If you increase those numbers, how does the performance numbers compare?
I tried different number of tasks. The more I approach system saturation
the smaller the benefits are. When I completely saturate the system I don't
see any benefit with this changes, neither regressions, but I guess that's
expected.
Ok. That's good.
I gave hackbench on powerpc with SMT=4, i didn't observe any regressions or improvements.
Only PATCH 4/4 applies in this case as there is no asym_cpu_capacity