Re: [RFC PATCH] workqueue: Automatic affinity scope fallback for single-pod topologies

In reply to: Tejun Heo: "Re: [RFC PATCH] workqueue: Automatic affinity scope fallback for single-pod topologies"

From: Chuck Lever

Date: Tue Feb 03 2026 - 15:34:29 EST

On 2/3/26 3:29 PM, Tejun Heo wrote:
> On Tue, Feb 03, 2026 at 03:14:46PM -0500, Chuck Lever wrote:
>>> While I understand the problem, I don't think dropping down to core boundary
>>> for unbound workqueues by default makes sense. That may help with some use
>>> cases but cause problem with others.
>>
>> I've never seen a case where it doesn't help. In order to craft an
>> alternative, I'll need to have some examples to avoid. Is it only the
>> SMT case that is concerning?
>
> It's just a lot of separate pools on large machines. If you have relatively
> high concurrency, the number of workers can go pretty high. They'd also
> migrate back and forth more depending on usage pattern and have worse cache
> locality. Imagine you have a bursty workload wandering through the system,
> if you have nr_cores pools, it can easily end up with kworkers > nr_cores *
> max_concurrency.

The patch addresses that, I'd hope, by switching to per-CPU only on
single-pod (i.e., simple) systems. Larger, more complicated topologies
should be left unchanged. I imagine that on a single-pod machine with a
large number of cores, per-CPU locking will nearly always be a
win.
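
For a rough sense of scale, here is a minimal sketch of the
nr_cores * max_concurrency figure quoted above. It is a simplified
model, not how the workqueue code accounts for workers: the helper
name and the values 64 and 16 are illustrative assumptions, and it
simply treats every pool as filling up to max_concurrency workers.

```python
def workers_if_all_pools_fill(nr_pools: int, max_concurrency: int) -> int:
    """Hypothetical helper: worker count if every pool reaches max_concurrency.

    Simplified model for illustration only; not a kernel interface.
    """
    if nr_pools < 0 or max_concurrency < 0:
        raise ValueError("counts must be non-negative")
    return nr_pools * max_concurrency


# Illustrative values, not measurements from any real system.
NR_CORES = 64
MAX_CONCURRENCY = 16

# One shared pool for the whole pod versus one pool per core.
shared_pool = workers_if_all_pools_fill(1, MAX_CONCURRENCY)
per_core_pools = workers_if_all_pools_fill(NR_CORES, MAX_CONCURRENCY)

print(f"one pool:        {shared_pool} workers")
print(f"{NR_CORES} per-core pools: {per_core_pools} workers")
```

Under this model the worker count scales linearly with the number of
pools, which is the growth being weighed against the locking win.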

--
Chuck Lever