Re: [RFC][PATCH] sched: Cache aware load-balancing

From: Chen, Yu C
Date: Thu Mar 27 2025 - 07:14:47 EST

Next message: Sakari Ailus: "Re: [PATCH 6/6] media: i2c: imx334: Enable runtime PM before sub-device registration"
Previous message: Tanmay Jagdale: "[PATCH V3 2/2] perf: cs-etm: Store previous timestamp in packet queue"
In reply to: Madadi Vineeth Reddy: "Re: [RFC][PATCH] sched: Cache aware load-balancing"
Next in thread: Madadi Vineeth Reddy: "Re: [RFC][PATCH] sched: Cache aware load-balancing"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hi Madadi,

On 3/27/2025 10:43 AM, Madadi Vineeth Reddy wrote:

Hi Peter,

On 25/03/25 17:39, Peter Zijlstra wrote:

Hi all,

One of the many things on the eternal todo list has been finishing the
below hackery.

It is an attempt at modelling cache affinity -- and while the patch
really only targets LLC, it could very well be extended to also apply to
clusters (L2). Specifically any case of multiple cache domains inside a
node.

Anyway, I wrote this about a year ago, and I mentioned this at the
recent OSPM conf where Gautham and Prateek expressed interest in playing
with this code.

So here goes, very rough and largely unproven code ahead :-)

It applies to current tip/master, but I know it will fail the __percpu
validation that sits in -next, although that shouldn't be terribly hard
to fix up.

As is, it only computes a CPU inside the LLC that has the highest recent
runtime, this CPU is then used in the wake-up path to steer towards this
LLC and in task_hot() to limit migrations away from it.

More elaborate things could be done, notably there is an XXX in there
somewhere about finding the best LLC inside a NODE (interaction with
NUMA_BALANCING).

Tested the patch on a 12-core, 96-thread Power10 system using a real-life
workload, DayTrader.

Do all the Cores share the same LLC within 1 node? If this is the case,
the regression might be due to over-migration/task stacking within 1 LLC/node. This patch should be modified that cache aware load balancing/wakeup will not be triggered if there is only 1 LLC within the node IMO.

thanks,
Chenyu

Here is a summary of the runs:

Users | Instances | Throughput vs Base | Avg Resp. Time vs Base
--------------------------------------------------------------
30 | 1 | -25.3% | +50%
60 | 1 | -25.1% | +50%
30 | 3 | -22.8% | +33%

As of now, the patch negatively impacts performance both in terms of
throughput and latency.

I will conduct more extensive testing with both microbenchmarks and
real-life workloads.

Thanks,
Madadi Vineeth Reddy

Next message: Sakari Ailus: "Re: [PATCH 6/6] media: i2c: imx334: Enable runtime PM before sub-device registration"
Previous message: Tanmay Jagdale: "[PATCH V3 2/2] perf: cs-etm: Store previous timestamp in packet queue"
In reply to: Madadi Vineeth Reddy: "Re: [RFC][PATCH] sched: Cache aware load-balancing"
Next in thread: Madadi Vineeth Reddy: "Re: [RFC][PATCH] sched: Cache aware load-balancing"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]