[PATCH v2 0/1] sched: Improve cache locality of RSEQ concurrency IDs

From: Mathieu Desnoyers
Date: Mon Sep 09 2024 - 17:15:38 EST


Intermittent workloads behaving in bursts spaced by more than 100ms
on each CPU exhibit bad cache locality and degraded performance compared
to purely per-cpu data indexing, because concurrency IDs are allocated
over various CPUs and cores, therefore losing cache locality of the
associated data.

This series addresses this shortcoming. I observed speedups up to 16.7x
compared to plain mm_cid indexing in benchmarks.

It applies on top of v6.10.6.

This deprecates the prior "sched: NUMA-aware per-memory-map concurrency
IDs" patch series with a simpler and more general approach.

Feedback is welcome!

Thanks,

Mathieu

Cc: Valentin Schneider <vschneid@xxxxxxxxxx>
Cc: Mel Gorman <mgorman@xxxxxxx>
Cc: Steven Rostedt <rostedt@xxxxxxxxxxx>
Cc: Vincent Guittot <vincent.guittot@xxxxxxxxxx>
Cc: Dietmar Eggemann <dietmar.eggemann@xxxxxxx>
Cc: Ben Segall <bsegall@xxxxxxxxxx>
Cc: Yury Norov <yury.norov@xxxxxxxxx>
Cc: Rasmus Villemoes <linux@xxxxxxxxxxxxxxxxxx>

Mathieu Desnoyers (1):
sched: Improve cache locality of RSEQ concurrency IDs for intermittent
workloads

fs/exec.c | 2 +-
include/linux/mm_types.h | 72 +++++++++++++++++++++++++++++++++++-----
kernel/fork.c | 2 +-
kernel/sched/core.c | 7 ++--
kernel/sched/sched.h | 47 ++++++++++++++++++--------
5 files changed, 103 insertions(+), 27 deletions(-)

--
2.39.2