Re: [PATCH] smp: Evaluate local cond_func() before IPI side-effects
From: Mathieu Desnoyers
Date: Thu Dec 05 2024 - 10:00:02 EST
On 2024-12-03 20:38, Dave Hansen wrote:
On 12/3/24 10:39, Mathieu Desnoyers wrote:
If cond_func() depends on loading shared state updated by other CPU's
IPI handlers func(), then triggering execution of remote CPUs IPI before
evaluating cond_func() may have unexpected consequences.
I always thought this was on purpose so cond_func() can be executed in
parallel with the remote work.
Could we double-check that this doesn't meaningfully slow down IPIs that
have longer work to do?
I notice that this question was not answered. I did do extensive
benchmark of this effect, but I would not expect a significant
impact there, because the cond_func() I've seen (there are very
few users) are all really short, and should be much shorter than
doing the IPI, so I expect a negligible performance overhead.
But we'll see if any bot observe something unexpected.
Caller code:
fs/buffer.c
1530: on_each_cpu_cond(has_bh_in_lru, invalidate_bh_lru, NULL, 1);
#define BH_LRU_SIZE 16
bool has_bh_in_lru(int cpu, void *dummy)
{
struct bh_lru *b = per_cpu_ptr(&bh_lrus, cpu);
int i;
for (i = 0; i < BH_LRU_SIZE; i++) {
if (b->bhs[i])
return true;
}
return false;
}
arch/x86/mm/tlb.c
932: on_each_cpu_cond_mask(tlb_is_not_lazy, flush_tlb_func,
^ this is the small function introduced by Rik's patches.
Thanks,
Mathieu
--
Mathieu Desnoyers
EfficiOS Inc.
https://www.efficios.com