Re: [PATCH V2] genirq/affinity: Handle affinity setting on inactive interrupts correctly
From: Thomas Gleixner
Date: Fri Jul 24 2020 - 16:03:55 EST
John Keeping <john@xxxxxxxxxxxx> writes:
> On Fri, 17 Jul 2020 18:00:02 +0200
> Thomas Gleixner <tglx@xxxxxxxxxxxxx> wrote:
> It seems that this patch breaks perf events on RK3288 because the PMU
> interrupts that should be per-cpu are now all on CPU0 so no events are
> collected from CPUs 1-3 and those interrupts are killed as spurious
> after a few seconds.
> I'm seeing this on 4.19.134 and 5.4.53 but as far as I can tell the
> relevant code hasn't changed through to next-20200723. Reverting the
> backport of this change fixes the problem.
> It looks like what happens is that because the interrupts are not
> per-CPU in the hardware, armpmu_request_irq() calls irq_force_affinity()
> while the interrupt is deactivated and then request_irq() with
> IRQF_PERCPU | IRQF_NOBALANCING.
> Now when irq_startup() runs with IRQ_STARTUP_NORMAL, it calls
> irq_setup_affinity() which returns early because IRQF_PERCPU and
> IRQF_NOBALANCING are set, leaving the interrupt on its original CPU.
Right. My brain tricked me to believe that we made activation mandatory,
but that's not.
I have some ideas for a trivial generic way to solve this without
undoing the commit in question and without going through all the irq
chip drivers. So far everything I came up with is butt ugly. Maybe Marc
has some brilliant idea.
Sorry for the wreckage and thanks for the excellent problem
description. I'll come back to you in the next days.