Re: [PATCH] x86/ioapic: Fix NULL pointer dereference on CPU hotplugafter disabling irqs

From: Ingo Molnar
Date: Thu Jul 26 2012 - 06:21:07 EST



* Tomoki Sekiyama <tomoki.sekiyama.qu@xxxxxxxxxxx> wrote:

> In current Linux, percpu variable `vector_irq' is not always cleared when
> a CPU is offlined. If the CPU that has the disabled irqs in vector_irq is
> hotplugged again, __setup_vector_irq() hits invalid irq vector and may
> crash.
>
> This bug can be reproduced as following;
> # echo 0 > /sys/devices/system/cpu/cpu7/online
> # modprobe -r some_driver_using_interrupts # vector_irq@cpu7 uncleared
> # echo 1 > /sys/devices/system/cpu/cpu7/online # kernel may crash
>
> To fix this problem, this patch clears vector_irq in __fixup_irqs() when
> the CPU is offlined.
>
> This also reverts commit f6175f5bfb4c, which partially fixes this bug by
> clearing vector in __clear_irq_vector(). But in environments with IOMMU IRQ
> remapper, it could fail because cfg->domain doesn't contain offlined CPUs.
> With this patch, the fix in __clear_irq_vector() can be reverted because
> every vector_irq is already cleared in __fixup_irqs() on offlined CPUs.
>
> Signed-off-by: Tomoki Sekiyama <tomoki.sekiyama.qu@xxxxxxxxxxx>
> Cc: Thomas Gleixner <tglx@xxxxxxxxxxxxx>
> Cc: Ingo Molnar <mingo@xxxxxxxxxx>
> Cc: "H. Peter Anvin" <hpa@xxxxxxxxx>
> Cc: Suresh Siddha <suresh.b.siddha@xxxxxxxxx>
> Cc: Yinghai Lu <yinghai@xxxxxxxxxx>
> Cc: Alexander Gordeev <agordeev@xxxxxxxxxx>
> ---
> arch/x86/kernel/apic/io_apic.c | 4 ++--
> arch/x86/kernel/irq.c | 1 +
> 2 files changed, 3 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/kernel/apic/io_apic.c b/arch/x86/kernel/apic/io_apic.c
> index 5f0ff59..ac96561 100644
> --- a/arch/x86/kernel/apic/io_apic.c
> +++ b/arch/x86/kernel/apic/io_apic.c
> @@ -1195,7 +1195,7 @@ static void __clear_irq_vector(int irq, struct irq_cfg *cfg)
> BUG_ON(!cfg->vector);
> vector = cfg->vector;
> - for_each_cpu(cpu, cfg->domain)
> + for_each_cpu_and(cpu, cfg->domain, cpu_online_mask)
> per_cpu(vector_irq, cpu)[vector] = -1;
> cfg->vector = 0;
> @@ -1203,7 +1203,7 @@ static void __clear_irq_vector(int irq, struct irq_cfg *cfg)
> if (likely(!cfg->move_in_progress))
> return;
> - for_each_cpu(cpu, cfg->old_domain) {

that's not a valid diff - something in your mailer ate lines or
such. See Documentation/email-clients.txt.

Thanks,

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/