Re: [PATCH 1/7] ia64, kdump: Mask MCA/INIT on freezing cpus
From: Hidetoshi Seto
Date: Mon Jun 22 2009 - 20:34:09 EST
Robin Holt wrote:
>> To avoid this problem, This patch inserts ia64_set_psr_mc() before the
>> deadloop to mask MCA/INIT on cpus going to be frozen. I confirmed that
>> weird log like above are disappeared after applying this patch.
>
> Please do not do this. Turning off MCA/INIT is just a horrible idea.
> When your code has a bug, the INIT of the cpu is the only tool we have
> to find out what it is doing short of putting special hardware onto the
> machine and trying to track it down.
This patch never mask MCA/INIT while the system is running normally.
The first place I inserted the masking is just after panic, and just after
INIT is asserted. This patch doesn't prevent you from taking kdump or
stack trace on your machine.
Maybe I could not catch what you pointed.
One of the problems I'm targeting here is that there is no way to allow
INIT while kernel transition. What are you expecting with INIT if it is
asserted on the beginning of the 2nd kernel?
And note that this patch 1 of 7 is necessary to run the INIT handler of
the 2nd kernel, which might be registered by the 2nd kernel.
> Without thinking about it, I have a gut feeling there must be some way
> to at least allow the MCA/INIT to make it through PROM and be delivered
> to the OS. From there the OS should be able to sort out a way to handle
> kdump and MCAs received during a kdump.
Do you mean that the 2nd kernel should be able to handle MCA/INIT from its
boot up? I guess the word PROM is nearly equal to PAL/SAL firmware, if so
then I don't think there are good generic interface/procedure could be
useful here. Do you have any concrete idea?
Thanks,
H.Seto
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/