[NAK] Re: [PATCH -v2 9/9] ACPI, APEI, Generic Hardware Error SourcePOLL/IRQ/NMI notification type support

From: Ingo Molnar
Date: Mon Oct 25 2010 - 04:46:21 EST



* Huang Ying <ying.huang@xxxxxxxxx> wrote:

> Generic Hardware Error Source provides a way to report platform
> hardware errors (such as that from chipset). It works in so called
> "Firmware First" mode, that is, hardware errors are reported to
> firmware firstly, then reported to Linux by firmware. This way, some
> non-standard hardware error registers or non-standard hardware link
> can be checked by firmware to produce more valuable hardware error
> information for Linux.
>
> This patch adds POLL/IRQ/NMI notification types support.
>
> Because the memory area used to transfer hardware error information
> from BIOS to Linux can be determined only in NMI, IRQ or timer
> handler, but general ioremap can not be used in atomic context, so a
> special version of atomic ioremap is implemented for that.
>
> Signed-off-by: Huang Ying <ying.huang@xxxxxxxxx>
> Reviewed-by: Andi Kleen <ak@xxxxxxxxxxxxxxx>
> ---
> arch/x86/kernel/acpi/boot.c | 1
> arch/x86/kernel/dumpstack.c | 1
> drivers/acpi/apei/ghes.c | 397 ++++++++++++++++++++++++++++++++++++--------
> kernel/panic.c | 1
> lib/ioremap.c | 2
> mm/vmalloc.c | 1
> 6 files changed, 333 insertions(+), 70 deletions(-)

WTF?

Sigh, please integrate all this into EDAC (drivers/edac/) properly, instead of
turning it into YET ANOTHER hardware vendor special hw-errors thing. We can do
better than this. EDAC is almost there: it has support for Nehalem, AMD, a couple
of older chips.

Guys, instead of carving out a special driver area where you can produce crap
without anyone looking too much, and pretending that the EDAC code does not exist,
please try to work with others who are aiming higher and who are using saner
interfaces.

Just look at the higher level structure in drivers/acpi/apei/:

apei-base.c apei-internal.h cper.c einj.c erst.c erst-dbg.c ghes.c hest.c Kconfig Makefile

ghes, einj, cper, erst? Someone's been abbreviating too much.

einj.c: it's about the 3rd separate 'error injection' concept that got introduced
...

Please _THINK_ for goodness's sake and try to get some usable, coherent, generic
interface to users - instead of just directly implementing whatever crap hw
designers came up with.

So unless Linus or Andrew overrules me i'll NAK this before the insanity spreads too
much:

NAKed-by: Ingo Molnar <mingo@xxxxxxx>

(Someone should have NAK-ed it when the first iteration was merged.)

Thanks,

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/