Re: [Patch] kexec: remove KMSG_DUMP_KEXEC (was Re: Query aboutkdump_msg hook into crash_kexec())

From: Vivek Goyal
Date: Tue May 31 2011 - 17:51:45 EST


On Mon, May 30, 2011 at 02:13:33PM +0900, KOSAKI Motohiro wrote:
> (2011/05/27 5:10), Andrew Morton wrote:
> > On Thu, 3 Feb 2011 13:53:01 +0900 (JST)
> > KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx> wrote:
> >
> >>>> I wrote why this is no good idea by another mail. Please see it.
> >>>> Anyway you have a right to don't use this feature.
> >>>>
> >>>
> >>> But you have not explained that why do you need to hook into crash_kexec()
> >>> and you have also not explained why do you need to send out kdump_msg()
> >>> notification if kdump is configured.
> >>>
> >>> Some detailed explanation here would help.
> >>
> >> Hi,
> >>
> >> I send it you now :)
> >>
> >
> > What happened with this? kexec-remove-kmsg_dump_kexec.patch has two acks
> > and one unexplained nack :(
>
> http://groups.google.com/group/linux.kernel/browse_thread/thread/1084f406573d76ac/ee19e34b45f83536?lnk=raot&pli=1
>
> At last mail, Vivek proposed move kms_dump() instead remove. and I asked following question and
> I've got no response. I'm still waiting his.
>
>
> > I'm sorry I've missed this mail long time.
> >
> >> > ---------------------------------------------------------------------
> >> > @@ -74,6 +75,7 @@ NORET_TYPE void panic(const char * fmt, ...)
> >> > dump_stack();
> >> > #endif
> >> > + kmsg_dump(KMSG_DUMP_PANIC);
> >> > /*
> >> > * If we have crashed and we have a crash kernel loaded let it handle
> >> > * everything else.
> >> > * Do we want to call this before we try to display a message?
> >> > */
> >> > crash_kexec(NULL);
> >> > ---------------------------------------------------------------------
> >> And I think to compensate for that somebody introduced additional
> >> kmsg_dump(KEXEC) call inside crash_kexec() and put it under CONFIG
> >> option so that one can change the behavior based on config options.
> >> I think this makes the logic somewhat twisted and an unnecessary call
> >> inside crash_kexec(). So until and unless there is a strong reason we
> >> can get rid of KEXEC event and move kmsg_dump call before crash_kexec()
> >> for now and see how does it go, IMHO.
> >
> >
> > I think I can agree your proposal. But could you please explain why do
> > you think kmsg _before_ kdump and kmsg _in_ kdump are so different?
> > I think it is only C level difference. CPU don't care C function and
> > anyway the kernel call kmsg_dump() because invoke second kernel even
> > if you proposal applied.
> > It is only curious. I'm not against your proposal.
> > Thanks.

Few reasons.

- There is no correlation between crash_kexec() and kdump_msg(). What
you are creating is equivalent of panic notifiers and calling those
notifiers before dump happened. So calling it inside of crash_kexec()
does not make much sense from code point of view.

- Why does somebody need to keep track of event KMSG_DUMP_KEXEC?

- There is one kernel CONFIG option introduce which looks completely
superfluous.

My general take on the whole issue.

- In general I think exporting a hook to module so that they can do
anything before crash is a bad idea. Now this can be overloaded to
do things like sending crash notifications in clustered environement
where we recommend doing it in second kernel.

- Even if we really have to do it, there seemed to be two concern
areas.

- Reliability of kdump_msg() generic infrastructure and its
capability in terms of handling races with other cpus and
NMIs.

- Reliability of module which is getting the callback from
kdump_msg().

I think in one of the mails I was discussing that common infrastructure
between kdump and kmsg_dump() can be put in a separate function, like
stopping all cpus etc to avoid races in generic infrastrucutre and
then first we can all kmsg_dump() and then crash_kexec().

But this still does not provide us any protection against modules getting
control after crash and possiblly worsen the situation.

Thanks
Vivek
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/