Re: Automatic Kernel Bug Report

From: Daniel Bonekeeper
Date: Sun Jul 09 2006 - 16:36:03 EST


On 7/9/06, Diego Calleja <diegocg@xxxxxxxxx> wrote:
El Sun, 9 Jul 2006 16:01:58 -0400,
"Daniel Bonekeeper" <thehazard@xxxxxxxxx> escribió:

> Independent of wheter you think that this is useful or not, do you see
> any cleaner way to send those reports, having in mind that the
> userspace may not be responsive ?

Kdump (http://lse.sourceforge.net/kdump/) would be useful for
many cases.

I agree.

WRT to the bugs where the system completely locks up and doesn't
leaves any option for automatic bug reports: You don't really care
about those, because they're _so_ annoying that people reports them
manually.

Yeah, or they just switch the ethernet card, get it working and go out
for a coffee. =]
In those nasty cases, we really don't have much to do, since we can't
provide a mechanism on the kernel to run when we detect that it is not
responsive (can't we ?). A rapid mindstorm: A frozen system can't
preempt tasks. If we can keep track of the timestamp of the last time
the schedule ran, and we see that it was like 5 or 10 seconds ago, it
means that something is very wrong on the kernel side. We may have
several levels of fucked-up-ness in which, at a certain level,
interrupts are still called (and we can call our code here to check
the sanity of the system). If we see that we didn't schedule for a
long time, we can trigger the report system (then again, ideally we
don't need userspace to do that). Then we just pray, hoping that the
the report gets thru the networking. Of course there is no magic
solution, but this could help on a great deal of cases.


Daniel


--
What this world needs is a good five-dollar plasma weapon.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/