Re: Random GCC segfaults -- Was: [2.6.16] slab error inslab_destroy_objs(): cache `radix_tree_node'...

From: Paolo Ornati
Date: Tue Mar 28 2006 - 09:26:23 EST


On Tue, 28 Mar 2006 15:23:46 +0200
Pavel Machek <pavel@xxxxxx> wrote:

> I'd really get new RAM... If the machine is "known bad", debugging on
> it is likely waste of time.

I know.

The fact is that when I was having memory problems I also have
filesystem corruption associated.

After fixing the first problem (easly reproducible) the filesystem
corruption become more rare.

After fixing the second problem (address detected by DEBUG_SLAB) I have
NEVER seen a single filesystem corruption yet.

Additionally I have tested 2.6.16-rc1 (found BAD after 20 min) and now
I'm re-testing with 2.6.15.6 --> it is compiling by some hours without
a single segfault.

So, I think it could be:

1) a memory problem exposed by the different behaviour of the kernel

2) a kernel BUG somewhere between 2.6.15 / 2.6.16.

Maybe, before using git-bisect, I can simply try to reproduce the
problem using only the first memory module (the bad one) and then try
with only the second one (good).

This should reveal if it is a memory problem or not (or maybe the
combination of GCC eating a lot of memory AND only 256MB of RAM instead
of 512MB will make the system swap a lot resulting in less memory stress
and thus make me unable to reproduce the problem ;)

--
Paolo Ornati
Linux 2.6.15.6 on x86_64
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/