Re: [2.6.13-rc6-git13/sparc64]: Slab corruption (possible stack orbuffer-cache corruption)

From: Tomasz Kłoczko
Date: Tue Sep 13 2005 - 05:12:47 EST


On Mon, 12 Sep 2005, David S. Miller wrote:

From: Tomasz Kłoczko <kloczek@xxxxxxxxxxxxxxxxxx>
Date: Mon, 12 Sep 2005 16:37:04 +0200 (CEST)

On first it looks like stack or buffer-cache corruption.

Slab corruption: (Not tainted) start=fffff8005d9be708, len=808
Redzone: 0x5a2cf071/0x5a2cf071.
Last user: [destroy_inode+100/144](destroy_inode+0x64/0x90)
Call Trace:
[00000000004759f4] free_block+0x160/0x1b4
[0000000000475bb8] cache_flusharray+0x98/0x128
[0000000000475704] kmem_cache_free+0x68/0x94
[00000000004a56c4] destroy_inode+0x64/0x90

One way for destroy_inode() to be called twice on the same
inode would be if atomic_dec_and_test() was buggy in some way.
I think it might be on sparc64.

Therefore, would you mind giving this patch a test?

I will. Thanks.

Dave I have next thing.
In kernel 2.6.13-rc6-git13 I observe relative very intensive emmiting some kernel messages. From yesterday logs:

# grep "^Sep 12" /var/log/messages | grep kernel: | uniq | cut -d " " -f 6- | sort | uniq -c | sort -n | tail -n 2
509 svc: bad direction 268435456, dropping request
653 eth0: Happy Meal out of receive descriptors, packet dropped.

As you see one of this two messagess occures avarange one time per ~two minutes.
Second looks like some error in sunhme.c. eth0 it is:

0001:00:01.1 Ethernet controller: Sun Microsystems Computer Corp. Happy Meal (rev 01)

kloczek
--
-----------------------------------------------------------
*Ludzie nie mają problemów, tylko sobie sami je stwarzają*
-----------------------------------------------------------
Tomasz Kłoczko, sys adm @zie.pg.gda.pl|*e-mail: kloczek@xxxxxxxxxxxxxxxxxx*