Please HELP! System hangs frequently...

Koles Sandor (koles@elender.hu)
Thu, 22 Apr 1999 16:50:45 +0200


We are running Linux on a Dell 6100 server (usually 20-40 concurrent
users).
Since we changed the kernel from 2.0.30 to 2.2.x, the system hangs
frequently.
Nothing works, except that I can still ping the box from other machines.
The SysRq Emergency Remount R/O does not complete, although it does write
a message to the console. The other SysRq functions seem to respond but
have no effect (after sending the TERM or KILL signal, the tasks are still
listed when I press Alt-SysRq-T).

After a reboot the filesystem check runs for about 30-40 minutes (34 GB).
Our config:

Dell 6100, 4 x Pentium Pro 200 MHz, 512 KB cache
AMI MegaRAID controller, 9 x 9 GB hard disks, RAID 5, one big partition
768 MB RAM
Now: SMP Linux 2.2.6 with the ac1 patch (previously: 2.2.3, 2.2.4, 2.2.5
without ac patches)
Software: iBCS, Progress 8.2
The box is an application and file server (with Samba).

We have used the AMI MegaRAID controller only since switching to the new
kernel. Before that we used the on-board Adaptec controller.

Before the system hangs, "crashes" like the following appear on the console
(and not only in the smbd process):

Apr 22 15:30:50 leon kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000008
Apr 22 15:30:50 leon kernel: current->tss.cr3 = 14360000, pr3 = 14360000
Apr 22 15:30:50 leon kernel: *pde = 00000000
Apr 22 15:30:50 leon kernel: Oops: 0000
Apr 22 15:30:50 leon kernel: CPU: 0
Apr 22 15:30:50 leon kernel: EIP: 0010:[<c0130dae>]
Apr 22 15:30:50 leon kernel: EFLAGS: 00010282
Apr 22 15:30:50 leon kernel: eax: 00000000 ebx: cdd2d200 ecx: cdd2d200 edx: cdd2d200
Apr 22 15:30:50 leon kernel: esi: cdd2d200 edi: 00000000 ebp: 00000001 esp: d3fd5f44
Apr 22 15:30:50 leon kernel: ds: 0018 es: 0018 ss: 0018
Apr 22 15:30:50 leon kernel: Process smbd (pid: 2078, process nr: 193, stackpage=d3fd5000)
Apr 22 15:30:50 leon kernel: Stack: 00000000 00000001 d3bafcb0 00000004 ffffffeb d3bafcb0 c012d77a d3bafcb0
Apr 22 15:30:50 leon kernel: 00000004 cdd2d200 ffffffe9 00000000 ef92e000 00000000 d3fd4000 c012f916
Apr 22 15:30:50 leon kernel: bfffd758 c012f95b c0127009 cdd2d200 d3fd4000 c0125ff2 cdd2d200 d3fd4000
Apr 22 15:30:50 leon kernel: Call Trace: [<c012d77a>] [<c012f916>] [<c012f95b>] [<c0127009>] [<c0125ff2>] [<c0107aec>]
Apr 22 15:30:50 leon kernel: Code: 8b 40 08 89 44 24 10 8b 6c 24 10 83 c5 70 8b 54 24 10 8b 72
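
The bracketed addresses in the EIP and Call Trace lines only become
meaningful once they are matched against the System.map of the exact kernel
that produced them (ksymoops is the usual tool for this). As a rough
illustration of that lookup, here is a minimal Python sketch. It assumes a
System.map in the usual "address type name" format. The function names
(parse_system_map, resolve, oops_addresses) and the sample map entries are
hypothetical and made up for illustration; they are not symbols from this
kernel.

```python
import bisect
import re

# Matches addresses printed as [<c0130dae>] in an oops report.
ADDR_RE = re.compile(r"\[<([0-9a-f]{8})>\]")

# HYPOTHETICAL System.map excerpt: names and addresses are invented.
SAMPLE_MAP = """\
c012d700 T hypothetical_func_a
c0130d00 T hypothetical_func_b
c0130e00 T hypothetical_func_c
"""


def parse_system_map(text):
    """Return a sorted list of (address, name) from System.map-style text."""
    symbols = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 3:  # address, type letter, symbol name
            symbols.append((int(parts[0], 16), parts[2]))
    symbols.sort()
    return symbols


def resolve(addr, symbols):
    """Return (name, offset) for the nearest symbol at or below addr.

    Returns None if addr lies below the first symbol in the map.
    """
    starts = [start for start, _ in symbols]
    i = bisect.bisect_right(starts, addr) - 1
    if i < 0:
        return None
    base, name = symbols[i]
    return name, addr - base


def oops_addresses(log_text):
    """Extract every [<address>] from an oops log, in order of appearance."""
    return [int(match, 16) for match in ADDR_RE.findall(log_text)]
```

With the real System.map in place of SAMPLE_MAP, calling resolve() on each
value returned by oops_addresses() gives a "symbol+offset" pair for the EIP
and for every entry in the call trace.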

Thanks for any HELP!
Sandor Koles
