2.0.36 kernel faults

B. James Phillippe (bryan@terran.org)
Tue, 6 Apr 1999 12:44:54 -0700 (PDT)


Greetings,

I'm running 2.0.36 with the patch for ipchains, on an intel Pentium system.
I have been seeing several problems lately, which I did not have before.
The first is a string of kernel faults starting after three days of uptime
(which includes some fairly heavy network and processor usage; this system
is a web server, masquerading router and desktop workstation). The second
happened after less than a day of uptime. Here is the first string of
problems, which eventually resulted in a hung system (very slow death):

Apr 5 14:33:20 : general protection: 0000
Apr 5 14:33:20 : CPU: 0
Apr 5 14:33:20 : EIP: 0010:[do_page_fault+582/732]
Apr 5 14:33:20 : EFLAGS: 00010202
Apr 5 14:33:20 : eax: 00d75eff ebx: 025e286e ecx: 00d75e68 edx: 0000003a
Apr 5 14:33:20 : esi: 001abc58 edi: 001abc58 ebp: 00d75e7c esp: 00d75e50
Apr 5 14:33:20 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 5 14:33:20 : Process httpd (pid: 13680, process nr: 68, stackpage=00d75000)
Apr 5 14:33:20 : Stack: 00d75e68 00000004 01e25d04 00000000 01ecf63a 001abc58 00000000 001ace08
Apr 5 14:33:20 : 01ecf63a 025e2810 00111c24 00000000 0012dcb1 00000004 00000000 bffffa0c
Apr 5 14:33:20 : bffffa14 00000000 0327d000 00000001 0327d000 0012df62 00000004 00d75f78
Apr 5 14:33:20 : Call Trace: [mem_init+496/556] [filldir+133/164] [do_select+114/632] [alloc_skb+102/332] [tcp_rcv+2476/2588] [tcp_data+325/536] [ip_getsockopt+186/884]
Apr 5 14:33:21 : [arp_ioctl+530/808] [sys_accept+469/612] [sys_sendto+271/512] [do_signal+41/632]
Apr 5 14:33:21 : Code: 0f 90 8b 45 04 50 68 eb 8d 19 00 e8 ee 23 00 00 8d 65 d8 5b

[Runs fine for many hours]

Apr 6 04:09:10 : general protection: 0000
Apr 6 04:09:10 : CPU: 0
Apr 6 04:09:10 : EIP: 0010:[do_page_fault+582/732]
Apr 6 04:09:10 : EFLAGS: 00010202
Apr 6 04:09:10 : eax: ec458dff ebx: 02563810 ecx: 00000001 edx: 001abc58
Apr 6 04:09:10 : esi: 03c24810 edi: 03c24810 ebp: 02b45f3c esp: 02b45f10
Apr 6 04:09:10 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 6 04:09:10 : Process sendmail (pid: 25083, process nr: 61, stackpage=02b45000)
Apr 6 04:09:10 : Stack: ec458d01 00000000 03c01a18 02b45f54 00000000 03c24810 00000002 0004800c
Apr 6 04:09:10 : 00163831 00000003 00000001 03c01a18 001251a5 03c01a18 038a9300 03fcf980
Apr 6 04:09:10 : 03c01a18 02563810 03c01a48 0015b4b9 03c01a18 00000000 00000002 038a9300
Apr 6 04:09:10 : Call Trace: [fat_readdirx+1781/2028] [unregister_chrdev+97/104] [ext2_new_block+1537/2348] [ext2_new_block+1656/2348] [ip_fw_masq_icmp+755/1156] [get_empty_filp+28/180] [do_signal+41/632]
Apr 6 04:09:10 : Code: 0f 90 8b 45 04 50 68 eb 8d 19 00 e8 ee 23 00 00 8d 65 d8 5b
Apr 6 04:09:12 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:09:12 : q = 03c01a4c
Apr 6 04:09:12 : *q = 032f2f54
Apr 6 04:09:12 : general protection: 0000
Apr 6 04:09:12 : CPU: 0
Apr 6 04:09:12 : EIP: 0010:[unregister_blkdev+12/104]
Apr 6 04:09:12 : EFLAGS: 00010086
Apr 6 04:09:12 : eax: f000ef6f ebx: 032f2f54 ecx: 02b45f54 edx: f000ef6f
Apr 6 04:09:12 : esi: 03c01a18 edi: 032f2f54 ebp: 00000246 esp: 032f2f44
Apr 6 04:09:12 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 6 04:09:12 : Process sendmail (pid: 25085, process nr: 69, stackpage=032f2000)
Apr 6 04:09:12 : Stack: 03c01a18 03b6d800 03fcfb00 03c01a18 004f2c0c 02b45f54 0015b4b9 03c01a18
Apr 6 04:09:12 : 00000000 00000002 03b6d800 bfffdbcc 032f2f8c 00000240 00000000 00000344
Apr 6 04:09:12 : 00000000 03c01a18 0015b530 03b6d800 00000001 00159783 03b6d800 004f2c0c
Apr 6 04:09:12 : Call Trace: [ext2_new_block+1537/2348] [ext2_new_block+1656/2348] [ip_fw_masq_icmp+755/1156] [get_empty_filp+28/180] [do_signal+41/632]
Apr 6 04:09:12 : Code: 8b 42 04 39 d8 74 05 89 c2 eb f5 90 89 4a 04 55 9d ff 4e 1c
Apr 6 04:09:21 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:09:21 : q = 03c01a4c
Apr 6 04:09:21 : *q = 032f2f54
Apr 6 04:12:21 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:12:21 : q = 03c01a4c
Apr 6 04:12:21 : *q = 032f2f54
Apr 6 04:15:32 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:15:32 : q = 03c01a4c
Apr 6 04:15:32 : *q = 032f2f54
Apr 6 04:26:40 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:26:40 : q = 03c01a4c
Apr 6 04:26:40 : *q = 03cd1f54
Apr 6 04:26:40 : general protection: 0000
Apr 6 04:26:40 : CPU: 0
Apr 6 04:26:40 : EIP: 0010:[unregister_blkdev+12/104]
Apr 6 04:26:40 : EFLAGS: 00010086
Apr 6 04:26:40 : eax: f000ef6f ebx: 03cd1f54 ecx: 032f2f54 edx: f000ef6f
Apr 6 04:26:40 : esi: 03c01a18 edi: 03cd1f54 ebp: 00000246 esp: 03cd1f44
Apr 6 04:26:40 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 6 04:26:40 : Process sendmail (pid: 25096, process nr: 61, stackpage=03cd1000)
Apr 6 04:26:40 : Stack: 03c01a18 01122100 03fcfa80 03c01a18 011cf414 032f2f54 0015b4b9 03c01a18
Apr 6 04:26:40 : 00000000 00000002 01122100 bfffdbc4 03cd1f8c 00000240 00000000 00000344
Apr 6 04:26:40 : 00000000 03c01a18 0015b530 01122100 00000001 00159783 01122100 011cf414
Apr 6 04:26:40 : Call Trace: [ext2_new_block+1537/2348] [ext2_new_block+1656/2348] [ip_fw_masq_icmp+755/1156] [get_empty_filp+28/180] [do_signal+41/632]
Apr 6 04:26:40 : Code: 8b 42 04 39 d8 74 05 89 c2 eb f5 90 89 4a 04 55 9d ff 4e 1c
Apr 6 04:26:42 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:26:42 : q = 03c01a4c
Apr 6 04:26:42 : *q = 03601f54
Apr 6 04:26:42 : general protection: 0000
Apr 6 04:26:42 : CPU: 0
Apr 6 04:26:42 : EIP: 0010:[unregister_blkdev+12/104]
Apr 6 04:26:42 : EFLAGS: 00010086
Apr 6 04:26:42 : eax: f000ef6f ebx: 03601f54 ecx: 03cd1f54 edx: f000ef6f
Apr 6 04:26:42 : esi: 03c01a18 edi: 03601f54 ebp: 00000246 esp: 03601f44
Apr 6 04:26:42 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 6 04:26:42 : Process sendmail (pid: 25098, process nr: 69, stackpage=03601000)
Apr 6 04:26:42 : Stack: 03c01a18 018c4b00 03fcfb00 03c01a18 004f2c0c 03cd1f54 0015b4b9 03c01a18
Apr 6 04:26:42 : 00000000 00000002 018c4b00 bfffdbcc 03601f8c 00000240 00000000 00000344
Apr 6 04:26:42 : 00000000 03c01a18 0015b530 018c4b00 00000001 00159783 018c4b00 004f2c0c
Apr 6 04:26:42 : Call Trace: [ext2_new_block+1537/2348] [ext2_new_block+1656/2348] [ip_fw_masq_icmp+755/1156] [get_empty_filp+28/180] [do_signal+41/632]
Apr 6 04:26:42 : Code: 8b 42 04 39 d8 74 05 89 c2 eb f5 90 89 4a 04 55 9d ff 4e 1c
Apr 6 04:26:48 : wait_queue is bad (eip = 00126dcb)
Apr 6 04:26:48 : q = 03c01a4c
Apr 6 04:26:48 : *q = 03601f54

[System finally hangs, needs to be restarted]

Apr 6 07:44:34 : klogd 1.3-3, log source = /proc/kmsg started.

[Ran for this long, and then...]

Apr 6 12:12:24 : invalid operand: 0000
Apr 6 12:12:24 : CPU: 0
Apr 6 12:12:24 : EIP: 0010:[3c509:el3_probe+-19054728/5440]
Apr 6 12:12:24 : EFLAGS: 00010202
Apr 6 12:12:24 : eax: 00000001 ebx: 0220c001 ecx: 03620f4c edx: 00001001
Apr 6 12:12:24 : esi: 0220c005 edi: 03620f88 ebp: 03620f78 esp: 03620f2c
Apr 6 12:12:24 : ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
Apr 6 12:12:24 : Process rwhod (pid: 437, process nr: 24, stackpage=03620000)
Apr 6 12:12:24 : Stack: 0001ef00 0012baef 00000003 0220c001 00000000 03620fb4 03620fb4 bffff78c
Apr 6 12:12:24 : 03e2a200 0012bbb2 0220c000 03620f7c 03620f78 0001ef00 03620f88 00000000
Apr 6 12:12:24 : 03620fb4 0000000a 03620fb4 0000000a bffff78c 0012bce0 0220c000 00000000
Apr 6 12:12:24 : Call Trace: [do_pipe+147/268] [getname+74/320] [permission+40/212] [block_read+713/1464] [do_signal+41/632]
Apr 6 12:12:24 : Code: 8c f7 ff bf e0 bc 12 00 00 c0 20 02 00 00 00 00 01 00 00 00

-bp

--
B. James Phillippe		. bryan@terran.org
Software Engineer, WGT Inc.	. http://www.terran.org/~bryan

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/