[OOPS] 2.2.17 free_wait+49/116 / possible memory corruption ?

From: Florian Lohoff (flo@rfc822.org)
Date: Thu Oct 05 2000 - 07:29:45 EST


Here,
i suspect memory corruption as the identical kernel runs on
identical hardware with identical installation on 60+ Machines.

Before the crash i see this in the syslog ..

Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000900)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000a00)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000900
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000500
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000c00
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000900
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000300
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000a00
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100

(flo@ping)/tmp# ksymoops -k /dev/null -l /dev/null -m /tmp/System.map-2.2.17 -o /tmp </tmp/oops
ksymoops 2.3.4 on i686 2.2.16. Options used
     -V (default)
     -k /dev/null (specified)
     -l /dev/null (specified)
     -o /tmp (specified)
     -m /tmp/System.map-2.2.17 (specified)

Error (regular_file): read_ksyms /dev/null is not a regular file, ignored
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Sep 30 14:41:36 mgr2 kernel: Unable to handle kernel NULL pointer dereference at virtual address 000005ed
Sep 30 14:41:36 mgr2 kernel: current->tss.cr3 = 07aea000, %cr3 = 07aea000
Sep 30 14:41:36 mgr2 kernel: *pde = 00000000
Sep 30 14:41:36 mgr2 kernel: Oops: 0000
Sep 30 14:41:36 mgr2 kernel: CPU: 0
Sep 30 14:41:36 mgr2 kernel: EIP: 0010:[free_wait+49/116]
Sep 30 14:41:36 mgr2 kernel: EFLAGS: 00010087
Sep 30 14:41:36 mgr2 kernel: eax: 00000006 ebx: c44bf160 ecx: 000005e9 edx: 000005e9
Sep 30 14:41:36 mgr2 kernel: esi: c44bf15c edi: c44bf000 ebp: 00000287 esp: c7a43eec
Sep 30 14:41:36 mgr2 kernel: ds: 0018 es: 0018 ss: 0018
Sep 30 14:41:36 mgr2 kernel: Process named (pid: 141, process nr: 9, stackpage=c7a43000)
Sep 30 14:41:36 mgr2 kernel: Stack: 00800000 00000018 00000004 00000000 00000000 c012e0ab c44bf000 00000000
Sep 30 14:41:36 mgr2 kernel: 00000018 c012e07c 00000001 00000004 c08f5438 00000000 00000001 c08f5438
Sep 30 14:41:36 mgr2 kernel: de826e24 c7a42000 c7a42000 00000000 00000000 c44bf000 00000009 c2ad7000
Sep 30 14:41:36 mgr2 kernel: Call Trace: [do_select+487/512] [do_select+440/512] [sys_select+881/1176] [sys_gettimeofday+32/148] [system_call+52/56]
Sep 30 14:41:36 mgr2 kernel: Code: 8b 41 04 39 d8 74 09 89 c2 8b 42 04 39 d8 75 f7 89 4a 04 55
Using defaults from ksymoops -t elf32-i386 -a i386

Code; 00000000 Before first symbol
00000000 <_EIP>:
Code; 00000000 Before first symbol
   0: 8b 41 04 mov 0x4(%ecx),%eax
Code; 00000003 Before first symbol
   3: 39 d8 cmp %ebx,%eax
Code; 00000005 Before first symbol
   5: 74 09 je 10 <_EIP+0x10> 00000010 Before first symbol
Code; 00000007 Before first symbol
   7: 89 c2 mov %eax,%edx
Code; 00000009 Before first symbol
   9: 8b 42 04 mov 0x4(%edx),%eax
Code; 0000000c Before first symbol
   c: 39 d8 cmp %ebx,%eax
Code; 0000000e Before first symbol
   e: 75 f7 jne 7 <_EIP+0x7> 00000007 Before first symbol
Code; 00000010 Before first symbol
  10: 89 4a 04 mov %ecx,0x4(%edx)
Code; 00000013 Before first symbol
  13: 55 push %ebp

1 error issued. Results may not be reliable.
You have new mail in /var/spool/mail/flo

And directly afterwards:

(flo@ping)/usr/src/linux# ksymoops -k /dev/null -l /dev/null -m /tmp/System.map-2.2.17 -o /tmp </tmp/oops2
ksymoops 2.3.4 on i686 2.2.16. Options used
     -V (default)
     -k /dev/null (specified)
     -l /dev/null (specified)
     -o /tmp (specified)
     -m /tmp/System.map-2.2.17 (specified)

Error (regular_file): read_ksyms /dev/null is not a regular file, ignored
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Sep 30 14:41:36 mgr2 kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000804
Sep 30 14:41:36 mgr2 kernel: current->tss.cr3 = 06ea4000, %cr3 = 06ea4000
Sep 30 14:41:36 mgr2 kernel: *pde = 00000000
Sep 30 14:41:36 mgr2 kernel: Oops: 0000
Sep 30 14:41:36 mgr2 kernel: CPU: 0
Sep 30 14:41:36 mgr2 kernel: EIP: 0010:[free_wait+16/116]
Sep 30 14:41:36 mgr2 kernel: EFLAGS: 00010206
Sep 30 14:41:36 mgr2 kernel: eax: c0296608 ebx: 00000000 ecx: c01cf940 edx: c0262ec0
Sep 30 14:41:36 mgr2 kernel: esi: c468d10c edi: 00000800 ebp: 00000000 esp: c5d7deec
Sep 30 14:41:36 mgr2 kernel: ds: 0018 es: 0018 ss: 0018
Sep 30 14:41:36 mgr2 kernel: Process usrd (pid: 30075, process nr: 63, stackpage=c5d7d000)
Sep 30 14:41:36 mgr2 kernel: Stack: c60edbc0 00000000 00000000 00000000 00000000 c012e0ab c468d000 c0110528
Sep 30 14:41:36 mgr2 kernel: 00000000 c012e07c 00000000 00000000 c60edbc0 00000000 c2070900 00000001
Sep 30 14:41:36 mgr2 kernel: 08112be0 00000282 c5d7c000 00000000 00000000 c468d000 0000030a c011acf3
Sep 30 14:41:36 mgr2 kernel: Call Trace: [do_select+487/512] [process_timeout+0/20] [do_select+440/512] [handle_mm_fault+279/316] [sys_select+881/1176] [do_page_fault+263/992] [error_code+45/52]
Sep 30 14:41:36 mgr2 kernel: Code: 8b 47 04 89 c6 c1 e6 04 03 77 08 85 c0 74 3b 90 ff 4f 04 83
Using defaults from ksymoops -t elf32-i386 -a i386

Code; 00000000 Before first symbol
00000000 <_EIP>:
Code; 00000000 Before first symbol
   0: 8b 47 04 mov 0x4(%edi),%eax
Code; 00000003 Before first symbol
   3: 89 c6 mov %eax,%esi
Code; 00000005 Before first symbol
   5: c1 e6 04 shl $0x4,%esi
Code; 00000008 Before first symbol
   8: 03 77 08 add 0x8(%edi),%esi
Code; 0000000b Before first symbol
   b: 85 c0 test %eax,%eax
Code; 0000000d Before first symbol
   d: 74 3b je 4a <_EIP+0x4a> 0000004a Before first symbol
Code; 0000000f Before first symbol
   f: 90 nop
Code; 00000010 Before first symbol
  10: ff 4f 04 decl 0x4(%edi)
Code; 00000013 Before first symbol
  13: 83 00 00 addl $0x0,(%eax)

1 error issued. Results may not be reliable.

Other suspicions ?

Flo

-- 
Florian Lohoff		flo@rfc822.org		      	+49-5201-669912
      "Write only memory - Oops. Time for my medication again ..."

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sat Oct 07 2000 - 21:00:16 EST