Re: oopses

Oskar Pearson (oskar@is.co.za)
Thu, 20 Mar 1997 23:13:09 +0200 (GMT)


Chris Orem wrote:
>
> I was getting these on a system running 2.0.27. I replaced the RAM and they
> went away; the system was stable up until a few days ago...
Well, I am running a mem86 test now, and I don't think that RAM is the problem:
it has done 3 complete passes with no errors. I will carry on with it,
and mail people if it does turn out to be the cause.

When I tried to compile memtest86, it died on me, with the gcc process going
into disk wait and never coming out...

> Ok - I am a newbie with this oops debug stuff, so flame me!
>
> Linux oztop.is.co.za 2.0.28 #3 Thu Mar 20 16:28:32 GMT+2 1997 i586 unknown
>
> Multiple oopses close together...
>
> Unable to handle kernel paging request at virtual address c4000000
> current->tss.cr3 = 0088a000,
> *pde = 00000000
> Oops: 0000
> CPU: 0
> EIP: 0010:[<001232a8>]
> EFLAGS: 00010206
> eax: 04000000 ebx: 00a30302 ecx: 00000400 edx: 000033a9
> esi: 0001739a edi: 00000302 ebp: 00017098 esp: 00839e94
> ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
> Process ld (pid: 596, process nr: 24, stackpage=00839000)
> Stack: 00a36908 00017098 00000302 00000003 002d25d0 001248d9 00000302 00017098
> 00000400 00839f40 00000428 00000000 002d25d0 00017098 00000302 00000100
> 00a364c8 00820302 00000000 00a364c8 00a3650c 00a36440 00839f3c 00000427
> Call Trace: [<001248d9>] [<00124cf7>] [<0011b946>] [<0011ba1a>] [<001215e6>] [<0010a635>]
> Code: 39 28 75 28 66 39 58 04 75 22 8b 4c 24 20 39 48 20 74 22 57
>
> >>EIP: 1232a8 <get_hash_table+30/b4>
> Trace: 1248d9 <brw_page+139/35c>
> Trace: 124cf7 <generic_readpage+73/80>
> Trace: 11b946 <generic_file_read+40e/5b4>
> Trace: 11ba1a <generic_file_read+4e2/5b4>
> Trace: 1215e6 <sys_read+8a/b0>
> Trace: 10a635 <system_call+55/80>
>
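
For anyone decoding such a trace by hand, the address-to-symbol step shown
above can be sketched roughly as follows. This is only an illustration: the
SYMBOLS table and the resolve() helper are hypothetical, and the start
addresses are back-computed from the decoded trace itself (address minus
offset) rather than read from a real System.map.

```python
import bisect

# Function start addresses, back-computed from the decoded trace
# (address minus offset). A real System.map lists every kernel symbol;
# this sparse table is an assumption made for illustration only.
SYMBOLS = sorted([
    (0x10a635 - 0x55, "system_call"),
    (0x1215e6 - 0x8a, "sys_read"),
    (0x11b946 - 0x40e, "generic_file_read"),
    (0x124cf7 - 0x73, "generic_readpage"),
    (0x1248d9 - 0x139, "brw_page"),
    (0x1232a8 - 0x30, "get_hash_table"),
])
STARTS = [start for start, _ in SYMBOLS]


def resolve(addr):
    """Return 'symbol+offset' for the nearest symbol at or below addr."""
    i = bisect.bisect_right(STARTS, addr) - 1
    if i < 0:
        return None  # address lies below every known symbol
    start, name = SYMBOLS[i]
    return f"{name}+{addr - start:x}"


if __name__ == "__main__":
    for addr in (0x1232a8, 0x1248d9, 0x11ba1a):
        print(f"{addr:x} <{resolve(addr)}>")
```

With a table this sparse, an address inside a function that is not listed
would be attributed to the nearest listed symbol below it, so a complete
symbol map is needed for anything beyond this trace.
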
> Code: 1232a8 <get_hash_table+30/b4> cmpl %ebp,(%eax)
> Code: 1232aa <get_hash_table+32/b4> jne 1232d4 <get_hash_table+5c/b4>
> Code: 1232ac <get_hash_table+34/b4> cmpw %bx,0x4(%eax)
> Code: 1232b0 <get_hash_table+38/b4> jne 1232d4 <get_hash_table+5c/b4>
> Code: 1232b2 <get_hash_table+3a/b4> movl 0x20(%esp,1),%ecx
> Code: 1232b6 <get_hash_table+3e/b4> cmpl %ecx,0x20(%eax)
> Code: 1232b9 <get_hash_table+41/b4> je 1232dd <get_hash_table+65/b4>
> Code: 1232bb <get_hash_table+43/b4> pushl %edi
> Code: 1232bc <get_hash_table+44/b4>
>
> If I try to run the command again, it goes into an endless disk wait.
>
> =======================
> Unable to handle kernel paging request at virtual address c4000000
> current->tss.cr3 = 00342000,
> *pde = 00000000
> Oops: 0000
> CPU: 0
> EIP: 0010:[<001232a8>]
> EFLAGS: 00010206
> eax: 04000000 ebx: 00030302 ecx: 00817088 edx: 000033a9
> esi: 00033385 edi: 00000302 ebp: 00033087 esp: 007f1e14
> ds: 0018 es: 0018 fs: 002b gs: 002b ss: 0018
> Process find (pid: 698, process nr: 26, stackpage=007f1000)
> Stack: 00033087 00000001 00810302 00000400 00202040 00123a82 00000302 00033087
> 00000400 00033087 00000000 00817000 00000000 00081000 00000302 0015857c
> 00000302 00033087 00000400 00000008 00000000 007f1f44 00817000 00000002
> Call Trace: [<00123a82>] [<0015857c>] [<00158ab4>] [<00158ded>] [<001567ac>] [<0012a686>] [<0012abf7>]
> [<00120f95>] [<0012c507>] [<0012c3cc>] [<0010a635>]
> Code: 39 28 75 28 66 39 58 04 75 22 8b 4c 24 20 39 48 20 74 22 57
>
> >>EIP: 1232a8 <get_hash_table+30/b4>
> Trace: 123a82 <getblk+3a/468>
> Trace: 15857c <inode_getblk+44/198>
> Trace: 158ab4 <ext2_getblk+b8/22c>
> Trace: 158ded <ext2_bread+1d/7c>
> Trace: 1567ac <ext2_readdir+e0/618>
> Trace: 12a686 <lookup+da/f4>
> Trace: 12abf7 <open_namei+273/3c8>
> Trace: 120f95 <do_open+59/124>
> Trace: 12c507 <sys_getdents+97/c8>
> Trace: 12c3cc <filldir>
> Trace: 10a635 <system_call+55/80>
>
> Code: 1232a8 <get_hash_table+30/b4> cmpl %ebp,(%eax)
> Code: 1232aa <get_hash_table+32/b4> jne 1232d4 <get_hash_table+5c/b4>
> Code: 1232ac <get_hash_table+34/b4> cmpw %bx,0x4(%eax)
> Code: 1232b0 <get_hash_table+38/b4> jne 1232d4 <get_hash_table+5c/b4>
> Code: 1232b2 <get_hash_table+3a/b4> movl 0x20(%esp,1),%ecx
> Code: 1232b6 <get_hash_table+3e/b4> cmpl %ecx,0x20(%eax)
> Code: 1232b9 <get_hash_table+41/b4> je 1232dd <get_hash_table+65/b4>
> Code: 1232bb <get_hash_table+43/b4> pushl %edi
> Code: 1232bc <get_hash_table+44/b4> ->should it just stop here?
>
> Seems like disk corruption?
>
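
Both oopses fault on the same instruction, cmpl %ebp,(%eax), with eax
holding 04000000, and both report address c4000000. A small sketch of that
arithmetic follows. It assumes that 2.0.x i386 kernels address memory through
a data segment whose base is 0xC0000000; that constant and the helper names
are assumptions for illustration, not something stated in the oops output.

```python
# Assumption: on 2.0.x i386 the kernel data segment has base 0xC0000000,
# so a kernel pointer is offset by that amount in the reported address.
KERNEL_SEGMENT_BASE = 0xC0000000


def faulting_linear_address(reg_value, base=KERNEL_SEGMENT_BASE):
    """Address the CPU would fault on when dereferencing reg_value."""
    return (reg_value + base) & 0xFFFFFFFF


def is_single_bit(value):
    """True if exactly one bit is set in value."""
    return value != 0 and (value & (value - 1)) == 0


if __name__ == "__main__":
    eax = 0x04000000  # same value in both register dumps
    print(hex(faulting_linear_address(eax)))  # 0xc4000000
    print(is_single_bit(eax))                 # True
    print(eax // (1024 * 1024), "MB")         # 64 MB
```

Under that assumption the pointer being followed in get_hash_table is
04000000, i.e. the 64 MB mark, with exactly one bit set. That narrows down
which value is bad, but does not by itself say whether memory, the disk, or
the kernel is at fault.
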
> These things come in really fast with 2.0.29 too.
>
> It seems to happen often when I don't have swap (single user mode...)
> and can crash while fsck'ing. At this stage there is still more than
> 10M free.
>
> # This config has EVERYTHING in it, it's my laptop, so I generally don't mind
> # if it dies on me. I want a perfect world and a stable kernel tho ;)
> # Automatically generated by make menuconfig: don't edit
> #
>
> #
> # Code maturity level options
> #
> CONFIG_EXPERIMENTAL=y
>
> #
> # Loadable module support
> #
> CONFIG_MODULES=y
> CONFIG_MODVERSIONS=y
> CONFIG_KERNELD=y
>
> #
> # General setup
> #
> # CONFIG_MATH_EMULATION is not set
> CONFIG_NET=y
> # CONFIG_MAX_16M is not set
> CONFIG_PCI=y
> CONFIG_PCI_OPTIMIZE=y
> CONFIG_SYSVIPC=y
> CONFIG_BINFMT_AOUT=y
> CONFIG_BINFMT_ELF=y
> # CONFIG_BINFMT_JAVA is not set
> CONFIG_KERNEL_ELF=y
> CONFIG_M586=y
>
> #
> # Floppy, IDE, and other block devices
> #
> CONFIG_BLK_DEV_FD=y
> CONFIG_BLK_DEV_IDE=y
> # CONFIG_BLK_DEV_HD_IDE is not set
> CONFIG_BLK_DEV_IDECD=y
> # CONFIG_BLK_DEV_IDETAPE is not set
> CONFIG_BLK_DEV_IDE_PCMCIA=y
> CONFIG_BLK_DEV_CMD640=y
> CONFIG_BLK_DEV_CMD640_ENHANCED=y
> CONFIG_BLK_DEV_RZ1000=y
> CONFIG_BLK_DEV_TRITON=y
> # CONFIG_IDE_CHIPSETS is not set
> CONFIG_BLK_DEV_LOOP=y
> CONFIG_BLK_DEV_MD=y
> CONFIG_MD_LINEAR=y
> CONFIG_MD_STRIPED=y
> # CONFIG_BLK_DEV_RAM is not set
> # CONFIG_BLK_DEV_XD is not set
> # CONFIG_BLK_DEV_HD is not set
>
> #
> # Networking options
> #
> # CONFIG_FIREWALL is not set
> CONFIG_NET_ALIAS=y
> CONFIG_INET=y
> CONFIG_IP_FORWARD=y
> CONFIG_IP_MULTICAST=y
> # CONFIG_IP_ACCT is not set
> # CONFIG_IP_ROUTER is not set
> CONFIG_NET_IPIP=y
> # CONFIG_IP_MROUTE is not set
> CONFIG_IP_ALIAS=y
> # CONFIG_INET_PCTCP is not set
> # CONFIG_INET_RARP is not set
> # CONFIG_NO_PATH_MTU_DISCOVERY is not set
> CONFIG_IP_NOSR=y
> CONFIG_SKB_LARGE=y
> # CONFIG_IPX is not set
> # CONFIG_ATALK is not set
> # CONFIG_AX25 is not set
> # CONFIG_BRIDGE is not set
> # CONFIG_NETLINK is not set
>
> #
> # SCSI support
> #
> # CONFIG_SCSI is not set
>
> #
> # Network device support
> #
> CONFIG_NETDEVICES=y
> CONFIG_DUMMY=y
> # CONFIG_EQUALIZER is not set
> # CONFIG_DLCI is not set
> # CONFIG_PLIP is not set
> CONFIG_PPP=y
> CONFIG_SLIP=y
> CONFIG_SLIP_COMPRESSED=y
> # CONFIG_SLIP_SMART is not set
> # CONFIG_SLIP_MODE_SLIP6 is not set
> # CONFIG_NET_RADIO is not set
> # CONFIG_NET_ETHERNET is not set
> # CONFIG_TR is not set
> # CONFIG_FDDI is not set
> # CONFIG_ARCNET is not set
>
> #
> # ISDN subsystem
> #
> # CONFIG_ISDN is not set
>
> #
> # CD-ROM drivers (not for SCSI or IDE/ATAPI drives)
> #
> # CONFIG_CD_NO_IDESCSI is not set
>
> #
> # Filesystems
> #
> CONFIG_QUOTA=y
> # CONFIG_LOCK_MANDATORY is not set
> CONFIG_MINIX_FS=y
> # CONFIG_EXT_FS is not set
> CONFIG_EXT2_FS=y
> # CONFIG_XIA_FS is not set
> CONFIG_FAT_FS=y
> CONFIG_MSDOS_FS=y
> CONFIG_VFAT_FS=y
> # CONFIG_UMSDOS_FS is not set
> CONFIG_PROC_FS=y
> CONFIG_NFS_FS=y
> # CONFIG_ROOT_NFS is not set
> # CONFIG_SMB_FS is not set
> CONFIG_ISO9660_FS=y
> # CONFIG_HPFS_FS is not set
> # CONFIG_SYSV_FS is not set
> # CONFIG_AFFS_FS is not set
> # CONFIG_UFS_FS is not set
>
> #
> # Character devices
> #
> CONFIG_SERIAL=y
> # CONFIG_DIGI is not set
> # CONFIG_CYCLADES is not set
> # CONFIG_STALDRV is not set
> # CONFIG_RISCOM8 is not set
> CONFIG_PRINTER=y
> CONFIG_MOUSE=y
> # CONFIG_ATIXL_BUSMOUSE is not set
> # CONFIG_BUSMOUSE is not set
> # CONFIG_MS_BUSMOUSE is not set
> CONFIG_PSMOUSE=y
> # CONFIG_82C710_MOUSE is not set
> # CONFIG_UMISC is not set
> # CONFIG_QIC02_TAPE is not set
> # CONFIG_FTAPE is not set
> CONFIG_APM=y
> # CONFIG_APM_IGNORE_USER_SUSPEND is not set
> CONFIG_APM_DO_ENABLE=y
> CONFIG_APM_CPU_IDLE=y
> CONFIG_APM_DISPLAY_BLANK=y
> CONFIG_APM_POWER_OFF=y
> # CONFIG_WATCHDOG is not set
> # CONFIG_RTC is not set
>
> #
> # Sound
> #
> CONFIG_SOUND=y
> # CONFIG_PAS is not set
> CONFIG_SB=y
> CONFIG_ADLIB=y
> # CONFIG_GUS is not set
> # CONFIG_MPU401 is not set
> # CONFIG_UART6850 is not set
> # CONFIG_PSS is not set
> # CONFIG_GUS16 is not set
> # CONFIG_GUSMAX is not set
> CONFIG_MSS=y
> # CONFIG_SSCAPE is not set
> # CONFIG_TRIX is not set
> # CONFIG_MAD16 is not set
> # CONFIG_CS4232 is not set
> # CONFIG_MAUI is not set
> CONFIG_AUDIO=y
> CONFIG_MIDI=y
> CONFIG_YM3812=y
> SBC_BASE=220
> SBC_IRQ=7
> SBC_DMA=1
> SB_DMA2=5
> SB_MPU_BASE=0
> SB_MPU_IRQ=-1
> MSS_BASE=530
> MSS_IRQ=11
> MSS_DMA=3
> DSP_BUFFSIZE=65536
> # CONFIG_LOWLEVEL_SOUND is not set
>
> #
> # Kernel hacking
> #
> # CONFIG_PROFILE is not set
>
> So. Any ideas? Anyone having a similar problem?
>
> Oskar
>
> =============================================
> 'experience made art, but inexperience luck.'
> =============================================
>
>
> --== Chris Orem ==--
> --== SYS-ADMIN of Purplenet.net ==--
> --== Keep Your Morals off my Body ==--
> --== Key fingerprint = F98141B1EFBAE07A E21F90A4F300A696 ==--
>
>

Oskar

=============================================
'experience made art, but inexperience luck.'
=============================================