Re: New ide + SMP oops (was RE: SMP + RAID)

Rafael D'Halleweyn (rafael@village.uunet.be)
3 Oct 1999 20:15:29 GMT


>> I am building ide.2.2.12.19991001.patch now.

Andre, Tom, (and all the rest),

I had a problem with the HPT366 on an ABit BP6 sharing an IRQ with a tulip
ethernet card. I had already tried to swap some PCI cards around, but
since their are only 4 interrupts for 2 ethernets, AGP, sound (PCI), bttv,
usb and hpt, there will always be some cards sharing interrupts. The setup
were the hpt and the tulip shared an interrupt was the only functional
combination, except that it crashed after a while.

I had already given up on using the hpt on the bp6, but yesterday I tried
the ide.2.2.13pre14.19991001 patch on a 2.2.13pre14. I checked the
spinlock and the 0x43 -> 0x44, but they both seemed to be fixed already.

I am now running the resulting kernel. I've already had a hdparm -tT and a
tcpblast through the tulip running for some time, and it worked great. The
performance was what I expected.

> then started a bunch of dd's reading from both disks at the same time, and
> producing md5sum's of the output. Soon after, a kernel NULL pointer
> dereference OOPS'ed (attached)

> After this OOPS, everything ran very very slowly. I started getting messages
> on the console about once a second:
> hde: lost interrupt
> hdg: lost interrupt

Indeed, I have seen the same oops. But my system was idle, except for an
xmms (MP3 player).

I have one other small problem: hde (on the hpt) sometimes turns off dma.
This could be the result of a bad cable. The cable I am using is rather
long. But that brings up another problem: how can you connect all the
disks in a tower, when you can only use cables of 18 inch or 46cm?

Anyway, many thanks,

Raf.

Unable to handle kernel NULL pointer dereference at virtual address 00000004
Unable to handle kernel NULL pointer dereference at virtual address 00000004
current->tss.cr3 = 04d56000, %cr3 = 04d56000
current->tss.cr3 = 04d56000, %cr3 = 04d56000
*pde = 00000000
*pde = 00000000
Oops: 0000
CPU: 0
EIP: 0010:[ide_do_request+785/1472]
EFLAGS: 00010296
eax: 000001c0 ebx: c02463d4 ecx: 0000000e edx: 00000000
esi: c024b6c0 edi: c5eb5e74 ebp: 00000046 esp: c5eb5e38
ds: 0018 es: 0018 ss: 0018
Process find (pid: 9263, process nr: 107, stackpage=c5eb5000)
Stack: c7e14900 00004000 00000000 00000000 00000000 c01a8280 00000002 c024b684
00000000 c024b684 c019cbb9 c9fd4ea0 c5eb5e74 00000000 00000246 00000296
c019cbe4 c9fd4ea0 c0199d85 00000000 c012c0c6 c02463d4 0003a017 c9ab7e00
Call Trace: [do_rw_disk+0/704] [do_hwgroup_request+73/96]
[do_ide0_request+20/48] [unplug_device+69/96] [__wait_on_buffer+182/304]
[bread+80/112] [ext2_read_inode+258/1056]
[get_new_inode+156/320] [iget+120/144] [ext2_lookup+100/160]
[real_lookup+79/144] [lookup_dentry+272/432] [__namei+41/96]
[sys_newlstat+46/160] [system_call+52/56]
Code: 0f b6 6a 04 89 ef c1 ef 06 8b 46 40 89 44 24 24 fb 83 ff 01
hda: lost interrupt
hdb: lost interrupt
hda: lost interrupt
...

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/