Re: 2.6.19-mm1 (md/raid1 randomly drops partitions - possible sata_uli problem)

From: Rafael J. Wysocki
Date: Wed Dec 13 2006 - 07:09:34 EST


On Wednesday, 13 December 2006 01:53, Neil Brown wrote:
> On Tuesday December 12, rjw@xxxxxxx wrote:
> > >
> > > So when md writes to write out the superblock, to gets EIO... Odd that
> > > you aren't getting errors for normal writes.
> > >
> > > What devices are the md/raid1 built on?
> >
> > Sata drives, on sata_uli.
> >
> > > >
> > > > I'll try to reproduce it tomorrow and collect some more information.
> > >
> > > Thanks. More information is definitely better than less, so send over
> > > anything you can find.
> >
> > Okay, seems to be readily reproducible, dmesg output from the failing kernel
> > attached.
>
> Weird. You are getting silent write errors...
>
> Can you write to these drives are all? e.g.
>
> dd if=/dev/sdb3 of=/tmp/tmp count=1
> dd if=/tmp/tmp of=/dev/sdb3 oflag=direct
>
> (hopefully 'direct' will cause write errors to be passed up).

Unfortunately I have no access to the machine right now.

> I really think this looks like a sata problem, not an md problem.

That's possible, but everything except for the md RAID seems to work. Strange.

I think I'll wait until the next -mm is out and check if the problem goes away. ;-)

Greetings,
Rafael


--
If you don't have the time to read,
you don't have the time or the tools to write.
- Stephen King
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/