Re: 2.6.19-mm1 (md/raid1 randomly drops partitions)

From: Neil Brown
Date: Mon Dec 11 2006 - 18:55:26 EST


On Tuesday December 12, rjw@xxxxxxx wrote:
> On Monday, 11 December 2006 23:52, Neil Brown wrote:
> > On Monday December 11, rjw@xxxxxxx wrote:
> > > Hi,
> > >
> > > On Monday, 11 December 2006 09:58, Andrew Morton wrote:
> > > >
> > > > Temporarily at
> > > >
> > > > http://userweb.kernel.org/~akpm/2.6.19-mm1/
> > > >
> > > > Will appear later at
> > > >
> > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.19/2.6.19-mm1/
> > >
> > > It caused all of the md RAID1s on my test box to drop one of their partitions,
> > > apparently at random.
> >
> > That's clever....
> >
> > Do you have any kernel logs of this happening? My guess would be the
> > underlying device driver is returned more errors than before, but we
> > need the logs to be sure.
>
> I've only found lots of messages like this:
>
> md: super_written gets error=-5, uptodate=0

So when md writes to write out the superblock, to gets EIO... Odd that
you aren't getting errors for normal writes.

What devices are the md/raid1 built on?

>
> I'll try to reproduce it tomorrow and collect some more information.

Thanks. More information is definitely better than less, so send over
anything you can find.

NeilBrown
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/