Re: FYI: RAID5 unusably unstable through 2.6.14

From: Benjamin LaHaise
Date: Tue Jan 17 2006 - 14:41:47 EST


On Tue, Jan 17, 2006 at 01:35:46PM -0600, Cynbe ru Taren wrote:
> In principle, RAID5 should allow construction of a
> disk-based store which is considerably MORE reliable
> than any individual drive.
>
> In my experience, at least, using Linux RAID5 results
> in a disk storage system which is considerably LESS
> reliable than the underlying drives.

That is a function of how RAID5 works. A properly configured RAID5 array
will have a spare disk to take over in case one of the members fails, as
otherwise you run a serious risk of not being able to recover any data.

> What happens repeatedly, at least in my experience over
> a variety of boxes running a variety of 2.4 and 2.6
> Linux kernel releases, is that any transient I/O problem
> results in a critical mass of RAID5 drives being marked
> 'failed', at which point there is no longer any supported
> way of retrieving the data on the RAID5 device, even
> though the underlying drives are all fine, and the underlying
> data on those drives almost certainly intact.

Underlying disks should not be experiencing transient failures. Are you
sure the problem isn't with the disk controller you're building your array
on top of? At the very least any bug report requires that information to
be able to provide even a basic analysis of what is going wrong.

Personally, I am of the opinion that RAID5 should not be used by the
vast majority of people as the failure modes it entails are far too
complex for most people to cope with.

-ben
--
"You know, I've seen some crystals do some pretty trippy shit, man."
Don't Email: <dont@xxxxxxxxx>.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/