Re: [PATCH] [16/19] HWPOISON: Enable .remove_error_page for migration aware file systems

From: Wu Fengguang
Date: Tue Aug 11 2009 - 09:00:30 EST


On Tue, Aug 11, 2009 at 11:48:36AM +0800, Hidehiro Kawai wrote:
> Wu Fengguang wrote:
> > In fact we proposed a patch for preventing the re-corruption case, see
> >
> > http://lkml.org/lkml/2009/6/11/294
> >
> > However it is hard to answer the (policy) question "How sticky should
> > the EIO bit remain?".
>
> It's a good approach! This approach may also solve my concern,
> the re-corruption issue caused by transient IO errors.
>
> But I also think it needs a bit more consideration. For example,
> if the application has the valid data in the user space buffer,
> it would try to re-write it after detecting an IO error from the
> previous write. In this case, we should clear the sticky error flag.

Yes, and maybe more than that. The IO error issue really deserves
independent work of its own, which will inevitably involve a lot of
discussion with a lot of people.

For the data re-corruption problem, "vm.memory_failure_recovery = 0"
should be the cleanest workaround for now. Can we settle on that?
Our goal for this initial hwpoison implementation is to achieve good
coverage (not necessarily every possible case :).
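
For reference, a minimal sketch of how that workaround could be made
persistent. The file location is an assumption (distributions differ);
only the sysctl name and value come from the discussion above. With the
value 0 the kernel does not attempt recovery and panics on a memory
failure, so poisoned page cache data cannot be silently re-read from
disk afterwards.

```
# /etc/sysctl.conf fragment (path is an assumption, adjust for your distribution)
# 1: attempt recovery from memory failures (the default)
# 0: never attempt recovery, always panic on a memory failure
vm.memory_failure_recovery = 0
```

The same value can be set at run time by writing 0 to
/proc/sys/vm/memory_failure_recovery.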

Thanks,
Fengguang