Re: Known and unfixed active data loss bug in MM + XFS with large folios since Dec 2021 (any kernel from 6.1 upwards)

From: Christian Theune
Date: Tue Oct 01 2024 - 03:55:33 EST



> On 1. Oct 2024, at 02:56, Chris Mason <clm@xxxxxxxx> wrote:
>
> I've attached a minimal version of a script we use here to show all the
> D state processes, it might help explain things. The only problem is
> you have to actually ssh to the box and run it when you're stuck.

Thanks, I’ll dig into this next week when I’m back from vacation.

I can set up alerts for when this happens and hope that I’ll be fast enough, as the situation does seem to resolve itself at some point. It has happened quite a bit across the fleet, so I should be able to catch it.

Christian

--
Christian Theune · ct@xxxxxxxxxxxxxxx · +49 345 219401 0
Flying Circus Internet Operations GmbH · https://flyingcircus.io
Leipziger Str. 70/71 · 06108 Halle (Saale) · Deutschland
HR Stendal HRB 21169 · Geschäftsführer: Christian Theune, Christian Zagrodnick