Re: md raid6 oops in 6.6.4 stable

From: Genes Lists
Date: Thu Dec 07 2023 - 08:55:58 EST


On 12/7/23 08:30, Bagas Sanjaya wrote:
On Thu, Dec 07, 2023 at 08:10:04AM -0500, Genes Lists wrote:
I have not had chance to git bisect this but since it happened in stable I
thought it was important to share sooner than later.

One possibly relevant commit between 6.6.3 and 6.6.4 could be:

commit 2c975b0b8b11f1ffb1ed538609e2c89d8abf800e
Author: Song Liu <song@xxxxxxxxxx>
Date: Fri Nov 17 15:56:30 2023 -0800

md: fix bi_status reporting in md_end_clone_io

log attached shows page_fault_oops.
Machine was up for 3 days before crash happened.


Can you confirm that culprit by bisection?


That's the plan - however, turn around could be horribly slow if the average wait time to crash is of order a few days between each bisect.
Also machine is currently in use, so will need to deal with that as well. Will do my best.

Fingers crossed someone might just spot something in the meantime.

The commit mentioned above ensures underlying errors are not hidden, so it may simply have revealed some underlying issue and not be the actual 'culprit'.

thanks

gene