Re: [Bug 219609] File corruptions on SSD in 1st M.2 socket of AsRock X600M-STX + Ryzen 8700G
From: Dr. David Alan Gilbert
Date: Tue Jan 28 2025 - 07:53:18 EST
* Stefan (linux-kernel@xxxxxxx) wrote:
> Hi,
>
> Am 28.01.25 um 08:41 schrieb Christoph Hellwig:
> > So basically you need a specific board and a specific CPU, and only
> > one M.2 SSD in the two slots to reproduce it?
>
> more generally, it dependents on which PCIe devices are used. On my PC
> corruptions also disappear if I disable the ethernet controller in the BIOS.
>
> Furthermore it depends on transaction sizes (that's why older kernels
> work), IOMMU, sometimes on volatile write cache and partially on SSD
> type (which may have something to do with the former things).
Is there any characterisation of the corrupted data; last time I looked at the
bz there wasn't.
I mean, is it reliably any of:
a) What's the size of the corruption?
block, cache line, word, bit???
b) Position?
e.g. last word in a block or something?
c) Data?
pile of zero's/ff's junk/etc?
d) Is it a missed write, old data, or partially written block?
Dave
> > Puh. I'm kinda lost on what we could do about this on the Linux
> > side.
>
> Because it also depends on the CPU series, a firmware or hardware issue
> seems to be more likely than a Linux bug.
>
> ATM ASRock is still trying to reproduce the issue. (I'm in contact with
> them to. But they have Chinese new year holidays in Taiwan this week.)
>
> If they can't reproduce it, they have to provide an explanation why the
> issues are seen by so many users.
>
> Regards Stefan
>
>
--
-----Open up your eyes, open up your mind, open up your code -------
/ Dr. David Alan Gilbert | Running GNU/Linux | Happy \
\ dave @ treblig.org | | In Hex /
\ _________________________|_____ http://www.treblig.org |_______/