David Greaves wrote:Having retained this new configuration for a couple of days now I haven't had any problems.Tejun Heo wrote:How reproducible is the problem? Does the problem go away or occur moreI don't think there should be activity on the sda drive during resume
often if you change the drive you write the memory image to?
itself.
[I broke my / md mirror and am using some of that for swap/resume for now]
I did change the swap/resume device to sdd2 (different controller,
onboard sata_via) and there was no EH during resume. The system seemed
OK, wrote a few Gb of video and did a kernel compile.
I repeated this test, no EH during resume, no problems.
I even ran xfs_fsr, the defragment utility, to stress the fs.
I retain this configuration and try again tonight but it looks like
there _may_ be a link between EH during resume and my problems...
I was trying to say that I don't actually see any errors being handled in normal operation.Of course, I don't understand why it *should* EH during resume, it
doesn't during boot or normal operation...
EH occurs during boot, suspend and resume all the time. It just runs in
quiet mode to avoid disturbing the users too much. In your case, EH is
kicking in due to actual exception conditions so it's being verbose to
give clue about what's going on.
It's really weird tho. The PHY RDY status changed events are comingyes - but the erroring device which is not being used is on the same controller as the device with the in-use resume partition.
from the device which is NOT used while resuming
and it's before anyyes
actual PM events are triggered. Your kernel just boots, swsusp realizes
it's resuming and tries to read memory image from the swap device.
While reading, the disk controller raises consecutive PHY readinessYes, that's consistent with what I'm seeing.
changed interrupts. EH recovers them alright but the end result seems
to indicate that the loaded image is corrupt.
So, there's no device suspend/resume code involved at all. The kernelThat's kinda hard; swap and root are on different drives...
just booted and is trying to read data from the drive. Please try with
only the first drive attached and see what happens.