RE: [PATCH V3 1/1] nvme: Add quirk for LiteON CL1 devices running FW 220TQ,22001

From: Gloria Tsai
Date: Wed Oct 28 2020 - 22:20:51 EST


To correct the description of this bug: the SSD does not do GC after receiving the shutdown command.
The sequence is: GC runs before shutdown -> host deletes the I/O queues -> shutdown from host -> GC is interrupted -> D3hot -> device enters PS4 -> a block swap may occur -> firmware uses a wrong pointer in device SRAM -> over-program

The SSD enters low power mode only on suspend, and this issue happens only in low power mode.
With hibernate, power off, unbinding the driver, or even warm boot or EC boot, the SSD won't be in a low power state, so there will be no problem.
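
The condition described above can be sketched as a small C predicate. This is only an illustration, not the actual patch or any kernel API: the names `host_transition`, `fw_is_affected`, `enters_low_power`, and `cl1_bug_possible` are hypothetical, and the firmware revision is assumed to be passed as an already-trimmed string. The two revision strings are the ones from the subject line.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical list of the host transitions named in this mail. */
enum host_transition {
	HOST_SUSPEND,
	HOST_HIBERNATE,
	HOST_POWER_OFF,
	HOST_DRIVER_UNBIND,
	HOST_WARM_BOOT,
	HOST_EC_BOOT,
};

/* Firmware revisions taken from the patch subject. */
static const char *const affected_fw[] = { "220TQ", "22001" };

/* Assumption: fw_rev is a NUL-terminated string with padding removed. */
bool fw_is_affected(const char *fw_rev)
{
	size_t n = sizeof(affected_fw) / sizeof(affected_fw[0]);

	for (size_t i = 0; i < n; i++) {
		if (strcmp(fw_rev, affected_fw[i]) == 0)
			return true;
	}
	return false;
}

/* Per the description above, only suspend puts the SSD in low power mode. */
bool enters_low_power(enum host_transition t)
{
	return t == HOST_SUSPEND;
}

/* The bug needs both an affected firmware and a low power transition. */
bool cl1_bug_possible(const char *fw_rev, enum host_transition t)
{
	return fw_is_affected(fw_rev) && enters_low_power(t);
}
```

With this sketch, `cl1_bug_possible("220TQ", HOST_SUSPEND)` is true, while the same firmware with `HOST_HIBERNATE` or `HOST_POWER_OFF` is false.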


Regards,
Gloria Tsai
_____________________________________

Sales PM Division
Solid State Storage Technology Corporation
TEL: +886-3-612-3888 ext. 2201
E-Mail: gloria.tsai@xxxxxxxxx
_____________________________________

-----Original Message-----
From: Christoph Hellwig <hch@xxxxxx>
Sent: Thursday, October 29, 2020 1:17 AM
To: Jongpil Jung <jongpuls@xxxxxxxxx>
Cc: Keith Busch <kbusch@xxxxxxxxxx>; Jens Axboe <axboe@xxxxxx>; Christoph Hellwig <hch@xxxxxx>; Sagi Grimberg <sagi@xxxxxxxxxxx>; linux-nvme@xxxxxxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx; Gloria Tsai <Gloria.Tsai@xxxxxxxxx>; jongpil19.jung@xxxxxxxxxxx; jongheony.kim@xxxxxxxxxxx; dj54.sohn@xxxxxxxxxxx
Subject: Re: [PATCH V3 1/1] nvme: Add quirk for LiteON CL1 devices running FW 220TQ,22001





On Wed, Oct 28, 2020 at 06:14:21PM +0900, Jongpil Jung wrote:
> LiteON(SSSTC) CL1 device running FW 220TQ,22001 has bugs with simple
> suspend.
>
> When the NVMe device receives D3hot from the host, the NVMe firmware
> does garbage collection. While the device is doing garbage collection,
> the firmware may go to an incorrect address.

I'm still lost on what this means. The device doesn't receive 'D3hot'
from the host; it receives a shutdown request. And we don't just do that
when hibernating, but also when doing an actual shutdown, a suspend to
disk, unbinding the driver, etc. So if the device has a problem with
shutdowns we really are in trouble.