Re: [PATCH v4] PCI/ACPI: PCI/ACPI: Validate devices with power resources support D3

From: Lukas Wunner
Date: Thu Oct 27 2022 - 01:24:15 EST


On Wed, Oct 26, 2022 at 04:52:37PM -0500, Mario Limonciello wrote:
> Firmware typically advertises that ACPI devices that represent PCIe
> devices can support D3 by a combination of the value returned by
> _S0W as well as the HotPlugSupportInD3 _DSD [1].
>
> `acpi_pci_bridge_d3` looks for this combination but also contains
> an assumption that if an ACPI device contains power resources the PCIe
> device it's associated with can support D3. This was introduced
> from commit c6e331312ebf ("PCI/ACPI: Whitelist hotplug ports for
> D3 if power managed by ACPI").
>
> Some firmware configurations for "AMD Pink Sardine" do not support
> wake from D3 in _S0W for the ACPI device representing the PCIe root
> port used for tunneling. The PCIe device will still be opted into
> runtime PM in the kernel [2] because of the logic within
> `acpi_pci_bridge_d3`. This currently happens because the ACPI
> device contains power resources.

So put briefly, in acpi_pci_bridge_d3() we fail to take wake capabilities
into account and blindly assume that a bridge can be runtime suspended
to D3 if it is power-manageable by ACPI.

By moving the acpi_pci_power_manageable() below the wake capabilities
checks, we avoid runtime suspending a bridge that is not wakeup capable.

The more verbose explanation in the commit message is useful to
understand how the issue was exposed, but it somewhat obscures
the issue itself.


> When the thunderbolt driver is loaded two device links are created:
> * USB4 router <-> PCIe root port for tunneling
> * USB4 router <-> XHCI PCIe device

Those double arrows are a little misleading, a device link is
unidirectional, so it's really <-- and not <->.


> Currently runtime PM is allowed for all of these devices. This means that
> when all consumers are idle long enough, they will enter their deepest allowed
> sleep state. Once all consumers are in their deepest allowed sleep state the
> suppliers will enter the deepest sleep state as well.
>
> * The PCIe root port for tunneling doesn't support waking from D3hot or
> D3cold so it stays in D0.

Huh? I thought it's runtime suspended to D3hot even though it should stay
runtime resumed in D0 because it's not wakeup capable in D3hot?


> * The XHCI PCIe device supports wakeup from D3cold so it goes to D3cold.
> * Both consumers are in their deepest state and the USB4 router supports
> wakeup from D3cold, so it goes into this state.
>
> The expectation is the USB4 router should have also remained in D0 since
> the PCIe root port for tunneling remained in D0 and a device link exists
> between the two devices.

This paragraph sounds like the problem is the router runtime suspended.
IIUC the router could only runtime suspend because its consumer, the
Root Port, runtime suspended. By preventing the Root Port from runtime
suspending, you're implicitly preventing it's supplier (the router)
from suspending.


> Link: https://learn.microsoft.com/en-us/windows-hardware/drivers/pci/dsd-for-pcie-root-ports#identifying-pcie-root-ports-supporting-hot-plug-in-d3 [1]
> Link: https://github.com/torvalds/linux/blob/v6.1-rc1/drivers/pci/pcie/portdrv_pci.c#L126 [2]
> Link: https://github.com/torvalds/linux/blob/v6.1-rc1/drivers/thunderbolt/acpi.c#L29 [3]

I think git.kernel.org links are preferred to 3rd party hosting services.

Thanks,

Lukas