Re: Enumeration issue with QCA9005 AR9462
From: Bjorn Helgaas
Date: Tue Aug 21 2018 - 12:50:48 EST
[+cc Rajat, Ashok]
On Tue, Aug 21, 2018 at 09:25:41AM +0200, Lukas Wunner wrote:
> On Tue, Aug 21, 2018 at 07:47:04AM +0200, Lukas Wunner wrote:
> > On Mon, Aug 20, 2018 at 06:06:24PM -0500, Bjorn Helgaas wrote:
> > > mmyangfl@xxxxxxxxx reported a problem [1]: on v4.17, a QCA9005 AR9462
> > > wifi device was present at boot, but disappeared after suspend/resume.
> > >
> > > He also tested a recent kernel (5c60a7389d79, from Thu Aug 16),
> > > where the suspend/resume problem doesn't seem to happen, but the wifi
> > > device isn't enumerated correctly at boot-time.
> > >
> > > [ 0.928714] pciehp 0000:04:00.0:pcie204: Slot #0 AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug+ Surprise+ Interlock- NoCompl- LLActRep+
> > > [ 0.928752] pciehp 0000:04:00.0:pcie204: Slot(0-1): Card not present
> > > [ 0.928811] pciehp 0000:04:00.0:pcie204: Slot(0-1): Link Up
> > > [ 0.928815] pciehp 0000:04:00.0:pcie204: Slot(0-1): No adapter
> > >
> > > [1] https://bugzilla.kernel.org/show_bug.cgi?id=200839
> > > [2] https://bugzilla.kernel.org/attachment.cgi?id=277923
> >
> > The hardware appears to be broken in that the Presence Detect State bit
> > in the Slot Status register is 0 (Slot Empty) even though the slot is
> > occupied.
> [...]
> > Possible solutions:
> >
> > (a) Be lenient towards broken hardware and accept DLActive+ as a proxy
> > for PresDet+.
> >
> > (b) Add a blacklist to pciehp such that it doesn't bind to [1ae9:0200].
> > The bug reporter writes that "it's a single Half Mini PCIe card,
> > with two chipsets (Wil6110? + AR9462) combined by a PCIe hub".
> > This sounds like it's not really hotpluggable.
> > (Is Mini PCIe hotplug capable at all?)
> >
> > Let me go through the driver and see if (a) is feasible and how intrusive
> > it would be.
>
> So (a) would seem to be feasible, we could add a quirk for devices like
> this to call pciehp_check_link_active() in pciehp_get_adapter_status().
>
> Alternatively, we could generally add a call to pciehp_check_link_active()
> in get_adapter_status(), pciehp_check_presence() and pcie_init() and thus
> avoid a quirk for this specific device.
> The existing call in __pciehp_enable_slot() could actually be removed,
> this code path is only entered if either PDS or DLLLA is set.
>
> And the third option would be to add a quirk like quirk_hotplug_bridge()
> which sets is_hotplug_bridge = 0 on this broken device such that pciehp
> doesn't bind to it in the first place.
It sounds like with (a), you could make this work without having a
Wil6110-specific quirk, i.e., if the Link Status says the link is
active, we assume a device is present. That seems reasonable to me
and it sort of fits with these previous changes:
385895fef6b5 ("PCI: pciehp: Prioritize data-link event over presence detect")
e48f1b67f668 ("PCI: pciehp: Use link change notifications for hot-plug and removal")
Bjorn