Re: [PATCH 2/2] PCI: add CRS support to error handling path

From: Bjorn Helgaas
Date: Tue Sep 13 2016 - 17:47:23 EST


On Tue, Sep 13, 2016 at 05:04:49PM -0400, Sinan Kaya wrote:
> On 9/13/2016 4:01 PM, Bjorn Helgaas wrote:
> > On Thu, Sep 01, 2016 at 07:00:01PM -0400, Sinan Kaya wrote:
> >> The PCIe spec allows an endpoint device to extend its initialization time
> >> beyond 1 second by returning Configuration Request Retry Status (CRS) for a
> >> vendor ID read request.
> >>
> >> This basically means "I'm busy now, please call me back later".
> >>
> >> There are two moving parts to CRS support from the SW perspective. One part
> >> is to determine if CRS is supported or not. The second part is to set the
> >> CRS visibility register.
> >>
> >> As part of the probe, the Linux kernel handles both parts in the
> >> pci_enable_crs() function. The kernel also honors a returned CRS in
> >> pci_bus_read_dev_vendor_id() if CRS is supported. That function polls for
> >> up to a specified amount of time while the endpoint keeps returning CRS.
> >>
> >> The PCIe spec also allows CRS to be issued during cold, warm, hot and FLR
> >> resets.
> >>
> >> The hot reset is initiated by starting a secondary bus reset. This patch
> >> adds a vendor ID read immediately after a bus reset so that the
> >> initialization procedure can be extended by the amount of time the
> >> endpoint requires.
> >>
> >> Signed-off-by: Sinan Kaya <okaya@xxxxxxxxxxxxxx>
> >> ---
> >> drivers/pci/pci.c | 39 +++++++++++++++++++++++++++++++++++++++
> >> 1 file changed, 39 insertions(+)
> >>
> >> diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c
> >> index b209378..ebd0fc6 100644
> >> --- a/drivers/pci/pci.c
> >> +++ b/drivers/pci/pci.c
> >> @@ -3829,6 +3829,44 @@ static int pci_pm_reset(struct pci_dev *dev, int probe)
> >>  	return 0;
> >>  }
> >>
> >> +/*
> >> + * Mostly copy paste from pci_walk_bus with the exceptions of hard coded
> >> + * work and removed locks.
> >> + */
> >> +static void pci_bus_probe_crs(struct pci_bus *top)
> >> +{
> >> +	struct pci_dev *dev;
> >> +	struct pci_bus *bus;
> >> +	struct list_head *next;
> >> +	int retval;
> >> +	u32 l;
> >> +
> >> +	bus = top;
> >> +	next = top->devices.next;
> >> +	for (;;) {
> >> +		if (next == &bus->devices) {
> >> +			/* end of this bus, go up or finish */
> >> +			if (bus == top)
> >> +				break;
> >> +			next = bus->self->bus_list.next;
> >> +			bus = bus->self->bus;
> >> +			continue;
> >> +		}
> >> +		dev = list_entry(next, struct pci_dev, bus_list);
> >> +		if (dev->subordinate) {
> >> +			/* this is a pci-pci bridge, do its devices next */
> >> +			next = dev->subordinate->devices.next;
> >> +			bus = dev->subordinate;
> >> +		} else
> >> +			next = dev->bus_list.next;
> >> +
> >> +		retval = pci_bus_read_dev_vendor_id(dev->bus, dev->devfn, &l,
> >> +						    60 * 1000);
> >> +		if (retval)
> >> +			break;
> >> +	}
> >> +}
> >
> > Sigh. Man, this is ugly. Maybe we're locked into the current
> > strategy and don't really have a choice, but I really don't like it.
>
> I can add a locked version of the walkbus API.
> Then, I can minimize this code to a couple of lines. How does that sound?

I didn't mean that, I meant the whole idea of having to walk the whole
hierarchy and touch each device. It's sort of like we're enumerating
things, but not really, so this checking is kinda sorta parallel to
the enumeration path.

> > You mentioned several kinds of reset where CRS is allowed. Doesn't this
> > fix only one of them? I know we support at least FLR reset also.
>
> CRS applies to hot reset, warm reset and FLR. There is nothing we can do
> in SW for warm reset. This patch addresses the hot reset caused by SBR.
>
> I was hoping that Alex would help us for directions on the FLR reset later.

What sort of help from Alex were you hoping for? Is fixing the FLR
path harder than this one? If we're going to fix one path, I'd prefer
to fix them all at the same time rather than tripping over this again
later.
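
For reference, the polling behavior the commit message attributes to
pci_bus_read_dev_vendor_id can be sketched in plain userspace C. This is
only an illustration under stated assumptions, not the kernel code:
struct fake_dev, fake_read_vendor_id() and poll_vendor_id() are
hypothetical names, the device is simulated, and the doubling delay is
accumulated in a counter instead of actually sleeping. It assumes that,
with CRS Software Visibility enabled, a CRS completion for a vendor ID
read is seen by software as vendor ID 0x0001.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for an endpoint that answers with CRS for a while. */
struct fake_dev {
	int crs_reads_left;	/* reads that will still complete with CRS */
	uint32_t real_id;	/* device/vendor ID once the device is ready */
};

/*
 * Simulated config read of the vendor ID dword. Assumption: a CRS
 * completion shows up as vendor ID 0x0001 in the low 16 bits.
 */
static uint32_t fake_read_vendor_id(struct fake_dev *d)
{
	if (d->crs_reads_left > 0) {
		d->crs_reads_left--;
		return 0xffff0001u;
	}
	return d->real_id;
}

/*
 * Re-read the vendor ID with a doubling delay while the device keeps
 * returning CRS. Returns true once a real ID is read, false if the next
 * delay would exceed timeout_ms. *waited_ms is the total simulated delay.
 */
static bool poll_vendor_id(struct fake_dev *d, uint32_t *id,
			   int timeout_ms, int *waited_ms)
{
	int delay = 1;

	*waited_ms = 0;
	*id = fake_read_vendor_id(d);
	while ((*id & 0xffff) == 0x0001) {
		if (delay > timeout_ms)
			return false;	/* device appears stuck, give up */
		*waited_ms += delay;	/* real code would sleep here */
		delay *= 2;
		*id = fake_read_vendor_id(d);
	}
	return true;
}
```

A caller would fill in a struct fake_dev, call poll_vendor_id() with a
timeout such as 60 * 1000, and check the return value before trusting
the ID. Whatever the real paths end up looking like, the same loop would
be needed after any reset that permits CRS, which is the concern about
fixing SBR and FLR together.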