Re: [Xen-devel] [PATCH V3 1/2] Drivers/PCI: Export pcie_has_flr() interface
From: Bjorn Helgaas
Date: Fri Dec 15 2017 - 13:18:17 EST
[+cc Russell, Sinan, Herbert, Srikanth, Derek, Satanand, Felix, Raghu]
On Fri, Dec 15, 2017 at 09:48:02AM -0600, Govinda Tatti wrote:
> On 12/13/2017 3:24 PM, Bjorn Helgaas wrote:
> >On Wed, Dec 13, 2017 at 02:46:57PM -0600, Govinda Tatti wrote:
> >>>>>>-static bool pcie_has_flr(struct pci_dev *dev)
> >>>>>>+bool pcie_has_flr(struct pci_dev *dev)
> >>>>>> {
> >>>>>> u32 cap;
> >>>>>>@@ -3882,6 +3882,7 @@ static bool pcie_has_flr(struct pci_dev *dev)
> >>>>>> pcie_capability_read_dword(dev, PCI_EXP_DEVCAP, &cap);
> >>>>>> return cap & PCI_EXP_DEVCAP_FLR;
> >>>>>> }
> >>>>>>+EXPORT_SYMBOL_GPL(pcie_has_flr);
> >>>>>I'd rather change pcie_flr() so you could *always* call it, and
> >>>>>it would return 0, -ENOTTY, or whatever, based on whether FLR
> >>>>>is supported. Is that feasible?
> >>>>Sure, I will add pcie_has_flr() logic inside pcie_flr() and
> >>>>return appropriate values as suggested by you. Do we still want
> >>>>to retain pcie_has_flr() and its usage inside pci.c?.Otherwise,
> >>>>I will remove it and do required cleanup.
> >>>If you can restructure the code and remove pcie_has_flr() while
> >>>retaining the existing behavior of its callers, that would be
> >>>great.
> >>I checked the current usage of pcie_has_flr() and pcie_flr(). I
> >>have a couple of questions or need some clarification.
> >>
> >>1. pcie_has_flr() usage inside pci_probe_reset_function().
> >>
> >> This function is only calling pcie_has_flr() but not pcie_flr().
> >> Rest of the code is trying to do specific type of reset except
> >> pcie_flr().
> >>
> >> rc = pci_dev_specific_reset(dev, 1);
> >> if (rc != -ENOTTY)
> >> return rc;
> >> if (pcie_has_flr(dev))
> >> return 0;
> >> rc = pci_af_flr(dev, 1);
> >> if (rc != -ENOTTY)
> >> return rc;
> >>
> >> In other-words, I can remove usage of pcie_has_flr() in all
> >> other places in pci.c except in above function.
> >I think we should keep the EXPORT_SYMBOL_GPL() part of a60a2b73ba69
> >("PCI: Export pcie_flr()"), but revert the restructuring part.
> >
> >Prior to a60a2b73ba69, we had
> >
> > int pcie_flr(struct pci_dev *dev, int probe);
> >
> >like all the other reset methods. AFAICT, the addition of
> >pcie_has_flr() was to optimize the path slightly because when
> >drivers call pcie_flr(), they should already know that their
> >hardware supports FLR. But I don't think that optimization is
> >worth the extra code complexity. If we do need to optimize it, we
> >can check this in the core during enumeration and set
> >PCI_DEV_FLAGS_NO_FLR_RESET accordingly.
> Not all code paths are aware of FLR capability and also, not
> using pcie_flr(). For example,
>
> arch/powerpc/platforms/powernv/eeh-powernv.c
I assume you're referring to pnv_eeh_do_flr() (which contains code similar
to pcie_flr()) and pnv_eeh_do_af_flr() (which has code similar to
pci_af_flr()). I agree that those are problematic and would ideally be
unified with the PCI core implementations.
Powerpc has quite a bit of this sort of special-case code for several
reasons, some just historical and some more concrete, so I don't know how
feasible this is.
> drivers/crypto/cavium/nitrox/nitrox_main.c
This has nitrox_reset_device(), which should definitely be replaced with a
core interface.
> drivers/net/ethernet/cavium/liquidio/octeon_mailbox.c
And this has octeon_mbox_process_cmd() which also does a home-grown
PCI_EXP_DEVCTL_BCR_FLR request and also should definitely use a core
interface.
> So, we should consider one of these options.
>
> - set PCI_DEV_FLAGS_NO_FLR_RESET if it is not supported.
> - pcie_flr() should return if it is not supported
>
> If we modify pcie_flr() to return error codes, then we need to modify
> all existing modules that are calling this function.
Yes, of course.
> Please let me know your preference, so that I can move accordingly. Thanks.
I think Christoph volunteered to do some restructuring, but I don't
know his timeframe. If you can, I would probably wait for that
because there's so much overlap here.
The other paths that use PCI_EXP_DEVCTL_BCR_FLR are definitely issues
and should be fixed, but again should wait for the revised pcie_flr()
interface. And if they're not actually required for your Xen issue,
they sound like "nice to have" cleanups that will not gate your Xen
fixes. I added this to my ever-growing list of cleanups to do.
Bjorn