Re: [PATCH v4 06/17] PCI: add SIOV and IMS capability detection
From: Jason Gunthorpe
Date: Fri Nov 06 2020 - 12:51:42 EST
On Fri, Nov 06, 2020 at 08:48:50AM -0800, Raj, Ashok wrote:
> > The IMS flag belongs in the platform not in the devices.
>
> This support is mostly a SW thing right? we don't need to muck with
> platform/ACPI for that matter.
Something needs to tell the guest OS platform what to do, so you need
a place to put it.
Putting it in a per-device PCI cap is horrible and hacky from an
architectural perspective.
> I missed why ACPI tables should carry such information. If kernel doesn't
> want to support those devices its within kernel control. Which means kernel
> will only use the available MSIx interfaces. This is legacy support.
The platform flag tells the guest that it can (or can't) support IMS
*at all*
Primarily a guest would be blocked because the VMM provides no way for
the guest to create addr/data pairs.
Has nothing to do with individual devices.
> > The OS logic would be
> > - If no IMS information found then use IMS (Bare metal)
> > - If the IMS disable flag is found then
> > - If (future) hypercall available and the OS knows how to use it
> > then use IMS
> > - If no hypercall found, or no OS knowledge, fail IMS
> >
> > Our devices can use IMS even in a pure no-emulation
>
> This is true for IMS as well. But probably not implemented in the kernel as
> such. From a HW point of view (take idxd for instance) the facility is
> available to native OS as well. The early RFC supported this for native.
I can't follow what you are trying to say here.
Dave said the IMS cap was to indicate that the VMM supported emulation
of IMS so that the VMM can do the MSI addr/data translation as part of
the emulation.
I'm saying emulation will be too horrible for our devices that don't
require *any* emulation.
It is a bad architecture. The platform needs to handle this globally
for all devices, not special hacky emulations things custom made for
every device out there.
> Native devices can have both MSIx and IMS capability. But as I
> understand this isn't how we have partitioned things in SW today. We
> left IMS only for mdev's. And I agree this would be very useful.
That split is just some decision idxd did, we are thinking about doing
other things in our devices.
Jason