Re: [PATCH v1 1/8] vfio: Add VFIO_IOMMU_PASID_REQUEST(alloc/free)

From: Alex Williamson
Date: Tue Apr 07 2020 - 11:14:51 EST


On Tue, 7 Apr 2020 04:42:02 +0000
"Tian, Kevin" <kevin.tian@xxxxxxxxx> wrote:

> > From: Alex Williamson
> > Sent: Friday, April 3, 2020 11:14 PM
> >
> > On Fri, 3 Apr 2020 05:58:55 +0000
> > "Tian, Kevin" <kevin.tian@xxxxxxxxx> wrote:
> >
> > > > From: Alex Williamson <alex.williamson@xxxxxxxxxx>
> > > > Sent: Friday, April 3, 2020 1:50 AM
> > > >
> > > > On Sun, 22 Mar 2020 05:31:58 -0700
> > > > "Liu, Yi L" <yi.l.liu@xxxxxxxxx> wrote:
> > > >
> > > > > From: Liu Yi L <yi.l.liu@xxxxxxxxx>
> > > > >
> > > > > For a long time, devices have had only one DMA address space
> > > > > from the platform IOMMU's point of view. This is true for both
> > > > > bare metal and directed-access in virtualization environments.
> > > > > The reason is that the source ID of DMA in PCIe is the BDF
> > > > > (bus/dev/fnc ID), which results in only device-granularity DMA
> > > > > isolation. However, this is changing with the latest
> > > > > advancements in I/O technology. More and more platform vendors
> > > > > are utilizing the PCIe PASID TLP prefix in DMA requests, thus
> > > > > giving devices multiple DMA address spaces, identified by their
> > > > > individual PASIDs. For example, Shared Virtual Addressing (SVA,
> > > > > a.k.a. Shared Virtual Memory) lets a device access multiple
> > > > > process virtual address spaces by binding each virtual address
> > > > > space to a PASID, where the PASID is allocated in software and
> > > > > programmed into the device in a device-specific manner. Devices
> > > > > which support the PASID capability are called PASID-capable
> > > > > devices. If such devices are passed through to VMs, guest
> > > > > software is also able to bind guest process virtual address
> > > > > spaces on such devices. Therefore, the guest software could
> > > > > reuse the bare metal software programming model, which means
> > > > > guest software will also allocate PASIDs and program them into
> > > > > the device directly. This is a dangerous situation since it has
> > > > > potential PASID conflicts and unauthorized address space
> > > > > access. It would be safer to let the host intercept the guest
> > > > > software's PASID allocation. Thus PASIDs are managed
> > > > > system-wide.
> > > >
> > > > Providing an allocation interface only allows for collaborative usage
> > > > of PASIDs though. Do we have any ability to enforce PASID usage or can
> > > > a user spoof other PASIDs on the same BDF?
> > >
> > > A user can access only PASIDs allocated to itself, i.e. the specific IOASID
> > > set tied to its mm_struct.
> >
> > A user is only _supposed_ to access PASIDs allocated to itself. AIUI
> > the mm_struct is used for managing the pool of IOASIDs from which the
> > user may allocate that PASID. We also state that programming the PASID
> > into the device is device specific. Therefore, are we simply trusting
> > the user to use a PASID that's been allocated to them when they program
> > the device? If a user can program an arbitrary PASID into the device,
> > then what prevents them from attempting to access data from another
> > user via the device? I think I've asked this question before, so if
> > there's a previous explanation or spec section I need to review, please
> > point me to it. Thanks,
> >
>
> There are two scenarios:
>
> (1) for PF/VF, the iommu driver maintains an individual PASID table per
> BDF. Although the PASID namespace is global, the per-BDF PASID table
> contains valid entries only for those PASIDs which are allocated to the
> mm_struct. The user is free to program an arbitrary PASID into the
> assigned device, but using an invalid PASID simply triggers an iommu
> fault.
>
> (2) for mdev, multiple mdev instances share the same PASID table of
> the parent BDF. However, PASID programming is a privileged operation
> in multiplexing usage, thus must be mediated by mdev device driver.
> The mediation logic will guarantee that only allocated PASIDs are
> forwarded to the device.

Thanks, I was confused about multiple tenants sharing a BDF when PASID
programming to the device is device specific, and therefore not
something we can virtualize. However, the solution is device specific
virtualization via mdev. Thus, any time we're sharing a BDF between
tenants, we must virtualize the PASID programming and therefore it must
be an mdev device currently. If a tenant is the exclusive user of the
BDF, then no virtualization of the PASID programming is required. I
think it's clear now (again). Thanks,
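
To make the two cases concrete, here is a minimal C sketch of the
enforcement points described above. It is a toy model under stated
assumptions, not kernel code: MAX_PASID, struct pasid_table,
struct mdev_instance, iommu_translate() and mdev_program_pasid() are
all hypothetical names invented for illustration and do not correspond
to any actual IOMMU or VFIO interface.

```c
#include <stdbool.h>

#define MAX_PASID 16 /* tiny PASID space, for illustration only */

/* Hypothetical per-BDF PASID table: an entry is valid only if that
 * PASID was allocated to the owner of the assigned device. */
struct pasid_table {
    bool valid[MAX_PASID];
};

enum dma_result { DMA_OK, DMA_IOMMU_FAULT };

/* Scenario (1), PF/VF with an exclusive owner: the user may program any
 * PASID into the device, but a DMA request carrying a PASID that has no
 * valid entry in the per-BDF table faults in the IOMMU. */
enum dma_result iommu_translate(const struct pasid_table *tbl,
                                unsigned int pasid)
{
    if (pasid >= MAX_PASID || !tbl->valid[pasid])
        return DMA_IOMMU_FAULT;
    return DMA_OK;
}

/* Hypothetical mdev instance: records which PASIDs were allocated to
 * the tenant that owns this instance. */
struct mdev_instance {
    bool allocated[MAX_PASID];
};

/* Scenario (2), mdev instances sharing the parent BDF's PASID table:
 * the IOMMU cannot tell tenants apart, so the mediating driver must
 * refuse to forward a PASID that the tenant does not own.
 * Returns true if the PASID would be forwarded to the device. */
bool mdev_program_pasid(const struct mdev_instance *m, unsigned int pasid)
{
    return pasid < MAX_PASID && m->allocated[pasid];
}
```

In this model, a tenant with exclusive use of the BDF is contained by
iommu_translate() alone, whereas tenants sharing a BDF are contained
only by the check in mdev_program_pasid(), which is why the shared case
requires mediation.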

Alex