Re: [PATCH v3 2/3] uio_pci_generic: add MSI/MSI-X support
From: Michael S. Tsirkin
Date: Thu Oct 08 2015 - 10:18:38 EST
On Thu, Oct 08, 2015 at 04:20:12PM +0300, Avi Kivity wrote:
> On 10/08/2015 01:26 PM, Michael S. Tsirkin wrote:
> >On Thu, Oct 08, 2015 at 12:19:20PM +0300, Avi Kivity wrote:
> >>
> >>On 10/08/2015 11:32 AM, Michael S. Tsirkin wrote:
> >>>On Thu, Oct 08, 2015 at 08:33:45AM +0300, Avi Kivity wrote:
> >>>>On 08/10/15 00:05, Michael S. Tsirkin wrote:
> >>>>>On Wed, Oct 07, 2015 at 07:39:16PM +0300, Avi Kivity wrote:
> >>>>>>That's what I thought as well, but apparently adding msix support to the
> >>>>>>already insecure uio drivers is even worse.
> >>>>>I'm glad you finally agree what these drivers are doing is insecure.
> >>>>>
> >>>>>And basically kernel cares about security, no one wants to maintain insecure stuff.
> >>>>>
> >>>>>So you guys should think harder whether this code makes any sense upstream.
> >>>>You simply ignore everything I write, cherry-picking the word "insecure" as
> >>>>if it makes your point. That is very frustrating.
> >>>And I'm sorry about the frustration. I didn't intend to twist your
> >>>words. It's just that I had to spend literally hours trying to explain
> >>>that security matters in kernel, and all I was getting back was a
> >>>summary "there's no security issue because there are other way to
> >>>corrupt memory".
> >>The word security has several meanings. The primary meaning is "defense
> >>against a malicious attacker". In that sense, there is no added value at
> >>all, because the attacker is already root, and can already access all of
> >>kernel and user memory. Even if the attacker is not root, and just has
> >>access to a non-iommu-protected device, they can still DMA to and from any
> >>memory they like.
> >>
> >>This sense of the word however is irrelevant for this conversation; the user
> >>already gave up on it when they chose to use uio_pci_generic (either because
> >>they have no iommu, or because they need the extra performance).
> >>
> >>Do we agree that security, in the sense of defense against a malicious
> >>attacker, is irrelevant for this conversation?
> >No. uio_pci_generic currently can be used in a secure way in
> >a sense that it's protected againt malicious attacker,
> >assuming you bind it to a device that does not do DMA.
>
> The context of the conversation is dpdk, which only supports DMA.
>
> Do we agree that security, in the sense of defense against a malicious
> attacker, is irrelevant for this conversation, taking this under
> consideration?
DPDK has a mode which they call UIO which seems to require people to
disable security. I agree to that. It's unfortunate that naming it UIO
gives the whole infrastructure a bad name for security.
But for upstreaming purposes, this doesn't matter:
we don't merge single use interfaces into Linux,
and I do care about interfaces of the code I maintain
being useful in a secure way.
> >
> >>A secondary meaning is protection against inadvertent bugs. Yes, a faulty
> >>memory write that happens to land in the msix page, can cause a random
> >>memory word to be overwritten. But so can a faulty memory write into the
> >>rings, or the data structures that support virtual->physical translation,
> >>the data structures that describe the packets before translation, the memory
> >>allocator or pool. The patch extends the vulnerable surface, but by a
> >>negligible amount.
> >>
> >>>So I was glad when it looked like there's finally an agreement that yes,
> >>>there's value in validating userspace input and yes, it's insecure
> >>>not to do this.
> >>
> >>
> >>>>It is good practice to defend against root oopsing the kernel, but in some
> >>>>cases it cannot be achieved.
> >>>I originally included ways to fix issues that I pointed out, ranging
> >>>from harder to implement with more overhead but more secure to easier to
> >>>implement with less overhead but less secure. There didn't seem to be
> >>>an understanding that the issues are there at all, so I stopped doing
> >>>that - seemed like a waste of time.
> >>>
> >>>For example, will it kill your performance to reset devices cleanly, on
> >>>open and close,
> >>I don't recall this being mentioned at all.
> >http://mid.gmane.org/20151006005527-mutt-send-email-mst@xxxxxxxxxx
>
> Down at the moment for me.
Grep this discussion for "reset", you will find it.
> >But really, this is just off the top of my head.
> >These are all issues VFIO developers encountered
> >and fixed over the years. Go into that code, read it,
> >and you will discover the issues and the solutions.
>
> vfio is solving a different problem,
It's solving a bunch of problems, including various hardware quirks.
> the problem of security against a
> malicious attacker, one that I'm hoping to agree here that we aren't
> attempting to solve.
You aren't but you should. uio_pci_generic does do it - though using
legacy interrupts as it does it didn't need to do a lot. But it does
check e.g. interrupt mask support, and doesn't just rely on userspace to
DTRT.
> People have been happily using uio_pci_generic despite all those issues.
> All they were missing was msix support. You can't use that to force them to
> overhaul that driver, or to add a new subsystem to vfio.
By the way, there's a very simple way to make generic UIO useful on VFs.
Don't enable bus mastering, trigger a timer once a while.
> >
> >> It seems completely unrelated
> >>to a patch adding msix support to uio_pci_generic.
> >It isn't unrelated. It's because with MSIX patch you are enabling bus
> >mastering in kernel. So if you start device in a bad state it will
> >corrupt kernel memory.
>
> You are right, this patch can regress secure users.
>
> I'd be surprised if there are msix-capable pci devices that do not rely on
> DMA, though.
I would not be surprised at all. The PCI Express specification says:
MSI/MSI-X interrupt support, which is optional for PCI 3.0 devices, is
required for PCI Express devices.
> >
> >>> protect them from writes into MSI config, BAR registers
> >>>and related capablities etc etc?
> >>Obviously the userspace driver has to write to the BAR area.
> >>
> >>If you're talking about the BAR setup registers, yes there is some (tiny)
> >>value in that, but how is it related to this patch?
> >If you don't, moving BARs will move the MSI-X region and
> >protecting it won't help.
>
> Won't it just become invisible if you do?
>
> If userspace starts playing with BARs, you lost already, whether msix is
> enabled or not doesn't matter. It can shadow other BARs, for example.
Of the same device? Sure, but then you only break this device.
At least for PCI Express devices are all behind bridges so
they can't shadow each other BARs.
And for VFs, they don't shadow each other BARs IIRC.
> This is a general weakness of uio_pci_generic, not something exposed by this
> patch.
Patch enables MSIX. Thus the need to protect the MSI-X region. To protect it
you need to make sure it does not move around :)
> >
> >>Protecting the MSI area in the BARs _is_ related to the patch. I agree it
> >>adds value, if small.
> >>
> >>> And if not, why are you people wasting
> >>>time arguing about that?
> >>I you want to use your position as maintainer of uio_pci_generic to get
> >>people to overhaul the driver for you with unrelated changes, they will
> >>object. I can understand a maintainer pointing out the right way to do
> >>something rather than the wrong way. But piling on a list of unrelated
> >>features as prerequisites is, in my opinion, abuse.
> >I don't see them as unrelated. Basically you want to turn
> >uio_pci_generic into vfio/pci except without an IOMMU.
>
> That is not what we want. Simply adding msix support is sufficient for us.
> Everything else was piled on afterwards.
I'm not stopping you in any way. It's just not the kind of half-baked
interface we should include and support in the upstream kernel, IMHO.
> >You will need a
> >lot of VFIO code then. That will need a lot of work. You seem to blame
> >me for this but IMHO that's because patch author has chosen a wrong
> >approach.
> >
> >>Let me repeat that pci_uio_generic is already used for userspace drivers,
> >>with all the issues that you point out, for a long while now. These issues
> >>are not exposed by the requirement to use msix.
> >I answered this already. I don't agree with this.
>
> With the first sentence or the second? I'm trying really hard to understand
> the problem.
Enabling msix in kernel exposes additinonal issues.
> >
> >>You are not protecting the
> >>kernel in any way by blocking the patch, you are only protecting people with
> >>iommu-less configurations from using their hardware.
> >Because it's either this patch or nothing at all? I don't believe that.
> >Someone come along and write a better one.
>
> From your description, I can't imagine what the better patch looks like,
> except as a complete overhaul of uio_pci_generic.
I posted several suggestions over the last several days.
I agree it would be a large change to uio_pci_generic, so
I think a vfio extension would make more sense.
> Our requirement is to enable msix, not to pretend that it is secure while it
> allows DMA to any piece of memory in the system.
>
> >
> >>> The only thing I heard is that it's a hassle.
> >>>That's true (though if you follow my advice and try to share code with
> >>>vfio/pci you get a lot of this logic for free).
> >>My thinking was that vfio was for secure (in the "defense against malicious
> >>attackers" sense) while uio_pci_generic was, de-facto at least, for use by
> >>trusted users.
> >And some are using it in very broken ways. Yes. But now you want
> >to fix this in stone by tying a kernel/userspace interface
> >to their broken ways. I think that would be a mistake.
>
> At heart, the brokenness here is that you allow insecure DMA. No amount of
> changes will fix this.
>
> We need an interface for users that are prepared to give up kernel/user
> protection, either because they have no other choice, or because they have
> performance requirements that mandate it. What extra value does protecting
> the BARs against movement add? Nothing.
Sure. But I am not talking about this usecase. It's an unsupportable
mess, it does not matter for upstream API discussion. Yea, performance.
But where would you stop? Will you next ask me to merge code that
accesses userspace pointers without checking, because performance? I can
easily see DPDK finding a way to make device DMA into
current_thread_info()->flags to wake a CPU that does monitor on it,
instead of an interrupt, because performance. Voila, we now have to
keep struct task layout stable because it's part of userspace ABI. And
so on.
So yes, there's userspace doing crazy things, and we don't want
to break it, but we need to consider the needs of a non-crazy
userspace when we build APIs.
> >
> >>We are in the strange situation that the Alex is open to adding an insecure
> >>mode to vfio,
> >I don't find this strange. It seems to make sense. VFIO is
> >already used with DMA capable devices.
>
> It's strange to me because it's charter was for iommu-protected device
> assignment, while uio_pci_generic is for generic pci userspace.
I don't know where is the VFIO charter, but you can find the UIO charter
under Documentation. DPDK is not using it as designed.
>
> >
> >>while you object to a patch which does not change the security
> >>of uio_pci_generic in any way; it only makes it more usable at the cost of a
> >>tiny increase in the bug surface.
> >I don't agree with this either. This depends on the device.
> >
> >>> So it's an
> >>>understandable argument if you just need something that works, quickly.
> >>>But if it's such a stopgap hack, there's no need to insist on it
> >>>upstream.
> >>It is not more or less a hack than uio_pci_generic allowing DMA,
> >It doesn't. sysfs does.
> >
> >>or
> >>/dev/mem, or the module loading interface, or nommu kernels. Security is
> >>just one aspect of the kernel, not the only one.
> >>
> >>It's perfectly reasonable to taint the kernel when insecure DMA is enabled,
> >>and to allow the administrator to disable the interface completely. What I
> >>don't understand is why, given that the user allows DMA, we should prevent
> >>them from using MSIX in addition.
> >There's no need to prevent MSIX with or without DMA.
> >
> >But UIO uses sysfs for device access. So if we program MSIX we need to
> >extend sysfs to protect a ton of registers that are MSIX related from
> >the user, and do a bunch of setup and cleanup otherwise kernel will be
> >very confused.
> >
> >It might be surprising to you how many registers are MSIX related,
> >but it's true.
> >
>
> Userspace can already do all of these things, confusing the kernel. It
> simply doesn't, which is why everything works. It need only continue not to
> do so for everything to continue to work.
You make it sound as if you are enabling existing APIs for new hardware.
That's not what these patches do, it's a new API you are building,
and new drivers will have to be written to use it. And the API
as defined here has subtle gotchas. VFIO just solved most of them
already.
Instead of all this, I have a simple suggestion.
UIO spec says:
For cards that don't generate interrupts but need to be
polled, there is the possibility to set up a timer that
triggers the interrupt handler at configurable time intervals.
add code to set this up in uio_pci_generic if there's no interrupt, and
wake userspace. This will use existing code. This won't help the new
DPDK interrupt mode (from June 2015), but it helps use it on existing
systems with no new interfaces.
--
MST
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/