Re: mapping PCI registers with write combining (and PAT on x86)...

From: Roland Dreier
Date: Tue Dec 12 2006 - 19:12:42 EST


> So I think we may simplify this but there is pci_mmap_page_range. That
> already handles this for the architectures that currently support it.
> So it is probably the case the fbdev should be changed to use that.

Thanks... I was not aware of pci_mmap_page_range(), but that doesn't
seem to be quite the right interface. It uses vma->vm_pgoff to say
what to remap. A typical use for what I have in mind would be for a
userspace process to open a magic file and do mmap() at some
well-known offset (like 0), and have the kernel driver map the right
PCI registers into userspace, without userspace having to know what
offset to ask for.

This is especially important when the kernel has to handle picking a
"context" or "port" to avoid multiple userspace processes stepping on
each other.

And of course arch/i386/pci/i386.c has the following in its
pci_mmap_page_range() anyway:

/* Write-combine setting is ignored, it is changed via the mtrr
* interfaces on this platform.
*/

so the write_combine parameter is ignored...

> No one had any serious objections to my patches as they were. The actual
> problem was that the patches were incomplete. In particular if you
> mismatch page protections it is possible to get silent data corruption
> or processor crashes. So we need checks to ensure all mappings of
> a given page are using the same protections.
>
> To a certain extent I think adding those checks really is a strawman
> and should not stop the merge effort, because we have this feature and
> those possible bugs on other architectures right now and we don't have
> those checks. But I also think in the long term we need them, it just
> requires several days of going through the mm so we don't leave any
> path uncovered.

It does seem somewhat hard to make sure there aren't multiple mappings
of the same thing, and I'm not sure it's worth trying to avoid it. If
a device driver lets me mmap PCI memory with write-combining on, and
then (as root) I mmap raw PCI resources to get the same memory, whose
fault is it if things break?

I'm kind of an mm dummy but I don't even see a good way to avoid
multiple mappings like that.

- R.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/