Re: [PATCH] x86: Remove readq()/writeq() on 32-bit

From: Ingo Molnar
Date: Wed Apr 29 2009 - 07:57:31 EST



(Linus Cc:-ed)

* David Miller <davem@xxxxxxxxxxxxx> wrote:

> From: Roland Dreier <rdreier@xxxxxxxxx>
> Date: Tue, 28 Apr 2009 12:05:10 -0700
>
> > As discussed in <http://lkml.org/lkml/2009/4/19/164> and follow-ups,
> > readq()/writeq() for 32-bit x86 are implemented as two readl()/writel()
> > operations. This is not atomic (in the sense that another MMIO
> > operation from another CPU or thread can be done in the middle of the
> > two read/writes), and may not access the two halves of the register in
> > the correct order to work with hardware.
> >
> > Rather than silently providing a 32-bit fallback that leaves a
> > possibility for strange driver bugs, it's better to provide readq()
> > and writeq() only for 64-bit architectures, and have a compile failure
> > on 32-bit architectures that forces driver authors to think about what
> > the correct solution is.
> >
> > This essentially reverts 2c5643b1 ("x86: provide readq()/writeq() on
> > 32-bit too") and follow-on commits. If in the future someone wants to
> > provide a generic solution for all 32-bit architectures, that's great,
> > but there's not much point in providing (arguably broken)
> > implementations for only one architecture, since any portable driver
> > will have to implement fallbacks for other architectures anyway.
> >
> > Signed-off-by: Roland Dreier <rolandd@xxxxxxxxx>
>
> Acked-by: David S. Miller <davem@xxxxxxxxxxxxx>
[...]
> > We never seemed to reach closure on this. I would strongly
> > suggest merging something like this, and then if someone has a
> > grand plan to unify all fallbacks, we can add that when it shows
> > up. As it stands, the x86-32 situation is not progress towards
> > that grand unified plans, and does nothing that I can tell
> > beyond setting a trap for drivers.

I still have no particularly strong opinion on this - other the
reluctance i expressed already in the previous threads. My arguments
are not reflected (and not addressed) in the changelog AFAICS, so
let me repeat them here:

Firstly, it doesnt really matter in practice because such use is
very rare and non-spinlocked access to IO regions is even rarer.

What caused 2c5643b1 was that right now we have ugly per driver
#defines and inlines for readq/writeq. See:

git grep 'define.*readq' drivers/

for a rough estimation of what the current practices are. The 32-bit
wrapper we added 6 months ago is the obvious implementation on x86
and that it matches existing wrappers.

Atomicity of a 64-bit IO space access on 32-bit platforms, on an
unknown-bitness transport (it might even be a 64-bit PCI device
bridged over a 32-bit bridge) is obviously not guaranteed.

Not even 64-bit-on-64-bit is really guaranteed to be atomic. The
bitness here is what the CPU runs its _own_ code in (and how it
accesses its cached memory space) - it does not transform over to
the uncached IO bus.

So trying to suggest that 64-bit readq/writeq when running on a
64-bit kernel is somehow atomic (or can be made atomic) is really
wrong IMO. The 32-bit wrapper is simply the expression of how the
CPU would do a 64-bit access even in 64-bit mode anyway [if the
transport is 32-bit].

aligned 32-bit access can be assumed atomic to a certain degree by
virtue of PCI being 32 bit or better - but assuming any 64-bit
IO-space read/write atomicity is wrong on many levels.

Driver authors will have to think about it anyway _even on 64-bit_,
regardless of the existence of a 32-bit fallback.

A driver and hw _might_ be quirky and might require atomicity or
might define a different order of access ... but then _that_ driver
should become ugly, not all the others, right?

So my (slight) preference would be to keep the default generic
implementation and not make any atomicity guarantees - we never made
any. _If_ you want atomicity then provide a readq_atomic() /
writeq_atomic() facility, with various higher level checks that make
it sure that the IO transport is really atomic. (i dont see this
happening any time soon for anything else but some really rare
high-end IO transport.)

For the common case of there not being any atomicity assumption on
the driver case it should result in cleaner code. (assuming all
other 64-bit architectures implement a fallback too)

But ... i might be wrong about it, so i've Cc:-ed Linus who usually
has a rather strong opinion about IO APIs. I'll apply the patch if
Linus acks it (or Linus might take it straight out of email).

Thanks,

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/