-----Original Message----- From: Robin Murphy
[mailto:robin.murphy@xxxxxxx] Sent: Tuesday, November 21, 2017
12:50 AM To: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>; Eric
Yang <yu.yang_3@xxxxxxx>; iommu@xxxxxxxxxxxxxxxxxxxxxxxxxx Cc:
Daniel Borkmann <daniel@xxxxxxxxxxxxx>; Kees Cook <keescook@xxxxxxxxxxxx>; Geert Uytterhoeven
<geert+renesas@xxxxxxxxx>; Greg Kroah-Hartman
<gregkh@xxxxxxxxxxxxxxxxxxx>; linux- kernel@xxxxxxxxxxxxxxx; David
Miller <davem@xxxxxxxxxxxxx>; Al Viro <viro@xxxxxxxxxxxxxxxxxx>;
Andrey Ryabinin <aryabinin@xxxxxxxxxxxxx>; Andrew Morton
<akpm@xxxxxxxxxxxxxxxxxxxx>; Ingo Molnar <mingo@xxxxxxxxxx> Subject: Re: No check of the size passed to unmap_single in
swiotlb
On 20/11/17 16:26, Konrad Rzeszutek Wilk wrote:
On Mon, Nov 20, 2017 at 08:17:14AM +0000, Eric Yang wrote:our LS1043A 64bits ARM SOC, we found that the invoke of
Hi all,
Hi!
During debug a device only support 32bits DMA(Qualcomm Atheros
AP) in
dma_unmap_single - -> swiotlb_tbl_unmap_single will unmap the
passed "size" anyway even when the "size" is incorrect.
will be refilled by INVALID_PHYS_ADDR, and it will cause the bounce
If the size is larger than it should, the extra entries in
io_tlb_orig_addr array
buffer copy not happen when the one who really used the mis-freed
entries doing DMA data transfers, and this will cause further
unknow behaviors.
swiotlb_tbl_unmap_single, if it is larger than it deserves, just
Here we just fix it temporarily by adding a judge of the "size"
in the
unmap the right size only. Like the code:
as well?
Did the DMA debug API (CONFIG_DMA_API_DEBUG) help in figuring
this issue
*hwdev, phys_addr_t tlb_addr,
[yangyu@titan dash-lts]$ git diff ./lib/swiotlb.c diff --git a/lib/swiotlb.c b/lib/swiotlb.c index
ad1d2962d129..58c97ede9d78 100644 --- a/lib/swiotlb.c +++
b/lib/swiotlb.c @@ -591,7 +591,10 @@ void
swiotlb_tbl_unmap_single(struct device
but Is it useful to add some size check code to prevent real*/ for (i = index + nslots - 1; i >= index; i--) { io_tlb_list[i] = ++count; -
io_tlb_orig_addr[i] = INVALID_PHYS_ADDR; +
if(io_tlb_orig_addr[i] != orig_addr) +
printk("======size wrong, ally down ally down!===\n"); +
else + io_tlb_orig_addr[i] =
INVALID_PHYS_ADDR; } /* * Step 2: merge the returned slots with
the preceding slots,
Although pass a right size of DMA buffer is the responsibility
of the drivers,
damage happen?
There doesn't seem to be much good reason for SWIOTLB to be more
special than other DMA API backends, and not all of them have
enough internal state to be able to make such a check. It's also
not necessarily possible to "prevent damage" anyway - if a driver
does pass a bogus size for dma_unmap_single(..., DMA_FROM_DEVICE),
SWIOTLB might be able to keep itself internally consistent, but it
still can't prevent the arch code in the middle from invalidating
the wrong cache lines and potentially corrupting adjacent memory.
In short, trying to work around broken drivers is a much worse idea
than just fixing those drivers, and that's what we already have
dma-debug for.
Robin.
Hi Robin,
I agree that hacking kernel to fix broken drivers is not acceptable,
actually we spent days to fight driver supplier with this, they do
not want to change their code and want fix it directly in kernel.
I tried Dma-debug yesterday, it works very well, but I think only
the size mismatch check may not be enough for the map entry corrupt
situation, some run-time warning may be better when the real
corruption happen.
For most of the dma-api backend, the size mismatch may do no harm at
all, and even in SWIOTLB itself when the bounce buffer is not used,
the size mismatch do no harm either. In our case, the same buggy
driver works well when board has 2G DDR, but panic frequently in 4G
DDR because of the use of bounce buffer and these corrupted map
entries. it is hard to catch this kind of bugs, for when the
corruption happen, the kernel has all kind of reasons to panic, but
not even one may directly point to the real source.
Add the warning messages is a big convenience for figure this kind of
issues, at least to me and the AP driver supplier, such warnings may
save weeks of hard debug time.