Re: [PATCH v4 00/16] dma-mapping: migrate to physical address-based API

From: Leon Romanovsky
Date: Tue Sep 02 2025 - 05:30:05 EST


On Mon, Sep 01, 2025 at 07:23:02PM -0300, Jason Gunthorpe wrote:
> On Mon, Sep 01, 2025 at 11:47:59PM +0200, Marek Szyprowski wrote:
> > I would like to give those patches a try in linux-next, but in meantime
> > I tested it on my test farm and found a regression in dma_map_resource()
> > handling. Namely the dma_map_resource() is no longer possible with size
> > not aligned to kmalloc()'ed buffer, as dma_direct_map_phys() calls
> > dma_kmalloc_needs_bounce(),
>
> Hmm, it's this bit:
>
> capable = dma_capable(dev, dma_addr, size, !(attrs & DMA_ATTR_MMIO));
> if (unlikely(!capable) || dma_kmalloc_needs_bounce(dev, size, dir)) {
> if (is_swiotlb_active(dev) && !(attrs & DMA_ATTR_MMIO))
> return swiotlb_map(dev, phys, size, dir, attrs);
>
> goto err_overflow;
> }
>
> We shouldn't be checking dma_kmalloc_needs_bounce() on mmio as there
> is no cache flushing so the "dma safe alignment" for non-coherent DMA
> does not apply.
>
> Like you say looks good to me, and more of the surrouding code can be
> pulled in too, no sense in repeating the boolean logic:
>
> if (attrs & DMA_ATTR_MMIO) {
> dma_addr = phys;
> if (unlikely(!dma_capable(dev, dma_addr, size, false)))
> goto err_overflow;
> } else {
> dma_addr = phys_to_dma(dev, phys);
> if (unlikely(!dma_capable(dev, dma_addr, size, true)) ||

I tried to reuse same code as much as possible :(

> dma_kmalloc_needs_bounce(dev, size, dir)) {
> if (is_swiotlb_active(dev))
> return swiotlb_map(dev, phys, size, dir, attrs);
>
> goto err_overflow;
> }
> if (!dev_is_dma_coherent(dev) &&
> !(attrs & DMA_ATTR_SKIP_CPU_SYNC))
> arch_sync_dma_for_device(phys, size, dir);
> }

Like Jason wrote, but in diff format:

diff --git a/kernel/dma/direct.h b/kernel/dma/direct.h
index 92dbadcd3b2f..3f4792910604 100644
--- a/kernel/dma/direct.h
+++ b/kernel/dma/direct.h
@@ -85,7 +85,6 @@ static inline dma_addr_t dma_direct_map_phys(struct device *dev,
unsigned long attrs)
{
dma_addr_t dma_addr;
- bool capable;

if (is_swiotlb_force_bounce(dev)) {
if (attrs & DMA_ATTR_MMIO)
@@ -94,17 +93,19 @@ static inline dma_addr_t dma_direct_map_phys(struct device *dev,
return swiotlb_map(dev, phys, size, dir, attrs);
}

- if (attrs & DMA_ATTR_MMIO)
+ if (attrs & DMA_ATTR_MMIO) {
dma_addr = phys;
- else
+ if (unlikely(dma_capable(dev, dma_addr, size, false)))
+ goto err_overflow;
+ } else {
dma_addr = phys_to_dma(dev, phys);
+ if (unlikely(!dma_capable(dev, dma_addr, size, true)) ||
+ dma_kmalloc_needs_bounce(dev, size, dir)) {
+ if (is_swiotlb_active(dev))
+ return swiotlb_map(dev, phys, size, dir, attrs);

- capable = dma_capable(dev, dma_addr, size, !(attrs & DMA_ATTR_MMIO));
- if (unlikely(!capable) || dma_kmalloc_needs_bounce(dev, size, dir)) {
- if (is_swiotlb_active(dev) && !(attrs & DMA_ATTR_MMIO))
- return swiotlb_map(dev, phys, size, dir, attrs);
-
- goto err_overflow;
+ goto err_overflow;
+ }
}

if (!dev_is_dma_coherent(dev) &&


I created new tag with fixed code.
https://git.kernel.org/pub/scm/linux/kernel/git/leon/linux-rdma.git/tag/?h=dma-phys-Sep-2

Thanks

>
> Jason