Re: [PATCH v1 4/4] iommu/tegra: gart: Optimize map/unmap

From: Dmitry Osipenko
Date: Mon May 07 2018 - 13:38:41 EST

Next message: Paolo Bonzini: "Re: WARNING in __mutex_unlock_slowpath"
Previous message: Frederic Barrat: "Re: [PATCH v2 3/7] powerpc: use task_pid_nr() for TID allocation"
In reply to: Dmitry Osipenko: "Re: [PATCH v1 4/4] iommu/tegra: gart: Optimize map/unmap"
Next in thread: Joerg Roedel: "Re: [PATCH v1 4/4] iommu/tegra: gart: Optimize map/unmap"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 07.05.2018 18:51, Dmitry Osipenko wrote:

[snip]

> Secondly, the interesting part is that mapping / unmapping of a contiguous
> allocation (CMA using DMA API) is slower by ~50% then doing it for a sparse
> allocation (get_pages using bare IOMMU API). /I think/ it's a shortcoming of the
> arch/arm/mm/dma-mapping.c, which also suffers from other inflexibilities that
> Thierry faced recently. Though I haven't really tried to figure out what is the
> bottleneck yet and Thierry was going to re-write ARM's dma-mapping
> implementation anyway, I'll take a closer look at this issue a bit later.

Please scratch my accusation of ARM's dma-mapping, it's not the culprit at all.
I completely forgot that in a case of sparse allocation displays framebuffer
IOMMU mapping is "pinned" to the GART and hence it's not getting dynamically
mapped / unmapped during of my testing. I also forgot to set CPU freq governor
to "perfomance", that reduced 50% to 20% of the above perf difference. The rest
of the testing is unaffected, flushing after whole mapping is still much more
efficient than flushing after modification of each page entry. And yet again,
performance of sparse mapping is nearly the same as of contiguous mapping unless
sparse allocation is large and _very_ fragmented.

Next message: Paolo Bonzini: "Re: WARNING in __mutex_unlock_slowpath"
Previous message: Frederic Barrat: "Re: [PATCH v2 3/7] powerpc: use task_pid_nr() for TID allocation"
In reply to: Dmitry Osipenko: "Re: [PATCH v1 4/4] iommu/tegra: gart: Optimize map/unmap"
Next in thread: Joerg Roedel: "Re: [PATCH v1 4/4] iommu/tegra: gart: Optimize map/unmap"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]