[PATCH v3 0/7] iommu: Allow IOVA rcache range be configured
From: John Garry
Date: Tue Jun 01 2021 - 10:34:13 EST
For streaming DMA mappings involving an IOMMU and whose IOVA len regularly
exceeds the IOVA rcache upper limit (meaning that they are not cached),
performance can be reduced.
This is much more pronounced from commit 4e89dce72521 ("iommu/iova: Retry
from last rb tree node if iova search fails"), as discussed at [0].
IOVAs which cannot be cached are highly involved in the IOVA ageing issue,
as discussed at [1].
This series allows the IOVA rcache range be configured, so that we may
cache all IOVAs per domain, thus improving performance.
A new IOMMU group sysfs file is added - max_opt_dma_size - which is used
indirectly to configure the IOVA rcache range:
/sys/kernel/iommu_groups/X/max_opt_dma_size
This file is updated same as how the IOMMU group default domain type is
updated, i.e. must unbind the only device in the group first.
The inspiration here comes from block layer request queue sysfs
"optimal_io_size" file, in /sys/block/sdX/queue/optimal_io_size
Some figures for storage scenario (when increasing IOVA rcache range to
cover all DMA mapping sizes from the LLD):
v5.13-rc1 baseline: 1200K IOPS
With series: 1800K IOPS
All above are for IOMMU strict mode. Non-strict mode gives ~1800K IOPS in
all scenarios.
[0] https://lore.kernel.org/linux-iommu/20210129092120.1482-1-thunder.leizhen@xxxxxxxxxx/
[1] https://lore.kernel.org/linux-iommu/1607538189-237944-1-git-send-email-john.garry@xxxxxxxxxx/
Differences to v2:
- Drop DMA mapping API to allow LLD set set for now
- Update default domain immediately, instead of in reprobe
- Fix build warning
Differences to v1:
- Many
- Change method to not operate on a 'live' IOMMU domain:
- rather, force device driver to be re-probed once
dma_max_opt_size is set, and reconfig a new IOMMU group then
- Add iommu sysfs max_dma_opt_size file, and allow updating same as how
group type is changed
John Garry (7):
iommu: Reactor iommu_group_store_type()
iova: Allow rcache range upper limit to be flexible
iommu: Allow iommu_change_dev_def_domain() realloc default domain for
same type
iova: Add iova_domain_len_is_cached()
iova: Add init_iova_domain_ext()
iommu: Allow max opt DMA len be set for a group via sysfs
dma-iommu: Use init_iova_domain_ext() for IOVA domain init
drivers/iommu/dma-iommu.c | 17 +++-
drivers/iommu/iommu.c | 172 ++++++++++++++++++++++++++------------
drivers/iommu/iova.c | 63 +++++++++++---
include/linux/iommu.h | 6 ++
include/linux/iova.h | 21 ++++-
5 files changed, 210 insertions(+), 69 deletions(-)
--
2.26.2