Re: [PATCH v3 4/6] iommu: Factor iommu_iotlb_gather_is_disjoint() out
From: Will Deacon
Date: Tue Jun 15 2021 - 06:29:55 EST
On Fri, Jun 11, 2021 at 09:50:31AM -0700, Nadav Amit wrote:
>
>
> > On Jun 11, 2021, at 6:57 AM, Will Deacon <will@xxxxxxxxxx> wrote:
> >
> > On Mon, Jun 07, 2021 at 11:25:39AM -0700, Nadav Amit wrote:
> >> From: Nadav Amit <namit@xxxxxxxxxx>
> >>
> >> Refactor iommu_iotlb_gather_add_page() and factor out the logic that
> >> detects whether IOTLB gather range and a new range are disjoint. To be
> >> used by the next patch that implements different gathering logic for
> >> AMD.
> >>
> >> Cc: Joerg Roedel <joro@xxxxxxxxxx>
> >> Cc: Will Deacon <will@xxxxxxxxxx>
> >> Cc: Jiajun Cao <caojiajun@xxxxxxxxxx>
> >> Cc: Robin Murphy <robin.murphy@xxxxxxx>
> >> Cc: Lu Baolu <baolu.lu@xxxxxxxxxxxxxxx>
> >> Cc: iommu@xxxxxxxxxxxxxxxxxxxxxxxxxx
> >> Cc: linux-kernel@xxxxxxxxxxxxxxx>
> >> Signed-off-by: Nadav Amit <namit@xxxxxxxxxx>
> >> ---
> >> include/linux/iommu.h | 41 +++++++++++++++++++++++++++++++++--------
> >> 1 file changed, 33 insertions(+), 8 deletions(-)
> >
> > [...]
> >
> >> diff --git a/include/linux/iommu.h b/include/linux/iommu.h
> >> index f254c62f3720..b5a2bfc68fb0 100644
> >> --- a/include/linux/iommu.h
> >> +++ b/include/linux/iommu.h
> >> @@ -497,6 +497,28 @@ static inline void iommu_iotlb_sync(struct iommu_domain *domain,
> >> iommu_iotlb_gather_init(iotlb_gather);
> >> }
> >>
> >> +/**
> >> + * iommu_iotlb_gather_is_disjoint - Checks whether a new range is disjoint
> >> + *
> >> + * @gather: TLB gather data
> >> + * @iova: start of page to invalidate
> >> + * @size: size of page to invalidate
> >> + *
> >> + * Helper for IOMMU drivers to check whether a new range is and the gathered
> >> + * range are disjoint.
> >
> > I can't quite parse this. Delete the "is"?
>
> Indeed. Will do (I mean I will do ;-) )
>
> >
> >> For many IOMMUs, flushing the IOMMU in this case is
> >> + * better than merging the two, which might lead to unnecessary invalidations.
> >> + */
> >> +static inline
> >> +bool iommu_iotlb_gather_is_disjoint(struct iommu_iotlb_gather *gather,
> >> + unsigned long iova, size_t size)
> >> +{
> >> + unsigned long start = iova, end = start + size - 1;
> >> +
> >> + return gather->end != 0 &&
> >> + (end + 1 < gather->start || start > gather->end + 1);
> >> +}
> >> +
> >> +
> >> /**
> >> * iommu_iotlb_gather_add_range - Gather for address-based TLB invalidation
> >> * @gather: TLB gather data
> >> @@ -533,20 +555,16 @@ static inline void iommu_iotlb_gather_add_page(struct iommu_domain *domain,
> >> struct iommu_iotlb_gather *gather,
> >> unsigned long iova, size_t size)
> >> {
> >> - unsigned long start = iova, end = start + size - 1;
> >> -
> >> /*
> >> * If the new page is disjoint from the current range or is mapped at
> >> * a different granularity, then sync the TLB so that the gather
> >> * structure can be rewritten.
> >> */
> >> - if (gather->pgsize != size ||
> >> - end + 1 < gather->start || start > gather->end + 1) {
> >> - if (gather->pgsize)
> >> - iommu_iotlb_sync(domain, gather);
> >> - gather->pgsize = size;
> >> - }
> >> + if ((gather->pgsize && gather->pgsize != size) ||
> >> + iommu_iotlb_gather_is_disjoint(gather, iova, size))
> >> + iommu_iotlb_sync(domain, gather);
> >>
> >> + gather->pgsize = size;
> >
> > Why have you made this unconditional? I think it's ok, but just not sure
> > if it's necessary or not.
>
> In regard to gather->pgsize, this function had (and has) an
> invariant, in which gather->pgsize always represents the flushing
> granularity of its range. Arguably, “size" should never be
> zero, but lets assume for the matter of discussion that it might.
>
> If “size” equals to “gather->pgsize”, then the assignment in
> question has no impact.
>
> Otherwise, if “size” is non-zero, then iommu_iotlb_sync() would
> initialize the size and range (see iommu_iotlb_gather_init()),
> and the invariant is kept.
>
> Otherwise, “size” is zero, and “gather” already holds a range,
> so gather->pgsize is non-zero and
> (gather->pgsize && gather->pgsize != size) is true. Therefore,
> again, iommu_iotlb_sync() would be called and initialize the
> size.
>
> I think that this change makes the code much simpler to read.
> It probably has no performance impact as “gather” is probably
> cached and anyhow accessed shortly after.
Thanks. I was just interested in whether it had a functional impact (I don't
think it does) or whether it was just cleanup.
With the updated comment:
Acked-by: Will Deacon <will@xxxxxxxxxx>
Will