Re: [PATCH v8 05/29] iommu: Add a timeout parameter for PRQ response

From: Alex Williamson
Date: Mon Jun 03 2019 - 18:36:07 EST


On Sun, 26 May 2019 18:09:40 +0200
Eric Auger <eric.auger@xxxxxxxxxx> wrote:

> From: Jacob Pan <jacob.jun.pan@xxxxxxxxxxxxxxx>
>
> When an IO page request is processed outside IOMMU subsystem, response
> can be delayed or lost. Add a tunable setup parameter such that user can
> choose the timeout for IOMMU to track pending page requests.
>
> This timeout mechanism is a basic safety net which can be implemented in
> conjunction with credit based or device level page response exception
> handling.
>
> Signed-off-by: Jacob Pan <jacob.jun.pan@xxxxxxxxxxxxxxx>
> ---
> .../admin-guide/kernel-parameters.txt | 8 +++++
> drivers/iommu/iommu.c | 29 +++++++++++++++++++
> 2 files changed, 37 insertions(+)
>
> diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt
> index 138f6664b2e2..b43f0893d252 100644
> --- a/Documentation/admin-guide/kernel-parameters.txt
> +++ b/Documentation/admin-guide/kernel-parameters.txt
> @@ -1813,6 +1813,14 @@
> 1 - Bypass the IOMMU for DMA.
> unset - Use value of CONFIG_IOMMU_DEFAULT_PASSTHROUGH.
>
> + iommu.prq_timeout=
> + Timeout in seconds to wait for page response
> + of a pending page request.
> + Format: <integer>
> + Default: 10
> + 0 - no timeout tracking
> + 1 to 100 - allowed range
> +
> io7= [HW] IO7 for Marvel based alpha systems
> See comment before marvel_specify_io7 in
> arch/alpha/kernel/core_marvel.c.
> diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
> index 13b301cfb10f..64e87d56f471 100644
> --- a/drivers/iommu/iommu.c
> +++ b/drivers/iommu/iommu.c
> @@ -45,6 +45,19 @@ static unsigned int iommu_def_domain_type = IOMMU_DOMAIN_DMA;
> #endif
> static bool iommu_dma_strict __read_mostly = true;
>
> +/*
> + * Timeout to wait for page response of a pending page request. This is
> + * intended as a basic safty net in case a pending page request is not
> + * responded for an exceptionally long time. Device may also implement
> + * its own protection mechanism against this exception.
> + * Units are in jiffies with a range between 1 - 100 seconds equivalent.
> + * Default to 10 seconds.
> + * Setting 0 means no timeout tracking.
> + */
> +#define IOMMU_PAGE_RESPONSE_MAX_TIMEOUT (HZ * 100)
> +#define IOMMU_PAGE_RESPONSE_DEF_TIMEOUT (HZ * 10)
> +static unsigned long prq_timeout = IOMMU_PAGE_RESPONSE_DEF_TIMEOUT;
> +
> struct iommu_group {
> struct kobject kobj;
> struct kobject *devices_kobj;
> @@ -157,6 +170,22 @@ static int __init iommu_dma_setup(char *str)
> }
> early_param("iommu.strict", iommu_dma_setup);
>
> +static int __init iommu_set_prq_timeout(char *str)
> +{
> + unsigned long timeout;
> +
> + if (!str)
> + return -EINVAL;
> + timeout = simple_strtoul(str, NULL, 0);
> + timeout = timeout * HZ;
> + if (timeout > IOMMU_PAGE_RESPONSE_MAX_TIMEOUT)
> + return -EINVAL;
> + prq_timeout = timeout;
> +
> + return 0;
> +}
> +early_param("iommu.prq_timeout", iommu_set_prq_timeout);
> +
> static ssize_t iommu_group_attr_show(struct kobject *kobj,
> struct attribute *__attr, char *buf)
> {

It doesn't seem to make much sense to include this patch without also
including "iommu: handle page response timeout". Was that one lost?
Dropped? Lives elsewhere? Thanks,

Alex