Re: [PATCH] iommu/arm-smmu-v3: Fix event queue overflow acknowledgment

From: Robin Murphy
Date: Wed Mar 08 2023 - 08:21:08 EST


On 2023-03-08 09:20, Tomas Krcka wrote:
When an overflow occurs in the event queue, the SMMU toggles overflow
flag OVFLG in the PROD register.
The evtq thread is supposed to acknowledge the overflow flag by toggling
flag OVACKFLG in the CONS register, otherwise the overflow condition is
still active (OVFLG != OVACKFLG).

Currently the acknowledge register is toggled after clearing the event
queue but is never propagated to the hardware. It would be done next
time when executing evtq thread.

The SMMU still adds elements to the queue when the overflow condition is
active but any subsequent overflow information after clearing the event
queue will be lost.

This change keeps the SMMU in sync as it's expected by design.

If I've understood correctly, the upshot of this is that if the queue has overflowed once, become empty, then somehow goes from empty to full before we manage to consume a single event, we won't print the "events lost" message a second time.

Have you seen this happen in practice? TBH if the event queue ever overflows even once it's indicative that the system is hosed anyway, so it's not clear to me that there's any great loss of value in sometimes failing to repeat a warning for a chronic ongoing operational failure.

It could be argued that we have a subtle inconsistency between arm_smmu_evtq_thread() and arm_smmu_priq_thread() here, but the fact is that the Event queue and PRI queue *do* have different overflow behaviours, so it could equally be argued that inconsistency in the code helps reflect that. FWIW I can't say I have a strong preference either way.

Thanks,
Robin.

Signed-off-by: Tomas Krcka <krckatom@xxxxxxxxx>
Suggested-by: KarimAllah Ahmed <karahmed@xxxxxxxxx>
---
drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 1 +
1 file changed, 1 insertion(+)

diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
index f2425b0f0cd6..acc1ff5ff69b 100644
--- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
+++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
@@ -1579,6 +1579,7 @@ static irqreturn_t arm_smmu_evtq_thread(int irq, void *dev)
/* Sync our overflow flag, as we believe we're up to speed */
llq->cons = Q_OVF(llq->prod) | Q_WRP(llq, llq->cons) |
Q_IDX(llq, llq->cons);
+ queue_sync_cons_out(q);
return IRQ_HANDLED;
}