RE: [PATCH v3] dmaengine: xilinx_dma: Free descriptor lists in order

From: Radhey Shyam Pandey
Date: Tue May 10 2022 - 05:18:14 EST


> -----Original Message-----
> From: Nicholas Graumann <nick.graumann@xxxxxxxxx>
> Sent: Monday, April 11, 2022 7:34 PM
> To: Radhey Shyam Pandey <radheys@xxxxxxxxxx>; Appana Durga Kedareswara
> Rao <appanad@xxxxxxxxxx>; Harini Katakam <harinik@xxxxxxxxxx>; Vinod
> Koul <vkoul@xxxxxxxxxx>; Michal Simek <michals@xxxxxxxxxx>; Appana Durga
> Kedareswara Rao <appanad@xxxxxxxxxx>
> Cc: Nicholas Graumann <nick.graumann@xxxxxxxxx>;
> dmaengine@xxxxxxxxxxxxxxx; linux-arm-kernel@xxxxxxxxxxxxxxxxxxx; linux-
> kernel@xxxxxxxxxxxxxxx
> Subject: [PATCH v3] dmaengine: xilinx_dma: Free descriptor lists in order
>

Minor nit - It's first version. I think we need to spin a new version fixing it?

> If xilinx_dma_terminate_all is called while the AXI DMA is active, the following
> error messages might be seen upon restarting the DMA:
>
> [ 72.556254] xilinx_dma_irq_handler: Channel d053cb5b has errors 10, cdr
> 2c049f80 tdr 2c04a000
> [ 72.557370] xilinx_dma_irq_handler: Channel d053cb5b has errors 100, cdr
> 2c049f80 tdr 2c049f80
>
> From then on the AXI DMA won't process any more descriptors until the DMA
> channel is released and requested again.
>
> The following sequence of events is what causes this to happen:
>
> 1. Some descriptors are prepared with xilinx_dma_tx_submit (so they get
> added to pending_list).
> 2. The DMA is kicked off via call to xilinx_dma_tx_submit (the
> descriptors are moved to active_list).
> 3. While the transfer is active, another descriptor is prepared with
> xilinx_dma_tx_submit (so it goes onto the pending_list) 4. Before the
> transfers complete, xilinx_dma_terminate_all is called.
> That function resets the channel then calls
> xilinx_dma_free_descriptors to free the descriptors.
>
> At that point, pending_list contains a descriptor that is newer (and thus farther
> down the chain of descriptors) than the descriptors prepared in (1). However,
> it gets placed onto the free_seg_list before the older descriptors. From then
> on, the next pointers are no longer valid because the order of the descriptors
> in free_seg_list does not match the order in which the descriptors were
> allocated.
>
> To remedy this, the descriptor lists need to be freed in order from oldest to
> newest, otherwise segments could be added to the free segment list in a
> different order than they were created. This is not an issue for VDMA nor
> CDMA because the driver does not maintain a list of descriptors to free.
>
> Fixes: c0bba3a99f07 ("dmaengine: vdma: Add Support for Xilinx AXI Direct
> Memory Access Engine")
> Signed-off-by: Nicholas Graumann <nick.graumann@xxxxxxxxx>
> Reviewed-by: Radhey Shyam Pandey <radhey.shyam.pandey@xxxxxxxxxx>
Reviewed-by: Radhey Shyam Pandey <radhey.shyam.pandey@xxxxxxxxxx>

> ---
> drivers/dma/xilinx/xilinx_dma.c | 6 +++++-
> 1 file changed, 5 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/dma/xilinx/xilinx_dma.c b/drivers/dma/xilinx/xilinx_dma.c
> index 3ff9fa3d8cd5..3b435449cd0c 100644
> --- a/drivers/dma/xilinx/xilinx_dma.c
> +++ b/drivers/dma/xilinx/xilinx_dma.c
> @@ -884,9 +884,13 @@ static void xilinx_dma_free_descriptors(struct
> xilinx_dma_chan *chan)
>
> spin_lock_irqsave(&chan->lock, flags);
>
> - xilinx_dma_free_desc_list(chan, &chan->pending_list);
> + /*
> + * Descriptor lists must be freed from oldest to newest so that the
> + * order of free_seg_list is maintained.
> + */
> xilinx_dma_free_desc_list(chan, &chan->done_list);
> xilinx_dma_free_desc_list(chan, &chan->active_list);
> + xilinx_dma_free_desc_list(chan, &chan->pending_list);
>
> spin_unlock_irqrestore(&chan->lock, flags); }
> --
> 2.35.1