Re: [PATCH] fpga: zynqmp: Make word align the configuration data
From: Xu Yilun
Date: Sat Apr 01 2023 - 11:55:34 EST
On 2023-03-28 at 09:33:17 +0000, Manne, Nava kishore wrote:
> Hi Yilun,
>
> Please find my response inline.
>
> > -----Original Message-----
> > From: Xu Yilun <yilun.xu@xxxxxxxxx>
> > Sent: Saturday, March 18, 2023 2:55 PM
> > To: Manne, Nava kishore <nava.kishore.manne@xxxxxxx>
> > Cc: mdf@xxxxxxxxxx; hao.wu@xxxxxxxxx; trix@xxxxxxxxxx;
> > michal.simek@xxxxxxxxxx; linux-fpga@xxxxxxxxxxxxxxx; linux-arm-
> > kernel@xxxxxxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx
> > Subject: Re: [PATCH] fpga: zynqmp: Make word align the configuration data
> >
> > On 2023-03-14 at 15:12:22 +0530, Nava kishore Manne wrote:
> > > To avoid unwanted copies at firmware(PMUFW) this patch provides a fix
> >
> > The copy happens in firmware? Please help briefly describe the firmware
> > operations in commit message.
> >
>
> Yes, If the firmware receives unaligned Bitstream file from Linux to make them align
> it will do one more copy at firmware and this copy takes much time as firmware code
> runs on microblaze(32-bit processor and runs at lower frequency).
> So, we suggested the users to handle the alignment issues at top layers(Before submitting request to the firmware).
>
> Will update the description in v2.
>
> > > to align programmable logic(PL) configuration data if the data is not
> > > word-aligned. To align the configuration data this patch adds a few
> > > padding bytes and these additional padding bytes will not create any
> > > functional impact on the PL configuration.
> > >
> > > Signed-off-by: Nava kishore Manne <nava.kishore.manne@xxxxxxx>
> > > ---
> > > drivers/fpga/zynqmp-fpga.c | 15 +++++++++++++--
> > > 1 file changed, 13 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/drivers/fpga/zynqmp-fpga.c b/drivers/fpga/zynqmp-fpga.c
> > > index c60f20949c47..70a12dc6e15c 100644
> > > --- a/drivers/fpga/zynqmp-fpga.c
> > > +++ b/drivers/fpga/zynqmp-fpga.c
> > > @@ -15,6 +15,9 @@
> > > /* Constant Definitions */
> > > #define IXR_FPGA_DONE_MASK BIT(3)
> > >
> > > +#define DUMMY_PAD_BYTE 0xFF
> > > +#define FPGA_WORD_SIZE 4
> > > +
> > > /**
> > > * struct zynqmp_fpga_priv - Private data structure
> > > * @dev: Device data structure
> > > @@ -41,18 +44,26 @@ static int zynqmp_fpga_ops_write(struct
> > fpga_manager *mgr,
> > > const char *buf, size_t size)
> > > {
> > > struct zynqmp_fpga_priv *priv;
> > > + int word_align, ret, index;
> > > dma_addr_t dma_addr;
> > > u32 eemi_flags = 0;
> > > char *kbuf;
> > > - int ret;
> > >
> > > priv = mgr->priv;
> > > + word_align = size % FPGA_WORD_SIZE;
> > > + if (word_align)
> > > + word_align = FPGA_WORD_SIZE - word_align;
> > > +
> > > + size = size + word_align;
> >
> > Does the Macro ALIGN() help?
> >
>
> Will fix in v2.
>
> > >
> > > kbuf = dma_alloc_coherent(priv->dev, size, &dma_addr,
> > GFP_KERNEL);
> > > if (!kbuf)
> > > return -ENOMEM;
> > >
> > > - memcpy(kbuf, buf, size);
> >
> > This is historical, but why do the realloc & copy? Any better way?
> >
>
> Firmware internally uses the AXI DMA engine to transfer PL data from memory to the device
> and it supports only continues DMA-able memory access(It will not support scatter-gather memory access).
> So, this extra copy is needed to copy the data from kernel memory(allocated by the firmware subsystem using page allocators)
I see, but seems the issues are generic. Same issues for versal-fpga
1. fpga_mgr_buf_load_mapped(), fpgamanager_ops::write are intended for
contiguous (and maybe dma-safe) buffer but fpga_mgr_firmware_load()
breaks this. This may also affect spi backend drivers.
2. Some drivers needs dma addr rather than cpu addr, this could be
generally supported. But allocating a bounce buffer by
dma_alloc_coherent() is not a good idea.
3. fpga_manager_ops::write() may also be called for each fragment of a
sg buffer, could the firmware OK to handle this case?
I think framework refactoring is needed for these issues. I may not have
time to do the refactoring but will support these topics in first
priority.
Thanks,
Yilun
> to continues DMA-able memory.
>
> > > + for (index = 0; index < word_align; index++)
> > > + kbuf[index] = DUMMY_PAD_BYTE;
> > > +
> > > + memcpy(&kbuf[index], buf, size - index);
> >
> > Generally I object to massive copy in fpga_manager_ops::write if not
> > necessary. If there is an alignment requirement from HW, it should be
> > noticed to the caller in some way, before the buffer is created.
> >
>
> Agree, we should find a way to support this kind of use cases.
>
> Regards,
> Navakishore.
>