Am Mittwoch, dem 21.06.2023 um 23:00 +0800 schrieb Sui Jingfeng:
On 2023/6/21 18:00, Lucas Stach wrote:And that's exactly the point. If it ever turns into something more than
I need this to show that our hardware is truly dma-coherent!static inline enum dma_data_direction etnaviv_op_to_dma_dir(u32 op)Why do you need this? Isn't dma_sync_sgtable_for_cpu a no-op on your
@@ -369,6 +381,7 @@ int etnaviv_gem_cpu_prep(struct drm_gem_object *obj, u32 op,
{
struct etnaviv_gem_object *etnaviv_obj = to_etnaviv_bo(obj);
struct drm_device *dev = obj->dev;
+ struct etnaviv_drm_private *priv = dev->dev_private;
bool write = !!(op & ETNA_PREP_WRITE);
int ret;
@@ -395,7 +408,7 @@ int etnaviv_gem_cpu_prep(struct drm_gem_object *obj, u32 op,
return ret == 0 ? -ETIMEDOUT : ret;
}
- if (etnaviv_obj->flags & ETNA_BO_CACHED) {
+ if (!priv->dma_coherent && etnaviv_obj->flags & ETNA_BO_CACHED) {
platform when the device is coherent?
I have tested that the driver still works like a charm without adding
this code '!priv->dma_coherent'.
But I'm expressing the idea that a truly dma-coherent just device don't
need this.
I don't care if it is a no-op.
It is now, it may not in the future.
a no-op on your platform, then that's probably for a good reason and a
driver should not assume that it knows better than the DMA API
implementation what is or is not required on a specific platform to
make DMA work.
Even it is, the overhead of function call itself still get involved.cpu_prep/fini aren't total fast paths, you already synchronized with
the GPU here, potentially waiting for jobs to finish, etc. If your
platform no-ops this then the function call will be in the noise.
Also, we want to try flush the write buffer with the CPU manually.And if you want absolute correctness then calling dma_sync_sgtable_* is
Currently, we want the absolute correctness in the concept,
not only the rendering results.
the right thing to do, as it can do much more than just manage caches.
Right now it also provides SWIOTLB translation if needed.
Regards,
Lucas