On Wed, 11 Oct 2023 at 17:07, Christian König <christian.koenig@xxxxxxx> wrote:
>
> On 10.10.23 at 22:23, Dave Airlie wrote:
> > > I think we're then optimizing for different scenarios. Our compute
> > > driver will use mostly external objects only, and if shared, I don't
> > > foresee them bound to many VMs. What saves us currently here is that in
> > > compute mode we only really traverse the extobj list after a preempt
> > > fence wait, or when a vm is using a new context for the first time. So
> > > the vm's extobj list is pretty large. Each bo's vma list will typically
> > > be pretty small.
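
To make the shape of that concrete, here is a minimal sketch of the
bookkeeping being described: a per-VM extobj list of shared BOs, a short
per-BO vma list, and a revalidation walk that only runs after a
preempt-fence wait or on first use of a new context. All types and names
below are invented for illustration, this is not the actual drm_gpuvm
interface.

/*
 * Illustrative only: simplified, made-up types loosely modelled on the
 * scheme discussed above. Not the actual drm_gpuvm API.
 */
#include <linux/list.h>
#include <linux/types.h>

struct toy_bo {
	struct list_head vma_list;	/* VMAs mapping this BO (typically short) */
	struct list_head extobj_link;	/* link in the VM's extobj list, if shared */
	bool external;			/* shared with other VMs / other devices */
};

struct toy_vma {
	struct toy_bo *bo;
	u64 va_start, va_end;
	struct list_head bo_link;	/* link in bo->vma_list */
};

struct toy_vm {
	struct list_head extobj_list;	/* every external BO mapped in this VM */
};

/*
 * The point being made: this walk only happens after a preempt-fence wait
 * (or when a new context is used for the first time), so even a large
 * extobj list is traversed rarely in the compute case.
 */
static void toy_vm_revalidate_extobjs(struct toy_vm *vm)
{
	struct toy_bo *bo;

	list_for_each_entry(bo, &vm->extobj_list, extobj_link) {
		/* re-lock, re-validate and re-bind the shared BO here */
	}
}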
> > Can I ask why we are optimising for this userspace, this seems
> > incredibly broken.
> >
> > We've had this sort of problem in the past with Intel letting the tail
> > wag the dog, does anyone remember optimising relocations for a
> > userspace that didn't actually need to use relocations?
> >
> > We need to ask why this userspace is doing this, can we get some
> > pointers to it? A compute driver should have no reason to use mostly
> > external objects, the OpenCL and Level Zero APIs should be good enough
> > to figure this out.
> >
> > Dave.
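
As a purely hypothetical sketch of that last point: OpenCL (and Level Zero)
allocations are tied to a context that names its devices, so the UMD can
tell at allocation time whether a buffer can ever be visible to more than
one device, and only then ask for a shareable BO. The helper below is
invented for illustration, not any driver's actual policy.

#include <CL/cl.h>
#include <stdbool.h>

/*
 * All the information needed to classify an allocation is already at the
 * API level: the context (as created with clCreateContext()) says exactly
 * which devices can see a buffer.
 */
static bool buffer_must_be_shareable(cl_uint num_devices_in_context)
{
	/* Single-device context: no other device can ever map the buffer,
	 * so the kernel driver could keep it as a VM-private BO rather than
	 * an external object. */
	return num_devices_in_context > 1;
}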
> Well that is a pretty normal use case, AMD works the same way.
>
> In a multi GPU compute stack you have mostly all the data shared between
> different hardware devices.
>
> As I said before, looking at just the Vulkan use case is not a good idea
> at all.
>
> Christian.
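
For a concrete picture of why that setup produces mostly external objects:
every buffer both devices can touch is exported from one DRM device as a
dma-buf and imported into the other via PRIME, after which it is a shared
BO in both VMs. A rough userspace sketch using libdrm's PRIME helpers; the
function and handle names are invented and error handling is trimmed.

#include <stdint.h>
#include <xf86drm.h>

static int share_bo_between_gpus(int drm_fd_gpu0, int drm_fd_gpu1,
				 uint32_t gem_handle_gpu0,
				 uint32_t *gem_handle_gpu1)
{
	int dmabuf_fd;
	int ret;

	/* Export the BO from GPU0 as a dma-buf file descriptor. */
	ret = drmPrimeHandleToFD(drm_fd_gpu0, gem_handle_gpu0,
				 DRM_CLOEXEC | DRM_RDWR, &dmabuf_fd);
	if (ret)
		return ret;

	/* Import the same memory into GPU1; from now on it is an external
	 * object in GPU1's VM (and, once exported, effectively shared state
	 * for GPU0 as well). */
	return drmPrimeFDToHandle(drm_fd_gpu1, dmabuf_fd, gem_handle_gpu1);
}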
It's okay, I don't think anyone is doing that, some of these
use-cases are buried in server land and you guys don't communicate
them very well.

I'd hope multi-gpu compute would be moving towards HMM/SVM type
solutions though?

I'm also not into looking at use-cases that used to be important but
might not be as important going forward.
Dave.