Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure

From: Anthony Liguori
Date: Fri Sep 14 2007 - 17:45:28 EST

Next message: Anthony Liguori: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Previous message: Keshavamurthy, Anil S: "Re: [patch] Fix BIOS-e820 end address"
In reply to: Zachary Amsden: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Next in thread: Zachary Amsden: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Zachary Amsden wrote:

On Fri, 2007-09-14 at 16:02 -0500, Anthony Liguori wrote:

Jeremy Fitzhardinge wrote:

Anthony Liguori wrote:

This patch refactors the current hypercall infrastructure to better support live
migration and SMP. It eliminates the hypercall page by trapping the UD
exception that would occur if you used the wrong hypercall instruction for the
underlying architecture and replacing it with the right one lazily.

I guess it would be pretty rude/unlikely for these opcodes to get reused
in other implementations... But couldn't you make the page trap
instead, rather than relying on an instruction fault?

The whole point of using the instruction is to allow hypercalls to be used in many locations. This has the nice side effect of not requiring a central hypercall initialization routine in the guest to fetch the hypercall page. A PV driver can be completely independent of any other code provided that it restricts itself to it's hypercall namespace.

But if the instruction is architecture dependent, and you run on the
wrong architecture, now you have to patch many locations at fault time,
introducing some nasty runtime code / data cache overlap performance
problems. Granted, they go away eventually.

We're addressing that by blowing away the shadow cache and holding the big kvm lock to ensure SMP safety. Not a great thing to do from a performance perspective but the whole point of patching is that the cost is amortized.

I prefer the idea of a hypercall page, but not a central initialization.
Rather, a decentralized approach where PV drivers can detect using CPUID
which hypervisor is present, and a common MSR shared by all hypervisors
that provides the location of the hypercall page.

So then each module creates a hypercall page using this magic MSR and the hypervisor has to keep track of it so that it can appropriately change the page on migration. The page can only contain a single instruction or else it cannot be easily changed (or you have to be able to prevent the guest from being migrated while in the hypercall page).

We're really talking about identical models. Instead of an MSR, the #GP is what tells the hypervisor to update the instruction. The nice thing about this is that you don't have to keep track of all the current hypercall page locations in the hypervisor.

Regards,

Anthony Liguori

Zach

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Anthony Liguori: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Previous message: Keshavamurthy, Anil S: "Re: [patch] Fix BIOS-e820 end address"
In reply to: Zachary Amsden: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Next in thread: Zachary Amsden: "Re: [kvm-devel] [PATCH] Refactor hypercall infrastructure"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]