Re: + stupid-hack-to-make-mainline-build.patch added to -mm tree
From: Thomas Gleixner
Date: Thu Mar 08 2007 - 02:22:55 EST
On Wed, 2007-03-07 at 17:01 -0800, Daniel Arai wrote:
> Thomas Gleixner wrote:
>
> > You managed to avoid the usage of other code (i.e. PIT / HPET) already,
> > so why is it sooo desireable to emulate apics instead of substituting it
> > by a small and sane replacement ? Just because you happen to have an
> > LAPIC emulator ? That's no reason to wire yourself into the kernel code
> > and make it harder to change and maintain.
>
> There are several reasons why it's desirable to emulate the APIC. As you
> mentioned, we already have APIC emulation, and APIC emulation isn't a huge
> bottleneck on most workloads. Our code works, the Linux code works, and
> replacing both pieces of code with something "small and sane" isn't going to
> improve performance very much, so why bother? Any hypervisor implementation is
> going to be a tradeoff between what's easy to implement in the hypervisor,
> what's easy to implement in the guest operating system, and what's performance
> critical.
It is not about performance. It is about maintainability.
> Secondly, not all (para-)virtualized operating systems will want to use
> abstracted devices. Some virtual operating systems will be given direct access
> to hardware devices, and will need to run the actual driver for that device and
> not some abstracted device driver. So I don't buy your argument that every
> piece of the kernel that interacts with a paravirtualized driver should have a
> "small and sane replacement."
Err. We talk about paravirtualized Linux and not about what you have to
emulate to get Windows running. I don't care at all. Do you really
expect that we have to accept your design decisions, just because they
allow you to make your life easy ? This is exactly what you are using
paravirt ops for: a backdoor to throw your hackery at the kernel and
leave us with the mess of hardwired crap.
> But more importantly, we want a kernel that can run both on native hardware and
> in a paravirtualized environment. Linux doesn't really provide abstractions for
> replacing the appropriate code. We tried to hook into the source code at a
> level that seemed possible.
Again. You just refuse to change your implementation and you want to
keep it by arguing how hard it is because there are no abstractions.
I went through the business of creating abstractions into hardwired
hairballs twice. I know exactly what I'm talking about. It _IS_ hard
work, but at the end it makes the code better and more maintainable. You
do nothing for that, but expect that we live with your addons to the
hairball.
> There's no good way to override __send_IPI_shortcut. I suppose we could add
> paravirt ops for __send_IPI_shortcut and every other op that touches the APIC.
> But there are dozens of functions in apic.c that would need to be included in
> paravirt ops. And for our implementation, we really just want to override
> apic_read and apic_write, since we can make these faster when done through
> hypercalls than through memory accesses. If we were to make these paravirt ops,
> their implementations would be the same, except with a different apic_read and
> apic_write. This is a whole lot of useless code duplication.
No it is not. #include <linux/smp.h> is an abstraction and
__send_IPI ... is the i386 low level implementation.
You insist to hook yourself into the low level code instead of hooking
into the high level code, because it is _YOUR_ implementation and we
have to accept it as is.
This is the completely wrong way. We get the same crap and discussion
for every other architecture we are going to support with paravirt ops.
And probably for every other hypervisor implementation, which has a
different way of doing things.
> Most of the interrupt system is not written in such a way that multiple APICs
> implementations can be selected from at boot time. This is an absolute
> requirement so that the same kernel can boot on native and in a paravirtualized
> environment. While this could be implemented, it seems like a waste of time,
> since we can just emulate something similar to a real interrupt system and not
> change things very much.
Waste of your precious time. I'm working on low level code and
abstractions and from now on I have also to take care not to break
_YOUR_ implementation. You are going to waste _MY_ time and I'm going to
fight that forever.
Your prayer wheel argument of missing abstractions and easiness of
emulating things is annoying. If you think it is better to emulate APIC,
please emulate it without paravirt ops. If you want the speed
improvement, work with us to create the interfaces and abstractions
which are necessary to have a sane, maintainable and useful for all
hypervisors implementation.
tglx
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/