Re: [PATCH] KVM/x86: Increase max vcpu number to 352
From: Radim KrÄmÃÅ
Date: Tue Aug 15 2017 - 12:13:40 EST
(Missed this mail before my last reply.)
2017-08-15 10:10-0400, Konrad Rzeszutek Wilk:
> On Tue, Aug 15, 2017 at 11:00:04AM +0800, Lan Tianyu wrote:
> > On 2017å08æ12æ 03:35, Konrad Rzeszutek Wilk wrote:
> > > Migration with 352 CPUs all being busy dirtying memory and also poking
> > > at various I/O ports (say all of them dirtying the VGA) is no problem?
> >
> > This depends on what kind of workload is running during migration. I
> > think this may affect service down time since there maybe a lot of dirty
> > memory data to transfer after stopping vcpus. This also depends on how
> > user sets "migrate_set_downtime" for qemu. But I think increasing vcpus
> > will break migration function.
>
> OK, so let me take a step back.
>
> I see this nice 'supported' CPU count that is exposed in kvm module.
>
> Then there is QEMU throwing out a warning if you crank up the CPU count
> above that number.
I find the range between "recommended max" and "hard max" VCPU count
confusing at best ... IIUC, it was there because KVM internals had
problems with scaling and we will hit more in the future because some
loops still are linear on VCPU count.
The exposed value doesn't say whether migration will work, because that
is a userspace thing and we're not aware of bottlenecks on the KVM side.
> Red Hat's web-pages talk about CPU count as well.
>
> And I am assuming all of those are around what has been tested and
> what has shown to work. And one of those test-cases surely must
> be migration.
Right, Red Hat will only allow/support what it has tested, even if
upstream has a practically unlimited count. I think the upstream number
used to be raised by Red Hat, which is why upstream isn't at the hard
implementation limit ...
> Ergo, if the vCPU count increase will break migration, then it is
> a regression.
Raising the limit would not break existing guests, but I would rather
avoid adding higher VCPU count as a feature that disables migration.
> Or a fix/work needs to be done to support a higher CPU count for
> migrating?
Post-copy migration should handle higher CPU count and it is the default
fallback on QEMU. Asking the question on a userspace list would yield
better answers, though.
Thanks.