Re: [lkp-robot] [x86/mm] c4c3c3c2d0: will-it-scale.per_process_ops -61.0% regression

From: Andy Lutomirski
Date: Mon Oct 16 2017 - 21:06:53 EST

Next message: Liuwenliang (Lamb): "çå: [PATCH 00/11] KASan for arm"
Previous message: Shawn Lin: "Re: [PATCH v2 3/5] mmc: dw_mmc: Add locking to the CTO timer"
In reply to: Borislav Petkov: "Re: [lkp-robot] [x86/mm] c4c3c3c2d0: will-it-scale.per_process_ops -61.0% regression"
Next in thread: Markus Trippelsdorf: "Re: [lkp-robot] [x86/mm] c4c3c3c2d0: will-it-scale.per_process_ops -61.0% regression"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Mon, Oct 16, 2017 at 3:15 AM, Borislav Petkov <bp@xxxxxxxxx> wrote:
> On Mon, Oct 16, 2017 at 10:39:17AM +0800, kernel test robot wrote:
>>
>> Greeting,
>>
>> FYI, we noticed a -61.0% regression of will-it-scale.per_process_ops due to commit:
>>
>>
>> commit: c4c3c3c2d00826c88b5c02c20e80704664424b9b ("x86/mm: Flush more aggressively in lazy TLB mode")
>> url: https://github.com/0day-ci/linux/commits/Borislav-Petkov/x86-mm-Flush-more-aggressively-in-lazy-TLB-mode/20171011-115901
>>
>>
>> in testcase: will-it-scale
>> on test machine: 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
>
> Say what now?
>
> This is actually what got applied upstream:
>
> b956575bed91 ("x86/mm: Flush more aggressively in lazy TLB mode")
>
> and AFAICT, that machine is BDW and it should have PCID, right?
>
> Or wait, that's a guest so PCID is probably not even usable for guests.
> Or should we disable it in VMs?

PCID works on new versions of KVM, at least, depending on configuration.

On a PCID machine, with this patch applied, we are still switching CR3
when we go idle (which is presumably what we're hitting here) -- we're
just not flushing anything. The main cost seems to come from
serialization. On my laptop, I think I measured about 80 ns per
non-flushing CR3 load if I do it in a loop. The cost is larger when
it's not in a loop because the pipeline is fuller, I assume. We also
take a bit of a hit because switch_mm is a bit complex. I have a
patch here to try to optimize it:

https://git.kernel.org/pub/scm/linux/kernel/git/luto/linux.git/commit/?h=x86/fixes&id=1caf24d080dac8b9f952600d1e91879aa782131c

On a non-PCID machine, this patch will increase IPIs, which doesn't
seem to be what we're seeing.

The test in question is basically the same thing as a test I ran with
very little in the way of visible regression. I'm wondering if the
real problem is some NUMA oddity.

Xiaolong, can you send us /proc/cpuinfo on this kernel on the test
machine that's seeing this problem?

--Andy

Next message: Liuwenliang (Lamb): "çå: [PATCH 00/11] KASan for arm"
Previous message: Shawn Lin: "Re: [PATCH v2 3/5] mmc: dw_mmc: Add locking to the CTO timer"
In reply to: Borislav Petkov: "Re: [lkp-robot] [x86/mm] c4c3c3c2d0: will-it-scale.per_process_ops -61.0% regression"
Next in thread: Markus Trippelsdorf: "Re: [lkp-robot] [x86/mm] c4c3c3c2d0: will-it-scale.per_process_ops -61.0% regression"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]