Re: [PATCH 00/27] Latest numa/core release, v16

From: Ingo Molnar
Date: Tue Nov 20 2012 - 02:16:59 EST



* Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> wrote:

> On Mon, Nov 19, 2012 at 12:36 PM, Ingo Molnar <mingo@xxxxxxxxxx> wrote:
> >
> > Hugepages is a must for most forms of NUMA/HPC. This alone
> > questions the relevance of most of your prior numa/core testing
> > results. I now have to strongly dispute your other conclusions
> > as well.
>
> Ingo, stop doing this kind of crap.
>
> Let's make it clear: if the NUMA patches continue to regress
> performance for reasonable loads (and that very much includes
> "no THP") then they won't be merged.
>
> You seem to be in total denial. Every time Mel sends out
> results that show that your patches MAKE PERFORMANCE WORSE you
> blame Mel, or blame the load, and never seem to admit that
> performance got worse.

No doubt numa/core should not regress with THP off or on and
I'll fix that.

As a background, here's how SPECjbb gets slower on mainline
(v3.7-rc6) if you boot Mel's kernel config and turn THP forcibly
off:

(avg: 502395 ops/sec)
(avg: 505902 ops/sec)
(avg: 509271 ops/sec)

# echo never > /sys/kernel/mm/transparent_hugepage/enabled

(avg: 376989 ops/sec)
(avg: 379463 ops/sec)
(avg: 378131 ops/sec)

A ~30% slowdown.

[ How do I know? I asked for Mel's kernel config days ago and
actually booted Mel's very config in the past few days,
spending hours on testing it on 4 separate NUMA systems,
trying to find Mel's regression. In the past Mel was a
reliable tester so I blindly trusted his results. Was that
some weird sort of denial on my part? :-) ]

Every time a regression is reported I take it seriously - and
there were performance regression reports against numa/core not
just from Mel and I'm sure there will be more in the future. For
example I'm taking David Rijentje's fresh performance regression
report seriously as well.

What I have some problem with is Mel sending me his kernel
config as the thing he tested, and which config included:

CONFIG_TRANSPARENT_HUGEPAGE=y
CONFIG_TRANSPARENT_HUGEPAGE_ALWAYS=y

but he apparently went and explicitly disabled THP on top of
that - which was just a weird choice of 'negative test tuning'
to keep unreported. That made me waste quite some time booting
and debugging his config and made the finding of the root cause
of the testing difference unnecessarily hard for me.

Again, that's not an excuse for the performance regression in
the numa/core tree in any way and I'll fix it.

Thanks,

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/