Re: PREEMPT_RT and I-PIPE: the numbers, part 4

From: Ingo Molnar
Date: Mon Jul 11 2005 - 02:08:58 EST



* Karim Yaghmour <karim@xxxxxxxxxxx> wrote:

> With ping floods, as with other things, there is room for improvement,
> but keep in mind that these are standard tests [...]

the problem is that ping -f isn't what it used to be. If you are using a
recent distribution with an updated ping utility, these days the
equivalent of 'ping -f' is something like:

ping -q -l 500 -A -s 10 <target>
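
for reference, assuming the iputils ping that most current
distributions ship (option letters have moved around over the years, so
check your local man page), those flags mean:

  -q       quiet: only print the summary lines
  -l 500   preload: send 500 packets without waiting for replies
  -A       adaptive: then pace sends to the measured round-trip time
  -s 10    10 data bytes of payload per packet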

and even this variant (and the old variant) needs to be carefully
validated for the actual workload it generates. Note that this is true
for workloads against vanilla kernels too. (Also note that i did not
claim that the flood-ping workload you used is invalid - you have not
published packet rates or interrupt rates that would help us judge how
constant the workload was. I only said that according to my
measurements it's quite unstable, and that you should double-check it.
Just running it and ACK-ing that the packet rates are stable and
identical across all of these kernels would be enough to put this
concern to rest.)
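
as a sketch of what i mean by double-checking (eth1, a UP
/proc/interrupts layout and a 1-second sampling interval are just
assumptions here - adjust to your setup), something like this, run
alongside the benchmark, shows whether the interrupt and packet rates
stay constant:

#!/bin/sh
# print eth1's cumulative interrupt count and RX packet count once per
# second - if the per-second deltas wobble, the workload is not stable.
while sleep 1; do
    I=$(grep eth1 /proc/interrupts | awk '{ print $2 }')
    P=$(grep eth1 /proc/net/dev | tr : ' ' | awk '{ print $3 }')
    echo "$(date +%s) irq_count=$I rx_packets=$P"
done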

to see why i think there might be something wrong with the measurement,
just look at the raw numbers:

LMbench running times:
+--------------------+-------+-------+-------+-------+-------+
| Kernel | plain | IRQ | ping | IRQ & | IRQ & |
| | | test | flood | ping | hd |
+====================+=======+=======+=======+=======+=======+
| Vanilla-2.6.12 | 152 s | 150 s | 188 s | 185 s | 239 s |
+====================+=======+=======+=======+=======+=======+
| with RT-V0.7.51-02 | 152 s | 153 s | 203 s | 201 s | 239 s |
+====================+=======+=======+=======+=======+=======+

note that both the 'IRQ' and the 'IRQ & hd' tests involve interrupts,
and there PREEMPT_RT's overhead stays within statistical error; only
the 'flood ping' workload created a ~8% slowdown.

my own testing (for whatever it's worth) shows that during flood-pings,
the maximum overhead PREEMPT_RT caused was 4%. I.e. PREEMPT_RT used 4%
more system-time than the vanilla UP kernel when the CPU was 99%
dedicated to handling ping replies. But in your tests the full CPU was
(of course) not dedicated to flood-ping replies. Your numbers above
suggest that under the vanilla kernel flood pinging ate up ~23.6% extra
CPU time. (188s/152s == +23.6%)

Under PREEMPT_RT, my tentative guesstimate would be that this goes from
23.6% to 24.8% - i.e. 1.2% less CPU time for lmbench - which turns into
roughly +2 seconds of lmbench wall-clock slowdown. Not 15 seconds, like
your test suggests. So there's close to an order of magnitude
difference in the numbers, which i felt was worth sharing :)
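
the back-of-the-envelope version, using only the numbers from the
table above (1.236 and 1.248 being the relative slowdowns from the
previous paragraph):

echo "scale=1; 152 * (1.248 - 1.236)" | bc   # expected extra: ~1.8s
echo "203 - 188" | bc                        # measured extra: 15s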

_And_ your own hd and stable-rate irq workloads suggest that PREEMPT_RT
and vanilla are very close to each other. Let me repeat the table, with
only the numbers included where there was no flood pinging going on:

LMbench running times:
+--------------------+-------+-------+-------+-------+-------+
| Kernel | plain | IRQ | | | IRQ & |
| | | test | | | hd |
+====================+=======+=======+=======+=======+=======+
| Vanilla-2.6.12 | 152 s | 150 s | | | 239 s |
+====================+=======+=======+=======+=======+=======+
| with RT-V0.7.51-02 | 152 s | 153 s | | | 239 s |
+====================+=======+=======+=======+=======+=======+
| with Ipipe-0.7 | 149 s | 150 s | | | 236 s |
+====================+=======+=======+=======+=======+=======+

these numbers suggest that outside of ping-flooding all IRQ overhead
results are within statistical error.

So why do your "ping flood" results show such a difference? It really
is just another type of interrupt workload - there is nothing special
about it.

> but keep in mind that these are standard tests used as-is by others
> [...]

are you suggesting this is not really a benchmark but a way to test how
well a particular system holds up under extreme external load?

> For one thing, the heavy fluctuation in ping packets may actually
> induce a state in the monitored kernel which is more akin to the one
> we want to measure than if we had a steady flow of packets.

so you can see ping packet flow fluctuations in your tests? Then you
cannot use those results as any sort of benchmark metric.

under PREEMPT_RT, if you wish to tone down the effects of an interrupt
source then all you have to do is something like:

# find the PID of eth1's IRQ thread (named "IRQ <nr>" under PREEMPT_RT)
P=$(pidof "IRQ $(grep eth1 /proc/interrupts | cut -d: -f1 | xargs echo)")

chrt -o -p 0 $P     # net irq thread: back to SCHED_OTHER ...
renice -n 19 $P     # ... at the lowest nice level
chrt -o -p 0 5      # softirq-tx thread (PID 5 on this box)
renice -n 19 5
chrt -o -p 0 6      # softirq-rx thread (PID 6 on this box)
renice -n 19 6

and from this point on you should see zero lmbench overhead from flood
pinging. Can vanilla or I-PIPE do that?
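
(a quick way to double-check that the reprioritization took effect -
this assumes procps ps, whose rtprio and ni columns show the realtime
priority and nice level:)

# the IRQ/softirq threads should now show rtprio '-' and ni 19
ps -eo pid,rtprio,ni,comm | egrep 'IRQ|softirq'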

Ingo