Re: 2.6.21-rc1: known regressions (v2) (part 2)

From: Mike Galbraith
Date: Thu Mar 01 2007 - 00:32:19 EST

Next message: Dmitry Torokhov: "Re: PROBLEM: "BUG:" when resuming from suspend-to-ram"
Previous message: Kandan Venkataraman: "[PATCH] Loop device - Tracking page writes made to a loop device through mmap"
Next in thread: Ingo Molnar: "Re: 2.6.21-rc1: known regressions (v2) (part 2)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Thu, 2007-03-01 at 09:01 +1100, Con Kolivas wrote:
> On Wednesday 28 February 2007 15:21, Mike Galbraith wrote:

> > On Wed, 2007-02-28 at 09:58 +1100, Con Kolivas wrote:
> > > On Tuesday 27 February 2007 19:54, Mike Galbraith wrote:
> > > > Agreed.
> > > >
> > > > I was recently looking at that spot because I found that niced tasks
> > > > were taking latency hits, and disabled it, which helped a bunch.
> > >
> > > Ok... as I said above to Ingo, nice means more latency too, and there is
> > > no doubt that if we disable nice as a working feature then the niced
> > > tasks will have less latency. Of course, this ends up meaning that
> > > un-niced tasks no longer receive their better cpu performance.. You're
> > > basically saying that you prefer nice not to work in the setting of HT.
> >
> > No I'm not, but let's go further in that direction just for the sake of
> > argument. You're then saying that you prefer realtime priorities to not
> > work in the HT setting, given that realtime tasks don't participate in
> > the 'single stream me' program.
>
> Where do I say that? I do not presume to manage realtime priorities in any
> way. You're turning my argument about nice levels around and somehow saying
> that because hyperthreading breaks the single stream me semantics by
> parallelising them that I would want to stop that happening. Nowhere have I
> argued that realtime semantics should be changed to somehow work around
> hyperthreading. SMT nice is about managing nice only, and not realtime
> priorities which are independent entities.

I see no real difference between the two assertions. Nice is just a
mechanism to set priority, so I applied your assertion to a different
range of priorities than nice covers, and returned it to show that the
code contradicts itself. It can't be bad for a nice 1 task to run with
a nice 0 task, but OK for a minimum RT task to run with a maximum RT
task. Iff HT without corrective measures breaks nice, then it breaks
realtime priorities as well.

> > I'm saying only that we're defeating the purpose of HT, and overriding a
> > user decision every time we force a sibling idle.
> >
> > > > I also
> > > > can't understand why it would be OK to interleave a normal task with an
> > > > RT task sometimes, but not others.. that's meaningless to the RT task.
> > >
> > > Clearly there would be a reason that code is there... The whole problem
> > > with HT is that as soon as you load up a sibling, you slow down the
> > > logical sibling, hence why this code is there in the first place. Since I
> > > know you're one to test things for yourself, I will put it to you this
> > > way:
> > >
> > > Boot into UP. Time how long it takes to do a million of these in a real
> > > time task:
> > > asm volatile("" : : : "memory");
> > >
> > > Then start up a SCHED_NORMAL task fully cpu bound such as "yes >
> > > /dev/null" and time that again. Obviously the former being a realtime
> > > task will take the same amount of time and the SCHED_NORMAL task will be
> > > starved until the realtime task finishes.
> >
> > Sure.
> >
> > > Now try the same experiment with hyperthreading enabled and an ordinary
> > > SMP kernel. You'll find the realtime task runs at only ~60% performance.
> >
> > So? User asked for HT. That's hardware multiplexing. It ain't free.
> > Buyer beware.
>
> But the buyer is not aware. You are aware because you tinker, but the vast
> majority of users who enable hyperthreading in their shiny pcs are not aware.

Then we need to make them aware of what they're enabling?

> The only thing they know is that if they enable hyperthreading their programs
> run slower in multitasking environments no matter how much they nice the
> other processes. Buyers do not buy hardware knowing that the internal design
> breaks something as fundamental as 'nice'. You seem to presume that most
> people who get hyperthreading are happy to compromise 'nice' in order to get
> their second core working and I put it to you that they do not make that
> decision.

To me it's pretty much black and white. Either you want to split your
cpu into logical units, which means each has less to offer than the
total, or you want all your processing power in one bucket.

> > > That's a
> > > serious performance hit for realtime tasks considering you're running a
> > > SCHED_NORMAL task. The SMT code that you seem to dislike fixes this
> > > problem.
> >
> > I don't think it does actually. Let your RT task sleep regularly, and
> > ever so briefly. We don't evict lower priority tasks from siblings upon
> > wakeup, we only prevent entry... sometimes.
>
> Well you know as well as I do that you're selecting out the exception rather
> than the rule, and statistically overall, it does work.

I don't agree that it's the exception, and if you look at this HT thing
from the split cpu perspective, I'm not sure there's even a problem.

Scrolling down, I see that this is getting too long, and we aren't
communicating anyway, so I'll cut it off here.

> We're going to have to agree to disagree, but feel free to have the final word
> of course.

The above will have to do.

-Mike

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Dmitry Torokhov: "Re: PROBLEM: "BUG:" when resuming from suspend-to-ram"
Previous message: Kandan Venkataraman: "[PATCH] Loop device - Tracking page writes made to a loop device through mmap"
Next in thread: Ingo Molnar: "Re: 2.6.21-rc1: known regressions (v2) (part 2)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]