Re: Async suspend-resume patch w/ completions (was: Re: Asyncsuspend-resume patch w/ rwsems)

From: Linus Torvalds
Date: Thu Dec 10 2009 - 18:30:31 EST

Next message: Linus Torvalds: "Re: [git pull] drm"
Previous message: Phillip Lougher: "Re: [PATCH 1/9] Squashfs: move zlib decompression wrapper code intoa separate file"
In reply to: Rafael J. Wysocki: "Re: Async suspend-resume patch w/ completions (was: Re: Async suspend-resume patch w/ rwsems)"
Next in thread: Rafael J. Wysocki: "Re: Async suspend-resume patch w/ completions (was: Re: Async suspend-resume patch w/ rwsems)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Thu, 10 Dec 2009, Rafael J. Wysocki wrote:
>
> OK, why don't you just say you won't merge anything that doesn't use rwsems

I did! Here's a quote (and it's pretty much the whole email, so it's not
like it was hidden):

- alpine.LFD.2.00.0912081309370.3560@xxxxxxxxxxxxxxxxxxxxx:

"Let me put this simply: I've told you guys how to do it simply, with
_zero_ crap. No "iterating over children". No games. No data structures.
No new infrastructure. Just a single new rwlock per device, and _trivial_
code.

So here's the challenge: try it my simple way first. I've quoted the code
about five million times already. If you _actually_ see some problems,
explain them. Don't make up stupid "iterate over each child" things. Don't
claim totally made-up "leads to difficulties". Don't make it any more
complicated than it needs to be.

Keep it simple. And once you have tried that simple approach, and you
really can show why it doesn't work, THEN you can try something else.

But before you try the simple approach and explain why it wouldn't work, I
simply will not pull anything more complex. Understood and agreed?"

And then later about completions:

- alpine.LFD.2.00.0912081416470.3560@xxxxxxxxxxxxxxxxxxxxx:

"So I think completions should work, if done right. That whole "make the
parent wait for all the children to complete" is fine in that sense. And
I'll happily take such an approach if my rwlock thing doesn't work."

IOW, I'll happily take the completions version, but dammit, I refuse to
take it when there is a simpler approach that does NOT need to iterate,
and does NOT need to re-initialize the data structures each round etc.

That's what I've been arguing against the whole time. It started as
arguing against complex and unnecessary infrastructure, and trying to show
that it _can_ be done so much simpler using existing basic locking.

And I get annoyed when you guys continually seem to want to make it more
complex than it needs to be.

> > And this, for example, is pretty disgusting. Not only is that
> > INIT_COMPLETION purely brought on by the whole problem with completions
> > (they are fundamentally one-shot, but you want to use them over and over
>
> Actually, twice. However, since I don't want to do any async handling in the
> _noirq phases any more, I can get rid of this whole function. Thanks for
> pointing that out to me.

Well, my point was that you'll need to do that

INIT_COMPLETION(dev->power.completion);

thing each suspend and each resume. Exactly because completions are
designed to be "onw-way" things, so you end up having to reset them each
cycle (you just reset them even _more_ than you needed).

Again, my point was that using locks is actually a very _natural_ thing to
do. I really don't understand what problems you and Alan have with just
using locks - we have way more locks in the kernel than we have
completions, so they are the "default" thing to do, and they really are
very natural to use.

[ Ok, so admittedly the actual use of 'struct rw_semaphore' is pretty
unusual, but my point is that people are used to locking semantics in
general, more so than the semantics of completions ]

Completions were literally designed to be used for one-off things - one of
the most common uses is that the 'struct completion' is on the _stack_. It
doesn't get much more one-off than that - and the completions are really
very explicitly designed so that you can do a 'complete()' on something
that will literally disappear from under you as you do it (because the
struct completion might be on the stack of the thing that is waiting for
it, and gets de-allocated when the waiter goes ahead).

That is why 'wait_for_completion()' always has to take the spinlock, for
example - there is no fastpath for completion, because the races for the
waiter releasing things too early are too nasty.

So completions are actually very subtle things - and you don't need any of
that subtlety. I realize that from a user perspective, completions look
very simple, but in many ways they actually have subtler semantics than a
regular lock has.

Linus
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Linus Torvalds: "Re: [git pull] drm"
Previous message: Phillip Lougher: "Re: [PATCH 1/9] Squashfs: move zlib decompression wrapper code intoa separate file"
In reply to: Rafael J. Wysocki: "Re: Async suspend-resume patch w/ completions (was: Re: Async suspend-resume patch w/ rwsems)"
Next in thread: Rafael J. Wysocki: "Re: Async suspend-resume patch w/ completions (was: Re: Async suspend-resume patch w/ rwsems)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]