Re: 4.9.0 regression in pipe-backed iov_iter with systemd-nspawn
From: Linus Torvalds
Date: Thu Jan 12 2017 - 18:02:19 EST
On Thu, Jan 12, 2017 at 2:46 PM, Al Viro <viro@xxxxxxxxxxxxxxxxxx> wrote:
>
> PS: what about the /proc/mounts contents? If it's something 9p-backed kvm,
> your bisect might have been caught on the bug I'd mentioned - if the breakage
> you are seeing in 4.9.3 has started after that commit and before the
> backport of the fix, your bisect could converge there. Does the
> reproducer trigger on 523ac9afc73a + cherry-pick of 8e54cadab447?
Looking at the strace that Alan just posted, I really think it's the
splice change that Alan bisected to.
In particular, this line:
splice(5, NULL, 1, NULL, 9223372036854775807, 0) = -1 EAGAIN
(Resource temporarily unavailable)
and note that the commit in question introduces that -EAGAIN error code.
The old code never returned EAGAIN at all (well, it could do so later,
if NONBLOCK was set, obviously, but that doesn't seem to be the case
here).
So that commit seems to have introduced a new error case, and I
suspect systemd-nospawn simply doesn't handle it. It is expecting
splice_to_pipe() to actually block.
Ergo: I think we need to do a wait_for_space() somewhere, getting rid
of the EAGAIN.
Looking at the callers of "do_splice_to()", we already have the
wait_for_space() in do_splice(), but we do *not* have it in the
do_splice_from() case when both the input and output file descriptors
are pipes.
Hmm?
Linus