Re: [PATCH] af_unix: fix regression in read after shutdown
From: Vincent Whitchurch
Date: Thu Nov 25 2021 - 05:12:14 EST
On Wed, Nov 24, 2021 at 12:45:43AM +0100, Cong Wang wrote:
> On Fri, Nov 19, 2021 at 01:05:21PM +0100, Vincent Whitchurch wrote:
> > On kernels before v5.15, calling read() on a unix socket after
> > shutdown(SHUT_RD) or shutdown(SHUT_RDWR) would return the data
> > previously written or EOF. But now, while read() after
> > shutdown(SHUT_RD) still behaves the same way, read() after
> > shutdown(SHUT_RDWR) always fails with -EINVAL.
>
> Maybe just lift the socket tate check in unix_stream_read_generic()?
That would have have handled the specific case of read(2) on
SOCK_STREAM, but the sk->sk_state is checked in many other places in
af_unix.c so there would still be userspace-visible behaviour changes in
several other situations, which could cause regressions. Just to give
one such example, the sendfile(2) call in the following program gets
killed by SIGPIPE on earlier kernels but would now instead start to
return -ENOTCONN:
#include <err.h>
#include <errno.h>
#include <stdio.h>
#include <sys/socket.h>
#include <sys/unistd.h>
#include <sys/sendfile.h>
#include <sys/types.h>
#include <fcntl.h>
int main(int argc, char *argv[]) {
int sock[2];
int ret;
ret = socketpair(AF_UNIX, SOCK_STREAM, 0, sock);
if (ret < 0)
err(1, "socketpair");
ret = shutdown(sock[0], SHUT_RDWR);
if (ret < 0)
err(1, "shutdown");
ssize_t bytes = sendfile(sock[0], open(argv[0], O_RDONLY), NULL, 16);
if (bytes < 0)
err(1, "sendfile");
printf("sendfile %zd bytes\n", bytes);
return 0;
}
> >
> > This behaviour change was apparently inadvertently introduced as part of
> > a bug fix for a different regression caused by the commit adding sockmap
> > support to af_unix, commit 94531cfcbe79c359 ("af_unix: Add
> > unix_stream_proto for sockmap"). Those commits, for unclear reasons,
> > started setting the socket state to TCP_CLOSE on shutdown(SHUT_RDWR),
>
> Not sure why it is unclear here, for an connection oriented socket, it
> can be closed for just one direction, in this case we want to prevent it
> from being redirected in sockmap, hence the point of the commits.
I must admit I'm not really familiar with either af_unix.c or sockmap,
but clearly the existing code in af_unix.c does not expect sk_state to
be changed in shutdown. If we want to prevent UNIX sockets which have
had shutdown(SHUT_RDWR) called on then from being redirect to sockmap,
then maybe some other flag should be used to achieve that?
(Also, I wonder why the code added by the patch handled SHUT_RDWR
differently from a SHUT_RD followed by a SHUT_WR?)
> > while this state change had previously only been done in
> > unix_release_sock().
> >
> > Restore the original behaviour. The sockmap tests in
> > tests/selftests/bpf continue to pass after this patch.
>
> Isn't this because we don't have shutdown() in sockmap tests?
That may well be the case, I just assumed that the tests added along
with the new feature were comprehensive.