Re: DM Regression in 4.16-rc1 - read() returns data when it shouldn't
From: Mike Snitzer
Date: Wed Feb 14 2018 - 18:05:32 EST
On Wed, Feb 14 2018 at 3:39pm -0500,
NeilBrown <neilb@xxxxxxxx> wrote:
> On Wed, Feb 14 2018, Milan Broz wrote:
>
> > Hi,
> >
> > the commit (found by bisect)
> >
> > commit 18a25da84354c6bb655320de6072c00eda6eb602
> > Author: NeilBrown <neilb@xxxxxxxx>
> > Date: Wed Sep 6 09:43:28 2017 +1000
> >
> > dm: ensure bio submission follows a depth-first tree walk
> >
> > cause serious regression while reading from DM device.
> >
> > The reproducer is below, basically it tries to simulate failure we see in cryptsetup
> > regression test: we have DM device with error and zero target and try to read
> > "passphrase" from it (it is test for 64 bit offset error path):
> >
> > Test device:
> > # dmsetup table test
> > 0 10000000 error
> > 10000000 1000000 zero
> >
> > We try to run this operation:
> > lseek64(fd, 5119999988, SEEK_CUR); // this should seek to error target sector
> > read(fd, buf, 13); // this should fail, if we seek to error part of the device
> >
> > While on 4.15 the read properly fails:
> > Seek returned 5119999988.
> > Read returned -1.
> >
> > for 4.16 it actually succeeds returning some random data
> > (perhaps kernel memory, so this bug is even more dangerous):
> > Seek returned 5119999988.
> > Read returned 13.
> >
> > Full reproducer below:
> >
> > #define _GNU_SOURCE
> > #define _LARGEFILE64_SOURCE
> > #include <stdio.h>
> > #include <stddef.h>
> > #include <stdint.h>
> > #include <stdlib.h>
> > #include <unistd.h>
> > #include <fcntl.h>
> > #include <inttypes.h>
> >
> > int main (int argc, char *argv[])
> > {
> > char buf[13];
> > int fd;
> > //uint64_t offset64 = 5119999999;
> > uint64_t offset64 = 5119999988;
> > off64_t r;
> > ssize_t bytes;
> >
> > system("echo -e \'0 10000000 error\'\\\\n\'10000000 1000000 zero\' | dmsetup create test");
> >
> > fd = open("/dev/mapper/test", O_RDONLY);
> > if (fd == -1) {
> > printf("open fail\n");
> > return 1;
> > }
> >
> > r = lseek64(fd, offset64, SEEK_CUR);
> > printf("Seek returned %" PRIu64 ".\n", r);
> > if (r < 0) {
> > printf("seek fail\n");
> > close(fd);
> > return 2;
> > }
> >
> > bytes = read(fd, buf, 13);
> > printf("Read returned %d.\n", (int)bytes);
> >
> > close(fd);
> > return 0;
> > }
> >
> >
> > Please let me know if you need more info to reproduce it.
>
> Thanks for the detailed report. I haven't tried to reproduce, but the
> code looks very weird.
> The patch I posted "Date: Wed, 06 Sep 2017 09:43:28 +1000" had:
> + struct bio *b = bio_clone_bioset(bio, GFP_NOIO,
> + md->queue->bio_split);
> + bio_advance(b, (bio_sectors(b) - ci.sector_count) << 9);
> + bio_chain(b, bio);
> + generic_make_request(b);
> + break;
>
> The code in Linux has:
>
> struct bio *b = bio_clone_bioset(bio, GFP_NOIO,
> md->queue->bio_split);
> ci.io->orig_bio = b;
> bio_advance(bio, (bio_sectors(bio) - ci.sector_count) << 9);
> bio_chain(b, bio);
> ret = generic_make_request(bio);
> break;
>
> So the wrong bio is sent to generic_make_request().
> Mike: do you remember how that change happened? I think there were
> discussions following the patch, but I cannot find anything about making
> that change.
Mikulas had this feedback:
https://www.redhat.com/archives/dm-devel/2017-November/msg00159.html
You replied with:
https://www.redhat.com/archives/dm-devel/2017-November/msg00165.html
Where you said "Yes, you are right something like that would be better."
And you then provided a follow-up patch:
https://www.redhat.com/archives/dm-devel/2017-November/msg00175.html
That we discussed and I said I'd just fold it into the original, and you
agreed and thanked me for checking with you ;)
https://www.redhat.com/archives/dm-devel/2017-November/msg00208.html
Anyway, I'm just about to switch to Daddy Daycare mode (need to get my
daughter up from her nap, feed her dinner, etc) so: I'll circle back to
this tomorrow morning.
Thanks,
Mike