Re: [Xen-devel] [PATCH 3/3] xen/blk[front|back]: Enhance discardsupport with secure erasing support.

From: Ian Campbell
Date: Wed Oct 12 2011 - 06:54:07 EST


On Tue, 2011-10-11 at 21:57 +0100, Konrad Rzeszutek Wilk wrote:
> > My later response to it should include it:
> > http://lists.xensource.com/archives/html/xen-devel/2011-10/msg00652.html
> >
> > >
> > > Further I wonder why you don't use the "reserved" field instead of
> > > extending the structure at the end.
> >
> > <blinks> I completly missed it. That would definitly work as well.
> >
> > Let me redo it with that in mind.
>
> I've posted the Xen hypervisor ABI one that thread above. The implementation
> of that looks as follow:

Ian.

>
> commit ae33f998d66c5982af533bda25c2b6c4f863789f
> Author: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>
> Date: Mon Oct 10 10:58:40 2011 -0400
>
> xen/blk[front|back]: Enhance discard support with secure erasing support.
>
> Part of the blkdev_issue_discard(xx) operation is that it can also
> issue a secure discard operation that will permanantly remove the
> sectors in question. We advertise that we can support that via the
> 'discard-secure' attribute and on the request, if the 'secure' bit
> is set, we will attempt to pass in REQ_DISCARD | REQ_SECURE.
>
> CC: Li Dongyang <lidongyang@xxxxxxxxxx>
> [v1: Used 'flag' instead of 'secure:1' bit]
> [v2: Use 'reserved 'uint8_t' as a flag]
> Signed-off-by: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>
>
> diff --git a/drivers/block/xen-blkback/blkback.c b/drivers/block/xen-blkback/blkback.c
> index 94e659d..4f33c13 100644
> --- a/drivers/block/xen-blkback/blkback.c
> +++ b/drivers/block/xen-blkback/blkback.c
> @@ -362,7 +362,7 @@ static int xen_blkbk_map(struct blkif_request *req,
> {
> struct gnttab_map_grant_ref map[BLKIF_MAX_SEGMENTS_PER_REQUEST];
> int i;
> - int nseg = req->nr_segments;
> + int nseg = req->u1.nr_segments;
> int ret = 0;
>
> /*
> @@ -422,13 +422,16 @@ static void xen_blk_discard(struct xen_blkif *blkif, struct blkif_request *req)
> int status = BLKIF_RSP_OKAY;
> struct block_device *bdev = blkif->vbd.bdev;
>
> - if (blkif->blk_backend_type == BLKIF_BACKEND_PHY)
> + if (blkif->blk_backend_type == BLKIF_BACKEND_PHY) {
> + unsigned long secure = (blkif->vbd.discard_secure &&
> + (req->u1.flag & BLKIF_OP_DISCARD_FLAG_SECURE)) ?
> + BLKDEV_DISCARD_SECURE : 0;
> /* just forward the discard request */
> err = blkdev_issue_discard(bdev,
> req->u.discard.sector_number,
> req->u.discard.nr_sectors,
> - GFP_KERNEL, 0);
> - else if (blkif->blk_backend_type == BLKIF_BACKEND_FILE) {
> + GFP_KERNEL, secure);
> + } else if (blkif->blk_backend_type == BLKIF_BACKEND_FILE) {
> /* punch a hole in the backing file */
> struct loop_device *lo = bdev->bd_disk->private_data;
> struct file *file = lo->lo_backing_file;
> @@ -618,6 +621,9 @@ static int dispatch_rw_block_io(struct xen_blkif *blkif,
> struct blk_plug plug;
> bool drain = false;
>
> + /* Check that the number of segments is sane. */
> + nseg = req->u1.nr_segments;

This field is invalid (at least with these semantics) if req->operation
== BLKIF_OP_DISCARD so reading it here and clearing it later when you
decide it is invalid is just confusing. Why not read it inside the
switch iff it is valid?

> +
> switch (req->operation) {
> case BLKIF_OP_READ:
> blkif->st_rd_req++;
> @@ -636,6 +642,7 @@ static int dispatch_rw_block_io(struct xen_blkif *blkif,
> case BLKIF_OP_DISCARD:
> blkif->st_ds_req++;
> operation = REQ_DISCARD;
> + nseg = 0; /* The nr_segments and flag share the same space. */
> break;
> default:
> operation = 0; /* make gcc happy */
> @@ -643,8 +650,6 @@ static int dispatch_rw_block_io(struct xen_blkif *blkif,
> break;
> }
>
> - /* Check that the number of segments is sane. */
> - nseg = req->nr_segments;
> if (unlikely(nseg == 0 && operation != WRITE_FLUSH &&
> operation != REQ_DISCARD) ||
> unlikely(nseg > BLKIF_MAX_SEGMENTS_PER_REQUEST)) {
> diff --git a/drivers/block/xen-blkback/common.h b/drivers/block/xen-blkback/common.h
> index e638457..c6a5462 100644
> --- a/drivers/block/xen-blkback/common.h
> +++ b/drivers/block/xen-blkback/common.h
> @@ -76,7 +76,10 @@ struct blkif_x86_32_request_discard {
>
> struct blkif_x86_32_request {
> uint8_t operation; /* BLKIF_OP_??? */
> - uint8_t nr_segments; /* number of segments */
> + union {
> + uint8_t nr_segments; /* number of segments */
> + uint8_t flag; /* flag for blkif_x86_32_request_discard*/
> + } u1;

this is a bit of a mess, it sucks that common fields and per-operation
ones are all mixed up like this, but
union {
struct {
nr_segments;
} rw;
struct {
flags;
} discard;
} u
would at least make it more obvious what was what in the code
dereferencing these fields.

Having seen the confusion which arises from reusing this reserved field
I'm not convinced that it is worthwhile. If we don't do that then we can
have a more sane layout which makes it more explicit what is what:

struct blkif_request {
uint8_t operation; /* BLKIF_OP_??? */
uint8_t nr_segments; /* number of segments */
blkif_vdev_t handle; /* only for read/write requests */
uint64_t id; /* private guest value, echoed in resp */
union {
struct {
blkif_sector_t sector_number;/* start sector idx on disk (r/w only) */
struct blkif_request_segment seg[BLKIF_MAX_SEGMENTS_PER_REQUEST];
} rw;
struct {
whatever;
flags;
} discard;
struct {

somethign else
} anotherop;
} u;
};

handle isn't really only r/w, is it? If it is then I think we should
just repeat the id fields within the union and pad so the offset is
correct:

struct blkif_request {
uint8_t operation; /* BLKIF_OP_??? */
union {
struct {
uint8_t nr_segments; /* number of segments */
blkif_vdev_t handle;
uint64_t id; /* private guest value, echoed in resp */
blkif_sector_t sector_number;/* start sector idx on disk (r/w only) */
struct blkif_request_segment seg[BLKIF_MAX_SEGMENTS_PER_REQUEST];
} rw;
struct {
uint8_t flags;
blkif_vdev_t __pad; /* was "handle: only for read/write requests */
uint64_t id; /* private guest value, echoed in resp */
} discard;
struct {
somethign else;
padding
id;
} anotherop;
} u;
};

it's lame but it avoid relying on "only for op xx, yy" type comments to
get things right.


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/