Re: [PATCH 1/2] overflow: Implement size_t saturating arithmetic helpers

From: Kees Cook
Date: Mon Sep 20 2021 - 22:53:58 EST


On Mon, Sep 20, 2021 at 03:06:33PM -0700, Nick Desaulniers wrote:
> On Mon, Sep 20, 2021 at 11:09 AM Kees Cook <keescook@xxxxxxxxxxxx> wrote:
> > [...]
> > +/*
> > + * Internal logic for size_add(). Takes variable names from UNIQUE_ID
> > + * so that the local variables here will never collide with other local
> > + * variables (for example, with itself).
> > + */
> > +#define __size_add(addend1, addend2, __addend1, __addend2, __sum) \
> > +({ \
> > + size_t __sum; \
> > + size_t __addend1 = (addend1); \
> > + size_t __addend2 = (addend2); \
> > + if (check_add_overflow(__addend1, __addend2, &__sum)) \
> > + __sum = SIZE_MAX; \
> > + __sum; \
> > +})
> > +
> > +/**
> > + * size_add() - Calculate size_t addition with saturation at SIZE_MAX
> > + *
> > + * @addend1: first addend
> > + * @addend2: second addend
> > + *
> > + * Returns: calculate @addend1 + @addend2, where both values are
> > + * evaluated as size_t, with any overflow causing the return value to
> > + * be SIZE_MAX.
> > + */
> > +#define size_add(addend1, addend2) \
> > + __must_check_size(__size_add(addend1, addend2, \
> > + __UNIQUE_ID(__addend1_), \
> > + __UNIQUE_ID(__addend2_), \
> > + __UNIQUE_ID(__sum_)))
>
> Is the use of __UNIQUE_ID really necessary? Is the point to avoid some
> kind of variable shadowing? (As opposed to just using names for the
> new variables in the scope of the statement expressions? ie.

Yes, when composed[1], they would shadow (under -Wshadow). I'd rather
not knowingly add yet more[2] shadowed variables to the kernel. :)

[1] https://godbolt.org/z/1rM6Ko1j3
[2] https://github.com/KSPP/linux/issues/152

> +#define __size_add(addend1, addend2, __sum) \
> +({ \
> + size_t __sum; \
> + if (check_add_overflow((size_t)__addend1, (size_t)__addend2,
> &__sum)) \
> + __sum = SIZE_MAX; \
> + __sum; \
> +})
>
> Do the double-underscore-prefixed really need to be a separate
> #define, or can their definitions be inlined into the expansion sites;
> there seems like there's no other users of the
> double-underscore-prefixed versions otherwise. ie.
>
> #define size_add(addend1, addend2) \
> __must_check_size(({ \
> size_t sum; \
> if (check_add_overflow((size_t)addend1, (size_t)addend2), &sum; \
> sum = SIZE_MAX; \
> sum; \
> })

Right, there aren't, but that's the way to pass such variable names in.
(See minmax.h.) This also leaves the door open for using these helpers
as constant expressions, if the need arises.

> > + err |= check_one_size_helper(SIZE_MAX, size_add, SIZE_MAX, -3);
>
> Sorry, is this ^ case saying that we expect SIZE_MAX + -3 == SIZE_MAX?
> This is because the helpers performed unsigned arithmetic on size_t?

Correct. When I wrote this I hadn't yet found any cases of needing to
shrink an allocation size that followed a common pattern like this. But it
turns out we do have some:

drivers/infiniband/core/sa_query.c: sa_dev = kzalloc(struct_size(sa_dev, port, e - s + 1), GFP_KERNEL);
drivers/infiniband/core/user_mad.c: umad_dev = kzalloc(struct_size(umad_dev, ports, e - s + 1), GFP_KERNEL);
drivers/net/ethernet/intel/iavf/iavf_virtchnl.c: len = struct_size(vti, list, adapter->num_tc - 1);

I'll need to add size_sub() as well.

Thanks for looking this over!

--
Kees Cook