Re: [PATCH v2 0/4] Checkpoint/Restore: Show in proc IDs of objectsthat can be shared between tasks

From: Cyrill Gorcunov
Date: Sat Nov 19 2011 - 10:34:17 EST


On Sat, Nov 19, 2011 at 12:10:12PM +0400, Vasiliy Kulikov wrote:
> On Sat, Nov 19, 2011 at 11:57 +0400, Vasiliy Kulikov wrote:
> > Doing something like hash(cookie1 ++ obj ++ cookie) would leak only the
> > equation of two objects, but it can be still dangerous - learn hashes of
> > (a) objects created at boot time (their addresses are known) and (b)
> > some objects, which allocation scheme is known (i.e. we know
> > kmem_cache_alloc() gives us specific addresses with high probability),
> > and then compare the hashes against other objects after (a) and (b)
> > objects are kfree'd.
> >

First of all, thanks a *huge* for comments Vasiliy! Yes, agreed that plain
single xor is not sufficient here.

> >
> > What is the highest timeframe which must maintain the property of unique
> > ids? Is it the whole system lifetime or probably [dump start; dump
> > end] and we can change the cookie many times? Can we probably shorten

Yes, dump-start/dump-end is a mininum timeframe.

> > the time even? Can we ensure that during this timeframe no new kernel
> > objects will be created (unrealistic, but would be great)?
> >

We might use PT_SEIZED as such flag and don't allow to allocate new kernel
objects but it will bring too much complexity into kernel code I think,
which is not what we want eventually ;)

> > Also, I didn't understand from the quoted text who will use it - only
> > the dumper or this interface is exposed to all userspace processes and
> > anybody may learn hash(&kern_obj) for any kern_obj he may reference?
>

It's limited to /proc/$pid/

> Also, if one should have an ability to learn IDs of specific object
> types and the set of types is very limited, it's much safer to have one
> increasing u64 counter for each created object of one of these types.
> The exposed to userspace data will be:
>
> ID = hash(counter ^ cookie)
>
> cookie is generated at boot time, once. counter is a single
> variable, one for all exposed kernel object types.
>
> ID will be unpredictable if hash() is cryptographically secure, and
> counter is not duplicated. So, for each newly created object the ID is
> the new random value, which is unique and says nothing to userspace about
> either kernel object addresses or the counter itself.
>
> The cost:
>
> 1) counter storing for each kernel object exposed through this interface.
>

Yes, this is main concern.

> 2) object creation will be slowed down by hash().
>

This is not that important I think, since it's not a time-critical operation.

> Also, one thought - is it safe to say two kernel objects are the same to
> userspace? :) I don't see anything obviously dangerous, though.
>

Cyrill
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/