Re: [PATCHv4 3/6] mm/zpool: implement common zpool api to zbud/zsmalloc

From: Andrew Morton
Date: Tue Jun 24 2014 - 19:09:17 EST


On Tue, 24 Jun 2014 11:39:12 -0400 Dan Streetman <ddstreet@xxxxxxxx> wrote:

> On Mon, Jun 23, 2014 at 5:46 PM, Andrew Morton
> <akpm@xxxxxxxxxxxxxxxxxxxx> wrote:
> > On Mon, 2 Jun 2014 18:19:43 -0400 Dan Streetman <ddstreet@xxxxxxxx> wrote:
> >
> >> Add zpool api.
> >>
> >> zpool provides an interface for memory storage, typically of compressed
> >> memory. Users can select what backend to use; currently the only
> >> implementations are zbud, a low density implementation with up to
> >> two compressed pages per storage page, and zsmalloc, a higher density
> >> implementation with multiple compressed pages per storage page.
> >>
> >> ...
> >>
> >> +/**
> >> + * zpool_create_pool() - Create a new zpool
> >> + * @type The type of the zpool to create (e.g. zbud, zsmalloc)
> >> + * @flags What GFP flags should be used when the zpool allocates memory.
> >> + * @ops The optional ops callback.
> >> + *
> >> + * This creates a new zpool of the specified type. The zpool will use the
> >> + * given flags when allocating any memory. If the ops param is NULL, then
> >> + * the created zpool will not be shrinkable.
> >> + *
> >> + * Returns: New zpool on success, NULL on failure.
> >> + */
> >> +struct zpool *zpool_create_pool(char *type, gfp_t flags,
> >> + struct zpool_ops *ops);
> >
> > It is unconventional to document the API in the .h file. It's better
> > to put the documentation where people expect to find it.
> >
> > It's irritating for me (for example) because this kernel convention has
> > permitted me to train my tags system to ignore prototypes in headers.
> > But if I want to find the zpool_create_pool documentation I will need
> > to jump through hoops.
>
> Got it, I will move it to the .c file.
>
> I noticed you pulled these into -mm, do you want me to send follow-on
> patches for these changes, or actually update the origin patches and
> resend the patch set?

Full resend, I guess. I often add things which are
not-quite-fully-baked to give them a bit of testing, check for
integration with other changes, etc.

> >
> >>
> >> ...
> >>
> >> +
> >> +struct zpool *zpool_create_pool(char *type, gfp_t flags,
> >> + struct zpool_ops *ops)
> >> +{
> >> + struct zpool_driver *driver;
> >> + struct zpool *zpool;
> >> +
> >> + pr_info("creating pool type %s\n", type);
> >> +
> >> + spin_lock(&drivers_lock);
> >> + driver = zpool_get_driver(type);
> >> + spin_unlock(&drivers_lock);
> >
> > Racy against unregister. Can be solved with a standard get/put
> > refcounting implementation. Or perhaps a big fat mutex.

Was there a decision here?

> >> +void zpool_destroy_pool(struct zpool *zpool)
> >> +{
> >> + pr_info("destroying pool type %s\n", zpool->type);
> >> +
> >> + spin_lock(&pools_lock);
> >> + list_del(&zpool->list);
> >> + spin_unlock(&pools_lock);
> >> + zpool->driver->destroy(zpool->pool);
> >> + kfree(zpool);
> >> +}
> >
> > What are the lifecycle rules here? How do we know that nobody else can
> > be concurrently using this pool?
>
> Well I think with zpools, as well as direct use of zsmalloc and zbud
> pools, whoever creates a pool is responsible for making sure it's no
> longer in use before destroying it.

Sounds reasonable. Perhaps there's some convenient WARN_ON we can put
in here to check that.

> I think in most use cases, pool
> creators won't be sharing their pools, so there should be no issue
> with concurrent use. In fact, concurrent pool use it probably a bad
> idea in general - zsmalloc for example relies on per-cpu data during
> handle mapping, so concurrent use of a single pool might result in the
> per-cpu data being overwritten if multiple users of a single pool
> tried to map and use different handles from the same cpu.

That's all a bit waffly. Either we support concurrent use or we don't!

> Should some use/sharing restrictions be added to the zpool documentation?

Sure. And the code if possible. If a second user tries to use a pool
which is already in use, that attempt should just fail, with WARN,
printk, return -EBUSY, whatever.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/