[PATCH/RFC] MM slub: add a sysfs entry to show the calculatednumber of fallback slabs

From: Richard Kennedy
Date: Fri Nov 12 2010 - 06:28:37 EST


Add a slub sysfs entry to show the calculated number of fallback slabs.

Using the information already available it is straightforward to
calculate the number of fallback & full size slabs. We can then track
which slabs are particularly effected by memory fragmentation and how
long they take to recover.

There is no change to the mainline code, the calculation is only
performed on request, and the value is available without having to
enable CONFIG_SLUB_STATS.

Note that this could give the wrong value if the user changes the slab
order via the sysfs interface.

Signed-off-by: Richard Kennedy <richard@xxxxxxxxxxxxxxx>
---


As we have the information needed to do this calculation is seem useful
to expose it and provide another way to understand what is happening
inside the memory manager.

On my desktop workloads (kernel compile etc) I'm seeing surprisingly
little slab fragmentation. Do you have any suggestions for test cases
that will fragment the memory?

I copied the code to count the total objects from the slabinfo s_show
function, but as I don't need the partial count I didn't extract it into
a helper function.

regards
Richard


diff --git a/mm/slub.c b/mm/slub.c
index 8fd5401..8c79eaa 100644
--- a/mm/slub.c
+++ b/mm/slub.c
@@ -4043,6 +4043,46 @@ static ssize_t destroy_by_rcu_show(struct kmem_cache *s, char *buf)
}
SLAB_ATTR_RO(destroy_by_rcu);

+/* The number of fallback slabs can be calculated to give an
+ * indication of how fragmented this slab is.
+ * This is a snapshot of the current makeup of this cache.
+ *
+ * Given
+ *
+ * total_objects = (nr_fallback_slabs * objects_per_fallback_slab) +
+ * ( nr_normal_slabs * objects_per_slab)
+ * and
+ * nr_slabs = nr_normal_slabs + nr_fallback_slabs
+ *
+ * then we can easily calculate nr_fallback_slabs.
+ *
+ * Note that this can give the wrong answer if the user has changed the
+ * order of this slab via sysfs.
+ */
+
+static ssize_t fallback_show(struct kmem_cache *s, char *buf)
+{
+ unsigned long nr_objects = 0;
+ unsigned long nr_slabs = 0;
+ unsigned long nr_fallback = 0;
+ unsigned long acc;
+ int node;
+
+ if (oo_order(s->oo) != oo_order(s->min)) {
+ for_each_online_node(node) {
+ struct kmem_cache_node *n = get_node(s, node);
+ nr_slabs += atomic_long_read(&n->nr_slabs);
+ nr_objects += atomic_long_read(&n->total_objects);
+ }
+ acc = nr_objects - nr_slabs * oo_objects(s->min);
+ acc /= (oo_objects(s->oo) - oo_objects(s->min));
+ nr_fallback = nr_slabs - acc;
+ }
+ return sprintf(buf, "%lu\n", nr_fallback);
+}
+SLAB_ATTR_RO(fallback);
+
+
#ifdef CONFIG_SLUB_DEBUG
static ssize_t slabs_show(struct kmem_cache *s, char *buf)
{
@@ -4329,6 +4369,7 @@ static struct attribute *slab_attrs[] = {
&reclaim_account_attr.attr,
&destroy_by_rcu_attr.attr,
&shrink_attr.attr,
+ &fallback_attr.attr,
#ifdef CONFIG_SLUB_DEBUG
&total_objects_attr.attr,
&slabs_attr.attr,


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/