Re: [PATCH -V5 12/14] memcg: move HugeTLB resource count to parent cgroup on memcg removal

From: Aneesh Kumar K.V
Date: Mon Apr 09 2012 - 06:01:50 EST


KAMEZAWA Hiroyuki <kamezawa.hiroyu@xxxxxxxxxxxxxx> writes:

> (2012/04/07 3:50), Aneesh Kumar K.V wrote:
>
>> From: "Aneesh Kumar K.V" <aneesh.kumar@xxxxxxxxxxxxxxxxxx>
>>
>> This add support for memcg removal with HugeTLB resource usage.
>>
>> Signed-off-by: Aneesh Kumar K.V <aneesh.kumar@xxxxxxxxxxxxxxxxxx>
>
>
> Hmm
>
>

....
...

>> + csize = PAGE_SIZE << compound_order(page);
>> + /*
>> + * uncharge from child and charge the parent. If we have
>> + * use_hierarchy set, we can never fail here. In-order to make
>> + * sure we don't get -ENOMEM on parent charge, we first uncharge
>> + * the child and then charge the parent.
>> + */
>> + if (parent->use_hierarchy) {
>
>
>> + res_counter_uncharge(&memcg->hugepage[idx], csize);
>> + if (!mem_cgroup_is_root(parent))
>> + ret = res_counter_charge(&parent->hugepage[idx],
>> + csize, &fail_res);
>
>
> Ah, why is !mem_cgroup_is_root() checked ? no res_counter update for
> root cgroup ?

My mistake. Earlier version of the patch series didn't charge/uncharge the root
cgroup during different operations. Later as per your review I updated
the charge/uncharge path to charge root cgroup. I missed to update this code.

>
> I think it's better to have res_counter_move_parent()...to do ops in atomic.
> (I'll post a patch for that for my purpose). OR, just ignore res->usage if
> parent->use_hierarchy == 1.
>
> uncharge->charge will have a race.



How about the below

diff --git a/mm/memcontrol.c b/mm/memcontrol.c
index 7b6e79a..5b4bc98 100644
--- a/mm/memcontrol.c
+++ b/mm/memcontrol.c
@@ -3351,24 +3351,24 @@ int mem_cgroup_move_hugetlb_parent(int idx, struct cgroup *cgroup,

csize = PAGE_SIZE << compound_order(page);
/*
- * uncharge from child and charge the parent. If we have
- * use_hierarchy set, we can never fail here. In-order to make
- * sure we don't get -ENOMEM on parent charge, we first uncharge
- * the child and then charge the parent.
+ * If we have use_hierarchy set we can never fail here. So instead of
+ * using res_counter_uncharge use the open-coded variant which just
+ * uncharge the child res_counter. The parent will retain the charge.
*/
if (parent->use_hierarchy) {
- res_counter_uncharge(&memcg->hugepage[idx], csize);
- if (!mem_cgroup_is_root(parent))
- ret = res_counter_charge(&parent->hugepage[idx],
- csize, &fail_res);
+ unsigned long flags;
+ struct res_counter *counter;
+
+ counter = &memcg->hugepage[idx];
+ spin_lock_irqsave(&counter->lock, flags);
+ res_counter_uncharge_locked(counter, csize);
+ spin_unlock_irqrestore(&counter->lock, flags);
} else {
- if (!mem_cgroup_is_root(parent)) {
- ret = res_counter_charge(&parent->hugepage[idx],
- csize, &fail_res);
- if (ret) {
- ret = -EBUSY;
- goto err_out;
- }
+ ret = res_counter_charge(&parent->hugepage[idx],
+ csize, &fail_res);
+ if (ret) {
+ ret = -EBUSY;
+ goto err_out;
}
res_counter_uncharge(&memcg->hugepage[idx], csize);
}


>
>> + } else {
>> + if (!mem_cgroup_is_root(parent)) {
>> + ret = res_counter_charge(&parent->hugepage[idx],
>> + csize, &fail_res);
>> + if (ret) {
>> + ret = -EBUSY;
>> + goto err_out;
>> + }
>> + }
>> + res_counter_uncharge(&memcg->hugepage[idx], csize);
>> + }
>
>
> Just a notice. Recently, Tejun changed failure of pre_destory() to show WARNING.
> Then, I'd like to move the usage to the root cgroup if use_hierarchy=0.
> Will it work for you ?

That should work.


>
>> + /*
>> + * caller should have done css_get
>> + */
>
>
> Could you explain meaning of this comment ?
>

inherited from mem_cgroup_move_account. I guess it means css cannot go
away at this point. We have done a css_get on the child. For a generic
move_account function may be the comment is needed. I guess in our case
the comment is redundant ?

-aneesh

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/