Re: [PATCH] rtmutex: multiple candidate owners without unrelatedboosting

From: Lai Jiangshan
Date: Thu Dec 23 2010 - 02:23:43 EST


On 12/17/2010 04:55 AM, Darren Hart wrote:
> On 12/14/2010 12:07 PM, Thomas Gleixner wrote:
>> B1;2401;0cOn Tue, 14 Dec 2010, Lai Jiangshan wrote:
>>> Not advantage nor disadvantage
>>> 1) Even we support multiple candidate owners, we hardly cause "thundering herd"
>>> the number of candidate owners is likely 1.
>>
>> I'm not convinced about that function, see comments below
>>
>>> 2) two APIs are changed.
>>> rt_mutex_owner() will not return pending owner
>>> rt_mutex_next_owner() always return the top owner, it is a candidate owner.
>>> will not return NULL if we only have a pending owner.
>>> I have fixed the code that use these APIs.
>>
>> I think we want a separate function which can be used by futex code,
>> but that's a detail.
>
>
> We do use both functions in the futex code.
>
>
>> @@ -778,15 +778,6 @@ static int wake_futex_pi(u32 __user *uaddr, u32 uval, struct futex_q *this)
>> new_owner = rt_mutex_next_owner(&pi_state->pi_mutex);
>>
>> /*
>> - * This happens when we have stolen the lock and the original
>> - * pending owner did not enqueue itself back on the rt_mutex.
>> - * Thats not a tragedy. We know that way, that a lock waiter
>> - * is on the fly. We make the futex_q waiter the pending owner.
>> - */
>> - if (!new_owner)
>> - new_owner = this->task;
>> -
>
>
> If I'm reading Lai's statement above correctly, the change actually just
> eliminates the need for the !new_owner check, as in that case it will
> return current. Is this correct? And indeed, Lai's patch removes it.
> This looks correct to me.
>


After this patch applied, the topwaiter will not be deququed when the lock
is released(any waiter is dequeued only when it really get the lock or give up).
So the wait list will not be empty if someone is still waiting on.
In this code, this->task is waiting, so rt_mutex_next_owner() will not return NULL.

>
>> @@ -1647,18 +1638,20 @@ static int fixup_owner(u32 __user *uaddr, int fshared, struct futex_q *q,
>> /*
>> * pi_state is incorrect, some other task did a lock steal and
>> * we returned due to timeout or signal without taking the
>> - * rt_mutex. Too late. We can access the rt_mutex_owner without
>> - * locking, as the other task is now blocked on the hash bucket
>> - * lock. Fix the state up.
>
>
> How does this patch change this behavior? Removing the comment and
> adding the lock says that "the other task is now able to contend for the
> pi_mutex".

After this patch applied, it is possible that:
current task calls rt_mutex_trylock() and returns faill
&& the rt_mutex_owner() return NULL.

This happen when the lock is just released and the highest prio waiter
is wokenup, but it has not taken the lock yet. So we have to
call rt_mutex_next_owner() to get the highest prio waiter and set it to
q->pi_state->owner.

The top waiter is possible changed by priority boosting. so we need
to hold the wait_lock when access to it.

>
>
>> + * rt_mutex. Too late.
>> */
>> + raw_spin_lock(&q->pi_state->pi_mutex.wait_lock);
>> owner = rt_mutex_owner(&q->pi_state->pi_mutex);
>> + if (!owner)
>> + owner = rt_mutex_next_owner(&q->pi_state->pi_mutex);
>> + raw_spin_unlock(&q->pi_state->pi_mutex.wait_lock);
>> ret = fixup_pi_state_owner(uaddr, q, owner, fshared);
>> goto out;
>> }
>>
>> /*
>> * Paranoia check. If we did not take the lock, then we should not be
>> - * the owner, nor the pending owner, of the rt_mutex.
>> + * the owner of the rt_mutex.
>
>
> Is this changed because we could now be a "candidate owner" ?

It now is not possible "candidate owner" either.
but since it is a "Paranoia check", we don't need to check all things and
add unneed overhead.

>
>
>> */
>> if (rt_mutex_owner(&q->pi_state->pi_mutex) == current)
>> printk(KERN_ERR "fixup_owner: ret = %d pi-mutex: %p "
>
>
> The other uses of rt_mutex_owner in futex.c don't appear to be impacted
> by the change in API described above as they just compare the result to
> current (futex_lock_pi and futex_wait_requeue_pi).
>
>

Rignt, the other uses of rt_mutex_owner() do not be impacted.

Thanks a lot.
Lai.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/