Re: [PATCH] locking/osq: Drop the overload of osq lock
From: panxinhui
Date: Sun Jun 26 2016 - 11:11:46 EST
> 在 2016年6月26日,22:29,Boqun Feng <boqun.feng@xxxxxxxxx> 写道:
>
> On Sun, Jun 26, 2016 at 03:08:20PM +0800, panxinhui wrote:
> [snip]
>>> @@ -106,6 +109,9 @@ bool osq_lock(struct optimistic_spin_queue *lock)
>>> node->prev = prev;
>>> WRITE_ONCE(prev->next, node);
>>>
>>> + old = old - 1;
>>> + vpc = vcpu_preempt_count();
>>> +
>>> /*
>>> * Normally @prev is untouchable after the above store; because at that
>>> * moment unlock can proceed and wipe the node element from stack.
>>> @@ -118,8 +124,14 @@ bool osq_lock(struct optimistic_spin_queue *lock)
>>> while (!READ_ONCE(node->locked)) {
>>> /*
>>> * If we need to reschedule bail... so we can block.
>>> + * An over-committed guest with more vCPUs than pCPUs
>>> + * might fall in this loop and cause a huge overload.
>>> + * This is because vCPU A(prev) hold the osq lock and yield out,
>>> + * vCPU B(node) wait ->locked to be set, IOW, wait till
>>> + * vCPU A run and unlock the osq lock.
>>> + * NOTE that vCPU A and vCPU B might run on same physical cpu.
>>> */
>>> - if (need_resched())
>>> + if (need_resched() || vcpu_is_preempted(old) || vcpu_has_preempted(vpc))
>>> goto unqueue;
>>>
>>
>> the prev might change, so we need read node->prev every loop, then check vcpu preempted.
>>
>
> Right you are on the possibility of the prev's change, however, even if
> we reread node->prev, the prev is still not stable after we read, that
> is the prev can change after we read in the loop and before we check the
but the next loops we will detect the owner is preempted, right?
> vcpu preemption in the next loop, therefore whether the reread is
well,think the case below.
3 vcpu try to get the osq lock. IF you don’t re-read the prev.
A B C
preempted
detect A preempted, then break loops
and do unqueue
spin loops, because B (the prev ) is running,
preempted
detect B preempted then break loops and do queue.// looks like it’ too late to break the loops
running
SO we can get the rules,
IF the queue nodes looks like
vcpu A,B,C,D,E,F,G,….. more spinners
say, if vcpu B,F is preempted, then, we could know vcpu C,D,E should stop the spinning. right? and all other vcpu after F, say G,H,…should stop spinning too.
thanks
xinhui
> Regards,
> Boqun
>
>>> cpu_relax_lowlatency();
>>