Re: [PATCH] locking/osq: Drop the overload of osq lock
From: xinhui
Date: Mon Jun 27 2016 - 06:20:18 EST
On 2016å06æ27æ 15:55, Peter Zijlstra wrote:
On Sun, Jun 26, 2016 at 12:59:01PM +0800, panxinhui wrote:
å 2016å6æ26æï03:12ïPeter Zijlstra <peterz@xxxxxxxxxxxxx> åéï
On Sun, Jun 26, 2016 at 01:27:51AM +0800, panxinhui wrote:
by the way I still think mutex_unlock has a big overload too.
Do you mean overhead?
oh, maybe you are right.
mutex_unlock âs implementation uses inc_return variant on ppc, and
thatâs expensive. I am thinking of using cmpxchg instead.
That statement doesn't make any sense. PPC is an LL/SC arch, inc_return
and cmpxchg are the 'same' LL/SC loop.
This is a little optimize.
if there are lock waiters, the lockval is minus X, when we call unlock, it will inc the lockval, if it is <= 0, enter unlockslowpath to wakeup the waiters, and set lockval to 1 in the slowpath.
SO there is no need to inc lockval if it is already a minus number. therefore we can save one store or loads/stores in LL/SC loops
the base idea is from code below,
if (!atomic_read(&lk)//no need to call atomic_inc_return which is expensive.
atomic_inc_return(&lk))