On Tue, Mar 19, 2019 at 10:31 PM Subhra Mazumdar <subhra.mazumdar@xxxxxxxxxx>Is the core wide lock primarily responsible for the regression? I ran upto patch
wrote:
On 3/18/19 8:41 AM, Julien Desfossez wrote:On further investigation, we could see that the contention is mostly in the
way rq locks are taken. With this patchset, we lock the whole core if
cpu.tag is set for at least one cgroup. Due to this, __schedule() is more or
less serialized for the core and that attributes to the performance loss
that we are seeing. We also saw that newidle_balance() takes considerably
long time in load_balance() due to the rq spinlock contention. Do you think
it would help if the core-wide locking was only performed when absolutely
needed ?