Re: [PATCH v2 2/2] locking/qrwlock: Don't contend with readers when setting _QW_WAITING

From: Waiman Long
Date: Fri Jun 12 2015 - 18:58:36 EST

Next message: Bjorn Helgaas: "Re: [PATCH 1/1] PCI: X-Gene: Disable Configuration Request Retry Status for X-Gene v1 PCIe"
Previous message: Oleg Nesterov: "Re: [PATCH 06/12] x86/mm: Enable and use the arch_pgd_init_late() method"
In reply to: Ingo Molnar: "Re: [PATCH v2 2/2] locking/qrwlock: Don't contend with readers when setting _QW_WAITING"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 06/12/2015 04:45 AM, Ingo Molnar wrote:

* Waiman Long<waiman.long@xxxxxx> wrote:

Mind posting the microbenchmark?

I have attached the tool that I used for testing.

Thanks, that's interesting!

Btw., we could also do something like this in user-space, in tools/perf/bench/, we
have no 'perf bench locking' subcommand yet.

We already build and measure simple x86 kernel methods there such as memset() and
memcpy():

triton:~/tip> perf bench mem memcpy -r all
# Running 'mem/memcpy' benchmark:

Routine default (Default memcpy() provided by glibc)
# Copying 1MB Bytes ...

1.385195 GB/Sec
4.982462 GB/Sec (with prefault)

Routine x86-64-unrolled (unrolled memcpy() in arch/x86/lib/memcpy_64.S)
# Copying 1MB Bytes ...

1.627604 GB/Sec
5.336407 GB/Sec (with prefault)

Routine x86-64-movsq (movsq-based memcpy() in arch/x86/lib/memcpy_64.S)
# Copying 1MB Bytes ...

2.132233 GB/Sec
4.264465 GB/Sec (with prefault)

Routine x86-64-movsb (movsb-based memcpy() in arch/x86/lib/memcpy_64.S)
# Copying 1MB Bytes ...

1.490935 GB/Sec
7.128193 GB/Sec (with prefault)

Locking primitives would certainly be more complex build in user-space - but we
could shuffle things around in kernel headers as well to make it easier to test in
user-space.

That's how we can build lockdep in user-space for example, see tools/lib/lockdep.

Just a thought.

Thanks,

Ingo

I guess we can build user-space version of spinlock and rwlock, but we can't do that for sleeping lock like mutex and rwsem. Preemption in user space will also affect how those locking test will behave. Anyway, I will give it a thought on how to do that in perf bench when I have time.

Cheers,
Longman
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Bjorn Helgaas: "Re: [PATCH 1/1] PCI: X-Gene: Disable Configuration Request Retry Status for X-Gene v1 PCIe"
Previous message: Oleg Nesterov: "Re: [PATCH 06/12] x86/mm: Enable and use the arch_pgd_init_late() method"
In reply to: Ingo Molnar: "Re: [PATCH v2 2/2] locking/qrwlock: Don't contend with readers when setting _QW_WAITING"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]