[PATCH 0/2] optimise ctx's refs grabbing in io_uring

From: Pavel Begunkov
Date: Tue Dec 17 2019 - 17:29:17 EST


Optimise percpu_ref_tryget() by not calling it for each request, but
batching it. This gave a measurable performance boost, though with
a bit unconventional(/unrealistic?) workload.

There is still one step to add, which is not implemented with
patchset, and will amortise the effect calls to io_uring_enter().

rebased on top of for-5.6/io_uring

Pavel Begunkov (2):
pcpu_ref: add percpu_ref_tryget_many()
io_uring: batch getting pcpu references

fs/io_uring.c | 11 ++++++++---
include/linux/percpu-refcount.h | 24 ++++++++++++++++++++----
2 files changed, 28 insertions(+), 7 deletions(-)

--
2.24.0