[PATCHSET v4 0/7] Add support for epoll min_wait
From: Jens Axboe
Date: Thu Dec 01 2022 - 13:12:09 EST
Hi,
tldr - we saw a 6-7% CPU reduction with this patch. See patch 6 for
full numbers.
This adds support for EPOLL_CTL_MIN_WAIT, which allows setting a minimum
time that epoll_wait() should wait for events on a given epoll context.
Some justification and numbers are in patch 6, patches 1-5 are really
just prep patches or cleanups, and patch 7 adds the API to set min_wait.
I've decided against adding a syscall for this due to the following
reasons:
1) We, Meta, don't need the syscall variant.
2) It's unclear how best to do a clean syscall interface for this. We're
already out of arguments with the pwait/pwait2 variants.
With the splitting of the API into a separate patch, anyone who wishes
to have/use a syscall interface would be tasked with doing that
themselves.
No real changes in this release, just minor tweaks. Would appreciate
some review on this so we can get it moving forward. I obviously can't
start real deployments at Meta before I have the API upstream, or at
least queued for upstream. So we're currently stuck in limbo with this.
Also available here:
https://git.kernel.dk/cgit/linux-block/log/?h=epoll-min_ts
Since v3:
- Split the ctl addition into separate patch
- Gate setup of min_wait on !ewq.timed_out
- Add comment on calling ctl with wait == 0 is a no-op
--
Jens Axboe