[PATCH v1 0/6] reduce workqueue and timer noise
From: Gilad Ben-Yossef
Date: Thu May 03 2012 - 10:57:06 EST
Timers and work queues both provide a useful way to defer work
for a later time or a different context. However, when the
timer or work item runs, it interrupts the CPU it is running on.
This is good if it is doing useful work, but it turns out this
is not always the case.
This patch set tries to locate and address code paths where work queues
items and timers are scheduled on CPUs where they have no useful work
to do and adapet them to be more selective.
This includes:
- Introducing helper function to schedule work queue items on a subset
of CPUs in the system.
- Use the helper function to schedule work items to attempt to drain
LRUs only on CPUs where there are LRU pages.
- Stop running the per cpu work item that does per-cpu pages reclaim
and VM statistics on CPUs that did not have any VM activity for the
last second (time frame configurable) and re-start it when VM
activity is detected.
- Fix a bug that prevented the timer code to to not program the
underlying HW timer to fire periodically when no future timer
event exists for a CPU
Changelog:
- The vmstat_update patch was changed to use a scapegoat CPU as
suggested by Christoph Lameter when the patch was previously
discussed in response to Frederic Weisbecker's adaptive tick
patch set.
Also included is a testing only patch, not intdented for mainline,
that turns the clock source watchdog into a config option which
I used while testing the timer code fix change.
The patch was boot tested on 32bit x86 in 8 way SMP and UP VMs.
For you reference, I keep a todo list for these and other noise sources
at: https://github.com/gby/linux/wiki
The git branched can be fetched from the git repo at
git@xxxxxxxxxx:gby/linux.git on the reduce_workqueue_and_timers_noise_v1
branch
Gilad Ben-Yossef (6):
timer: make __next_timer_interrupt explicit about no future event
workqueue: introduce schedule_on_each_cpu_mask
workqueue: introduce schedule_on_each_cpu_cond
mm: make lru_drain selective where it schedules work
mm: make vmstat_update periodic run conditional
x86: make clocksource watchdog configurable (not for mainline)
arch/x86/Kconfig | 9 +++-
include/linux/vmstat.h | 2 +-
include/linux/workqueue.h | 4 ++
kernel/time/clocksource.c | 2 +
kernel/timer.c | 31 ++++++++++-----
kernel/workqueue.c | 73 ++++++++++++++++++++++++++++++----
mm/swap.c | 25 +++++++++++-
mm/vmstat.c | 95 ++++++++++++++++++++++++++++++++++++++-------
8 files changed, 204 insertions(+), 37 deletions(-)
Signed-off-by: Gilad Ben-Yossef <gilad@xxxxxxxxxxxxx>
CC: Thomas Gleixner <tglx@xxxxxxxxxxxxx>
CC: Tejun Heo <tj@xxxxxxxxxx>
CC: John Stultz <johnstul@xxxxxxxxxx>
CC: Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx>
CC: KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx>
CC: Mel Gorman <mel@xxxxxxxxx>
CC: Mike Frysinger <vapier@xxxxxxxxxx>
CC: David Rientjes <rientjes@xxxxxxxxxx>
CC: Hugh Dickins <hughd@xxxxxxxxxx>
CC: Minchan Kim <minchan.kim@xxxxxxxxx>
CC: Konstantin Khlebnikov <khlebnikov@xxxxxxxxxx>
CC: Christoph Lameter <cl@xxxxxxxxx>
CC: Chris Metcalf <cmetcalf@xxxxxxxxxx>
CC: Hakan Akkan <hakanakkan@xxxxxxxxx>
CC: Max Krasnyansky <maxk@xxxxxxxxxxxx>
CC: Frederic Weisbecker <fweisbec@xxxxxxxxx>
CC: linux-kernel@xxxxxxxxxxxxxxx
CC: linux-mm@xxxxxxxxx
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/