[tip:sched/numa] sched/numa: Do statistics calculation using local variables only

From: tip-bot for Rik van Riel
Date: Tue Jan 28 2014 - 14:26:34 EST


Commit-ID: 35664fd41e1c8cc4f0b89f6a51db5af39ba50640
Gitweb: http://git.kernel.org/tip/35664fd41e1c8cc4f0b89f6a51db5af39ba50640
Author: Rik van Riel <riel@xxxxxxxxxx>
AuthorDate: Mon, 27 Jan 2014 17:03:46 -0500
Committer: Ingo Molnar <mingo@xxxxxxxxxx>
CommitDate: Tue, 28 Jan 2014 15:03:17 +0100

sched/numa: Do statistics calculation using local variables only

The current code in task_numa_placement calculates the difference
between the old and the new value, but also temporarily stores half
of the old value in the per-process variables.

The NUMA balancing code looks at those per-process variables, and
having other tasks temporarily see halved statistics could lead to
unwanted numa migrations. This can be avoided by doing all the math
in local variables.

This change also simplifies the code a little.

Signed-off-by: Rik van Riel <riel@xxxxxxxxxx>
Acked-by: Mel Gorman <mgorman@xxxxxxx>
Signed-off-by: Peter Zijlstra <peterz@xxxxxxxxxxxxx>
Cc: Chegu Vinod <chegu_vinod@xxxxxx>
Link: http://lkml.kernel.org/r/1390860228-21539-8-git-send-email-riel@xxxxxxxxxx
Signed-off-by: Ingo Molnar <mingo@xxxxxxxxxx>
---
kernel/sched/fair.c | 12 ++++--------
1 file changed, 4 insertions(+), 8 deletions(-)

diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 8fc3a82..4c44990 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -1513,12 +1513,9 @@ static void task_numa_placement(struct task_struct *p)
long diff, f_diff, f_weight;

i = task_faults_idx(nid, priv);
- diff = -p->numa_faults_memory[i];
- f_diff = -p->numa_faults_cpu[i];

/* Decay existing window, copy faults since last scan */
- p->numa_faults_memory[i] >>= 1;
- p->numa_faults_memory[i] += p->numa_faults_buffer_memory[i];
+ diff = p->numa_faults_buffer_memory[i] - p->numa_faults_memory[i] / 2;
fault_types[priv] += p->numa_faults_buffer_memory[i];
p->numa_faults_buffer_memory[i] = 0;

@@ -1532,13 +1529,12 @@ static void task_numa_placement(struct task_struct *p)
f_weight = div64_u64(runtime << 16, period + 1);
f_weight = (f_weight * p->numa_faults_buffer_cpu[i]) /
(total_faults + 1);
- p->numa_faults_cpu[i] >>= 1;
- p->numa_faults_cpu[i] += f_weight;
+ f_diff = f_weight - p->numa_faults_cpu[i] / 2;
p->numa_faults_buffer_cpu[i] = 0;

+ p->numa_faults_memory[i] += diff;
+ p->numa_faults_cpu[i] += f_diff;
faults += p->numa_faults_memory[i];
- diff += p->numa_faults_memory[i];
- f_diff += p->numa_faults_cpu[i];
p->total_numa_faults += diff;
if (p->numa_group) {
/* safe because we can only change our own group */
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/