Re: [PATCH v5 1/4] sched/fair: add util_est on top of PELT
From: Peter Zijlstra
Date: Wed Mar 07 2018 - 04:39:51 EST
On Tue, Mar 06, 2018 at 07:58:51PM +0100, Peter Zijlstra wrote:
> On Thu, Feb 22, 2018 at 05:01:50PM +0000, Patrick Bellasi wrote:
> > +static inline void util_est_enqueue(struct cfs_rq *cfs_rq,
> > + struct task_struct *p)
> > +{
> > + unsigned int enqueued;
> > +
> > + if (!sched_feat(UTIL_EST))
> > + return;
> > +
> > + /* Update root cfs_rq's estimated utilization */
> > + enqueued = READ_ONCE(cfs_rq->avg.util_est.enqueued);
> > + enqueued += _task_util_est(p);
> > + WRITE_ONCE(cfs_rq->avg.util_est.enqueued, enqueued);
> > +}
> It appears to me this isn't a stable situation and completely relies on
> the !nr_running case to recalibrate. If we ensure that doesn't happen
> for a significant while the sum can run-away, right?
>
> Should we put a max in enqueue to avoid this?
Thinking about this a bit more; would it make sense to adjust the
running sum/avg on migration? Something along the lines of:
util_avg = se->load_avg / (cfs_rq->load_avg + se->load_avg);
(which disregards cgroups), because that should more or less be the time
it ends up running, given the WFQ rule.
That way the disparity between tasks migrating into the CPU at u=1 and
them going to sleep at u<1 is much smaller and the above sum doesn't run
away nearly as wild (it still needs some upper bound though).