Re: [RFC][PATCH 7/8] perf/x86: Optimize x86_schedule_events()

From: Stephane Eranian
Date: Wed Mar 20 2019 - 15:30:41 EST


On Wed, Mar 20, 2019 at 6:11 AM Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
>
> On Tue, Mar 19, 2019 at 04:55:16PM -0700, Stephane Eranian wrote:
> > On Thu, Mar 14, 2019 at 6:11 AM Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
> > > @@ -858,8 +864,20 @@ int x86_schedule_events(struct cpu_hw_ev
> > > x86_pmu.start_scheduling(cpuc);
> > >
> > > for (i = 0, wmin = X86_PMC_IDX_MAX, wmax = 0; i < n; i++) {
> > > - c = x86_pmu.get_event_constraints(cpuc, i, cpuc->event_list[i]);
> > > - cpuc->event_constraint[i] = c;
> > > + c = cpuc->event_constraint[i];
> > > +
> > > + /*
> > > + * Request constraints for new events; or for those events that
> > > + * have a dynamic constraint due to the HT workaround -- for
> > > + * those the constraint can change due to scheduling activity
> > > + * on the other sibling.
> > > + */
> > > + if (!c || ((c->flags & PERF_X86_EVENT_DYNAMIC) &&
> > > + is_ht_workaround_active(cpuc))) {
> > > +
> > > + c = x86_pmu.get_event_constraints(cpuc, i, cpuc->event_list[i]);
> > > + cpuc->event_constraint[i] = c;
> > > + }
>
> > On this one, I think there may be a problem with events with
> > shared_regs constraints.
>
> Hmm...
>
> > Constraint is dynamic as it depends on other events which share the
> > same MSR, yet it is not marked as DYNAMIC.
>
> it returns &emptyconstraint or a table constraint, depending on register
> state.
>
> > But this may be okay because these other events are all on the same
> > CPU and thus scheduled during the same ctx_sched_in(). Yet with the
> > swapping in intel_alt_er(), we need to double-check that we cannot
> > reuse a constraint which could be stale.
>
> > I believe this is okay, just double-check.
>
> I'm not sure I see a problem.
>
> So if we're the first event on a shared register, we claim the register
> and scheduling succeeds (barring other constraints).
>
> If we're the second event on a shared register (and have conflicting
> register state), we get the empty constraint. This _will_ cause
> scheduling to fail. We'll not cache the state and punt it back to the
> core code.
>
> So no future scheduling pass will come to see a shared reg constraint
> that could've changed.
>
> Now, there is indeed the intel_alt_er() thing, which slightly
> complicates this; then suppose we schedule an event on RSP0, another on
> RSP1, then remove the RSP0 one. Even in that case, the remaining RSP1
> event will not change its constraint, since intel_fixup_er() rewrites
> the event to be a native RSP1 event.
>
> So that too reduces to the prior case.
>
I came the same conclusion later yesterday. I think this is okay.

>
> That said; I have simplified the above condition to:
>
> @@ -858,8 +858,17 @@ int x86_schedule_events(struct cpu_hw_ev
> x86_pmu.start_scheduling(cpuc);
>
> for (i = 0, wmin = X86_PMC_IDX_MAX, wmax = 0; i < n; i++) {
> - c = x86_pmu.get_event_constraints(cpuc, i, cpuc->event_list[i]);
> - cpuc->event_constraint[i] = c;
> + c = cpuc->event_constraint[i];
> +
> + /*
> + * Request constraints for new events; or for those events that
> + * have a dynamic constraint -- for those the constraint can
> + * change due to external factors (sibling state, allow_tfa).
> + */
> + if (!c || (c->flags & PERF_X86_EVENT_DYNAMIC)) {
> + c = x86_pmu.get_event_constraints(cpuc, i, cpuc->event_list[i]);
> + cpuc->event_constraint[i] = c;
> + }
>
Right now DYNAMIC is only casued by the HT bug, but it could change later on but
the logic here would remain. If HT workaround is disabled, then no
evnt is tagged with
DYNAMIC.

> wmin = min(wmin, c->weight);
> wmax = max(wmax, c->weight);
>
> Because any dynamic event can change from one moment to the next.
I agree.