Re: [PATCH] lib/flex_proportions.c: Use abs() when percpu_counter is negative.

From: chi wu
Date: Tue May 18 2021 - 06:23:14 EST


Jan Kara <jack@xxxxxxx> 于2021年5月18日周二 下午4:59写道:
>
> On Tue 18-05-21 11:42:53, chi wu wrote:
> > Chi Wu <wuchi.zero@xxxxxxxxx> 于2021年5月17日周一 下午11:53写道:
> > >
> > > The value of percpu_counter_read() may become negative after
> > > running percpu_counter_sum() in fprop_reflect_period_percpu().
> > > The value of variable 'num' will be zero in fprop_fraction_percpu()
> > > when using percpu_counter_read_positive(), but if using the abs of
> > > percpu_counter_read() will be close to the correct value.
> > >
> >
> > I realized that I was wrong as follow:
> > (a) the decay rule is broken, the negative means the difference in
> > decay here.
> > (b) as the target event increasing, proportion of the event will
> > decrease to 0 firstly and then it will increase. The logic is bad.
> > 1. abs(-50) / abs(100) = 50% //+50 to 2
> > 2. abs(0) / abs(150) = 0 % //+50 to 3
> > 3. abs(50)/abs(200) = 25%
> >
> > Anyway, the percpu_counter_sum() had cost a lost performance,
> > may be we could get a little benefits from that. So could we add a
> > variable to stroe the decay value, we will get the value when
> > percpu_counter_read() is negative?
>
> The result of percpu_counter_read() is inherently inexact (but fast! ;). It
> can be upto number_of_cpus * counter_batch away from the real counter
> value. But do you observe any practical problems with this inaccuracy on
> your system? Sure, cache memory won't be split among devices exactly
> according to writeout proportion but that usually does not matter.
>
> Honza
>

Thanks, Got it.
Just try to optimize the fuse (with strictlimit feature)performance
issue: The writing thread will be paused and runs slowly, when the
proportion of fuse-bdi is 0.
The issue is normal,and one of reasons is the characteristics of
percpu_counter batch. Even the pages are writeout, we may be could not
get the real proportion value due to side effects of counter
performance. It's just a slight disappointment.

> > > ---
> > > lib/flex_proportions.c | 8 ++++----
> > > 1 file changed, 4 insertions(+), 4 deletions(-)
> > >
> > > diff --git a/lib/flex_proportions.c b/lib/flex_proportions.c
> > > index 451543937524..3ac79ca2c441 100644
> > > --- a/lib/flex_proportions.c
> > > +++ b/lib/flex_proportions.c
> > > @@ -147,7 +147,7 @@ void fprop_fraction_single(struct fprop_global *p,
> > > seq = read_seqcount_begin(&p->sequence);
> > > fprop_reflect_period_single(p, pl);
> > > num = pl->events;
> > > - den = percpu_counter_read_positive(&p->events);
> > > + den = abs(percpu_counter_read(&p->events));
> > > } while (read_seqcount_retry(&p->sequence, seq));
> > >
> > > /*
> > > @@ -209,7 +209,7 @@ static void fprop_reflect_period_percpu(struct fprop_global *p,
> > > val = percpu_counter_sum(&pl->events);
> > >
> > > percpu_counter_add_batch(&pl->events,
> > > - -val + (val >> (period-pl->period)), PROP_BATCH);
> > > + -val + (val >> (period - pl->period)), PROP_BATCH);
> > > } else
> > > percpu_counter_set(&pl->events, 0);
> > > pl->period = period;
> > > @@ -234,8 +234,8 @@ void fprop_fraction_percpu(struct fprop_global *p,
> > > do {
> > > seq = read_seqcount_begin(&p->sequence);
> > > fprop_reflect_period_percpu(p, pl);
> > > - num = percpu_counter_read_positive(&pl->events);
> > > - den = percpu_counter_read_positive(&p->events);
> > > + num = abs(percpu_counter_read(&pl->events));
> > > + den = abs(percpu_counter_read(&p->events));
> > > } while (read_seqcount_retry(&p->sequence, seq));
> > >
> > > /*
> > > --
> > > 2.17.1
> > >
> --
> Jan Kara <jack@xxxxxxxx>
> SUSE Labs, CR