Re: [PATCH v3] perf bench sched pipe: Add -G/--cgroups option
From: Arnaldo Carvalho de Melo
Date: Tue Oct 17 2023 - 08:16:06 EST
Em Mon, Oct 16, 2023 at 02:44:23PM -0700, Namhyung Kim escreveu:
> On Mon, Oct 16, 2023 at 1:35 PM Arnaldo Carvalho de Melo <acme@xxxxxxxxxx> wrote:
> > Em Mon, Oct 16, 2023 at 12:55:33PM -0300, Arnaldo Carvalho de Melo escreveu:
> > > Em Mon, Oct 16, 2023 at 12:51:52PM -0300, Arnaldo Carvalho de Melo escreveu:
> > > > Now back at testing with with cgroups.
> >
> > > Humm, even without the -G I get:
> >
> > > [root@five ~]# perf stat -e context-switches,cgroup-switches perf bench sched pipe -l 10000
> > > # Running 'sched/pipe' benchmark:
> > > # Executed 10000 pipe operations between two processes
> >
> > > Total time: 0.082 [sec]
> >
> > > 8.246400 usecs/op
> > > 121265 ops/sec
> >
> > > Performance counter stats for 'perf bench sched pipe -l 10000':
> >
> > > 20,002 context-switches
> > > 20,002 cgroup-switches
> >
> > Same number, but then I forgot to add the 'taskset -c 0' part of the
> > command line, if I have it:
> >
> > [root@five ~]# perf stat -e context-switches,cgroup-switches taskset -c 0 perf bench sched pipe -l 10000
> > # Running 'sched/pipe' benchmark:
> > # Executed 10000 pipe operations between two processes
> >
> > Total time: 0.072 [sec]
> >
> > 7.231500 usecs/op
> > 138283 ops/sec
> >
> > Performance counter stats for 'taskset -c 0 perf bench sched pipe -l 10000':
> >
> > 20,002 context-switches
> > 3 cgroup-switches
> >
> > 0.082855113 seconds time elapsed
> >
> > 0.007765000 seconds user
> > 0.074020000 seconds sys
> >
> >
> > [root@five ~]# perf stat -e context-switches,cgroup-switches taskset -c 0 perf bench sched pipe -l 10000 -G AAA,BBB
> > # Running 'sched/pipe' benchmark:
> > # Executed 10000 pipe operations between two processes
> >
> > Total time: 0.093 [sec]
> >
> > 9.341800 usecs/op
> > 107045 ops/sec
> >
> > Performance counter stats for 'taskset -c 0 perf bench sched pipe -l 10000 -G AAA,BBB':
> >
> > 20,004 context-switches
> > 20,003 cgroup-switches
> >
> > 0.103436330 seconds time elapsed
> >
> > 0.018109000 seconds user
> > 0.063058000 seconds sys
> >
> >
> > [root@five ~]#
> >
> > I.e. it works as in your results, but can you please spell out why that
> > 'taskset -c 0' is needed to get these results?
> >
> > I wasn't expecting the same number of cgroup-switches when not using
> > 'taskset -c 0' :-\
>
> Without taskset, each task is likely to run on different CPUs
> and other tasks (including idle) on that CPU would be in
> different cgroup so it'll create cgroup switches everytime
> if they run in AAA or BBB.
Sure, and the 'perf stat' is counting _just_ that workload and its
children (no -a).
Can you please add this to the cset commit log message? I.e. describe
the test setup thoroughly to help in reviewing and for us to quickly
understand what is being tested, the purpose of the test and how the
results match our expectations.
This will be specially helpful in the future, when we do bisects, try to
understand why changes were made, etc.
Even in the man page this information would be useful in helping users
to understand the purpose of the 'perf bench' -G option. So I think its
better to have it there instead of in the cset commit log message.
> With taskset, both sender and receiver would run on the
> same CPU. So it'd see the impact of cgroup switches
> with this option.
Thanks!
- Arnaldo