Re: 2.6.25-rc1: volanoMark 45% regression

From: Zhang, Yanmin
Date: Thu Feb 14 2008 - 02:43:40 EST


On Wed, 2008-02-13 at 17:37 +0530, Srivatsa Vaddagiri wrote:
> On Wed, Feb 13, 2008 at 03:15:16PM +0530, Balbir Singh wrote:
> > Zhang, Yanmin wrote:
> > > volanoMark has 45% regression with kernel 2.6.25-rc1 on my both 8-core
> > > stoakley and 16-core Tigerton.
> > >
> > > I used bisect to locate below patch.
> > >
> > > commit 58e2d4ca581167c2a079f4ee02be2f0bc52e8729
> > > Author: Srivatsa Vaddagiri <vatsa@xxxxxxxxxxxxxxxxxx>
> > > Date: Fri Jan 25 21:08:00 2008 +0100
> > >
> > > sched: group scheduling, change how cpu load is calculated
> > >
> > >
> > >
> > > hackbench has about 30% regression on 16-core tigerton, but has about 10% improvement
> > > on 8-core stoakley.
> > >
> > > In addition, tbench has about 6% regression on my 8-core stoakley and
> > > 25% regression on 16-core stoakley.
I verified tbench regression is not caused by the same patch. I am digging tbench now.

> Some other benchmarks, like netperf/aim7
> > > also have some regression. I will verify if they are all related to the
> > > patch.
> > >
> > > -yanmin
> >
> > Hi, Yamin,
> >
> > Thanks for reporting the issue? Any chance we could getthe Oprofile output for
> > the run?
I got oprofile data but it didn't show clear evidence.

When doing volanoMark testing, vmstat showed the good kernel's context switch
is about 1100000, but the bad kernel's context switch is 720000. Good kernel's
idle is about 1%, and bad kernel's idle is about 5%.

> The exact commandline and .config being used would also help.
I used some scripts to start volanoMark.

Netperf loop UDP-RR-1/512's 10% regression on 16-core tigerton is also related to the patch.
If I set CONFIG_FAIR_GROUP_SCHED=n, there is no the netperf regression. I bind the netserver
process to a core and bind the client to another core in another processor.

It's hard to debug into netperf regression if it's caused by scheduler.

>
> Yamin,
> I would also like to know against which previous version is this
> regression being compared with. Is it 2.6.24?
Yes.

> Did you have
> CONFIG_FAIR_USER_SCHED enabled in both cases?
Yes.

CONFIG_FAIR_GROUP_SCHED=y
CONFIG_FAIR_USER_SCHED=y

> It would also help to know if you
> see the same regression with FAIR_GROUP_SCHED turned off.
No regression if CONFIG_FAIR_GROUP_SCHED=n.

-yanmin



--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/