Re: [RFC 0/3] reduce latency of direct async compaction

From: Aaron Lu
Date: Thu Dec 03 2015 - 06:53:14 EST

Next message: Emil Velikov: "Re: [PATCH 9/9] drm/vc4: Add an interface for capturing the GPU state after a hang."
Previous message: Xunlei Pang: "Re: [PATCH] sched/core: Clear the root_domain cpumasks in init_rootdomain()"
In reply to: Aaron Lu: "Re: [RFC 0/3] reduce latency of direct async compaction"
Next in thread: Vlastimil Babka: "Re: [RFC 0/3] reduce latency of direct async compaction"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Thu, Dec 03, 2015 at 07:35:08PM +0800, Aaron Lu wrote:
> On Thu, Dec 03, 2015 at 10:38:50AM +0100, Vlastimil Babka wrote:
> > On 12/03/2015 10:25 AM, Aaron Lu wrote:
> > > On Thu, Dec 03, 2015 at 09:10:44AM +0100, Vlastimil Babka wrote:
> > >> Aaron, could you try this on your testcase?
> > >
> > > The test result is placed at:
> > > https://drive.google.com/file/d/0B49uX3igf4K4enBkdVFScXhFM0U
> > >
> > > For some reason, the patches made the performace worse. The base tree is
> > > today's Linus git 25364a9e54fb8296837061bf684b76d20eec01fb, and its
> > > performace is about 1000MB/s. After applying this patch series, the
> > > performace drops to 720MB/s.
> > >
> > > Please let me know if you need more information, thanks.
> >
> > Hm, compaction stats are at 0. The code in the patches isn't even running.
> > Can you provide the same data also for the base tree?
>
> My bad, I uploaded the wrong data :-/
> I uploaded again:
> https://drive.google.com/file/d/0B49uX3igf4K4UFI4TEQ3THYta0E
>
> And I just run the base tree with trace-cmd and found that its
> performace drops significantly(from 1000MB/s to 6xxMB/s), is it that
> trace-cmd will impact performace a lot? Any suggestions on how to run
> the test regarding trace-cmd? i.e. should I aways run usemem under
> trace-cmd or only when necessary?

I just run the test with the base tree and with this patch series
applied(head), I didn't use trace-cmd this time.

The throughput for base tree is 963MB/s while the head is 815MB/s, I
have attached pagetypeinfo/proc-vmstat/perf-profile for them.

Attachment: base.tar
Description: Unix tar archive

Attachment: head.tar
Description: Unix tar archive

Next message: Emil Velikov: "Re: [PATCH 9/9] drm/vc4: Add an interface for capturing the GPU state after a hang."
Previous message: Xunlei Pang: "Re: [PATCH] sched/core: Clear the root_domain cpumasks in init_rootdomain()"
In reply to: Aaron Lu: "Re: [RFC 0/3] reduce latency of direct async compaction"
Next in thread: Vlastimil Babka: "Re: [RFC 0/3] reduce latency of direct async compaction"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]