Re: [PATCH 1/2] mm, memcg: Try charging a page before setting page up to date

From: Mel Gorman
Date: Wed May 20 2015 - 12:15:31 EST


On Wed, May 20, 2015 at 11:29:23AM -0400, Johannes Weiner wrote:
> On Wed, May 20, 2015 at 01:50:44PM +0100, Mel Gorman wrote:
> > Historically memcg overhead was high even if memcg was unused. This has
> > improved a lot but it still showed up in a profile summary as being a
> > problem.
> >
> > /usr/src/linux-4.0-vanilla/mm/memcontrol.c 6.6441 395842
> > mem_cgroup_try_charge 2.950% 175781
> > __mem_cgroup_count_vm_event 1.431% 85239
> > mem_cgroup_page_lruvec 0.456% 27156
> > mem_cgroup_commit_charge 0.392% 23342
> > uncharge_list 0.323% 19256
> > mem_cgroup_update_lru_size 0.278% 16538
> > memcg_check_events 0.216% 12858
> > mem_cgroup_charge_statistics.isra.22 0.188% 11172
> > try_charge 0.150% 8928
> > commit_charge 0.141% 8388
> > get_mem_cgroup_from_mm 0.121% 7184
> >
> > That is showing that 6.64% of system CPU cycles were in memcontrol.c and
> > dominated by mem_cgroup_try_charge. The annotation shows that the bulk of
> > the cost was checking PageSwapCache which is expected to be cache hot but is
> > very expensive. The problem appears to be that __SetPageUptodate is called
> > just before the check which is a write barrier. It is required to make sure
> > struct page and page data is written before the PTE is updated and the data
> > visible to userspace. memcg charging does not require or need the barrier
> > but gets unfairly hit with the cost so this patch attempts the charging
> > before the barrier. Aside from the accidental cost to memcg there is the
> > added benefit that the barrier is avoided if the page cannot be charged.
> > When applied the relevant profile summary is as follows.
> >
> > /usr/src/linux-4.0-chargefirst-v2r1/mm/memcontrol.c 3.7907 223277
> > __mem_cgroup_count_vm_event 1.143% 67312
>
> Out of curiosity, I'm still consistently reading this function at
> around 0.7%. Are you profiling this single-threadedly or for the
> entire run? For profiling 80 single-threaded iterations, I get:
>

Single-threaded. The mmtests benchmark in question supports gathering one
profile per thread count so it's just the 1 thread profile I included in
the changelog. The CPU in question is a i7-3770

--
Mel Gorman
SUSE Labs
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/