Re: [patch 1/2] mm: page-writeback: fix dirty_balance_reservesubtraction from dirtyable memory
From: Michal Hocko
Date: Tue Jan 28 2014 - 08:49:35 EST
On Fri 24-01-14 17:03:03, Johannes Weiner wrote:
> The dirty_balance_reserve is an approximation of the fraction of free
> pages that the page allocator does not make available for page cache
> allocations. As a result, it has to be taken into account when
> calculating the amount of "dirtyable memory", the baseline to which
> dirty_background_ratio and dirty_ratio are applied.
>
> However, currently the reserve is subtracted from the sum of free and
> reclaimable pages, which is non-sensical and leads to erroneous
> results when the system is dominated by unreclaimable pages and the
> dirty_balance_reserve is bigger than free+reclaimable. In that case,
> at least the already allocated cache should be considered dirtyable.
>
> Fix the calculation by subtracting the reserve from the amount of free
> pages, then adding the reclaimable pages on top.
>
> Signed-off-by: Johannes Weiner <hannes@xxxxxxxxxxx>
Makes sense
Reviewed-by: Michal Hocko <mhocko@xxxxxxx>
> ---
> mm/page-writeback.c | 52 +++++++++++++++++++++++-----------------------------
> 1 file changed, 23 insertions(+), 29 deletions(-)
>
> diff --git a/mm/page-writeback.c b/mm/page-writeback.c
> index 63807583d8e8..79cf52b058a7 100644
> --- a/mm/page-writeback.c
> +++ b/mm/page-writeback.c
> @@ -191,6 +191,25 @@ static unsigned long writeout_period_time = 0;
> * global dirtyable memory first.
> */
>
> +/**
> + * zone_dirtyable_memory - number of dirtyable pages in a zone
> + * @zone: the zone
> + *
> + * Returns the zone's number of pages potentially available for dirty
> + * page cache. This is the base value for the per-zone dirty limits.
> + */
> +static unsigned long zone_dirtyable_memory(struct zone *zone)
> +{
> + unsigned long nr_pages;
> +
> + nr_pages = zone_page_state(zone, NR_FREE_PAGES);
> + nr_pages -= min(nr_pages, zone->dirty_balance_reserve);
> +
> + nr_pages += zone_reclaimable_pages(zone);
> +
> + return nr_pages;
> +}
> +
> static unsigned long highmem_dirtyable_memory(unsigned long total)
> {
> #ifdef CONFIG_HIGHMEM
> @@ -201,8 +220,7 @@ static unsigned long highmem_dirtyable_memory(unsigned long total)
> struct zone *z =
> &NODE_DATA(node)->node_zones[ZONE_HIGHMEM];
>
> - x += zone_page_state(z, NR_FREE_PAGES) +
> - zone_reclaimable_pages(z) - z->dirty_balance_reserve;
> + x += zone_dirtyable_memory(zone);
> }
> /*
> * Unreclaimable memory (kernel memory or anonymous memory
> @@ -238,9 +256,11 @@ static unsigned long global_dirtyable_memory(void)
> {
> unsigned long x;
>
> - x = global_page_state(NR_FREE_PAGES) + global_reclaimable_pages();
> + x = global_page_state(NR_FREE_PAGES);
> x -= min(x, dirty_balance_reserve);
>
> + x += global_reclaimable_pages();
> +
> if (!vm_highmem_is_dirtyable)
> x -= highmem_dirtyable_memory(x);
>
> @@ -289,32 +309,6 @@ void global_dirty_limits(unsigned long *pbackground, unsigned long *pdirty)
> }
>
> /**
> - * zone_dirtyable_memory - number of dirtyable pages in a zone
> - * @zone: the zone
> - *
> - * Returns the zone's number of pages potentially available for dirty
> - * page cache. This is the base value for the per-zone dirty limits.
> - */
> -static unsigned long zone_dirtyable_memory(struct zone *zone)
> -{
> - /*
> - * The effective global number of dirtyable pages may exclude
> - * highmem as a big-picture measure to keep the ratio between
> - * dirty memory and lowmem reasonable.
> - *
> - * But this function is purely about the individual zone and a
> - * highmem zone can hold its share of dirty pages, so we don't
> - * care about vm_highmem_is_dirtyable here.
> - */
> - unsigned long nr_pages = zone_page_state(zone, NR_FREE_PAGES) +
> - zone_reclaimable_pages(zone);
> -
> - /* don't allow this to underflow */
> - nr_pages -= min(nr_pages, zone->dirty_balance_reserve);
> - return nr_pages;
> -}
> -
> -/**
> * zone_dirty_limit - maximum number of dirty pages allowed in a zone
> * @zone: the zone
> *
> --
> 1.8.4.2
>
> --
> To unsubscribe, send a message with 'unsubscribe linux-mm' in
> the body to majordomo@xxxxxxxxxx For more info on Linux MM,
> see: http://www.linux-mm.org/ .
> Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>
--
Michal Hocko
SUSE Labs
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/