Re: [RFC][PATCH v2 8/8] Don't deactivate many touched page

From: Minchan Kim
Date: Thu Dec 10 2009 - 21:11:52 EST


Hi, Kosaki.

On Thu, Dec 10, 2009 at 4:35 PM, KOSAKI Motohiro
<kosaki.motohiro@xxxxxxxxxxxxxx> wrote:
> Changelog
> Âo from v1
> Â - Fix comments.
> Â - Rename too_many_young_bit_found() with too_many_referenced()
> Â Â [as Rik's mention].
> Âo from andrea's original patch
> Â - Rebase topon my patches.
> Â - Use list_cut_position/list_splice_tail pair instead
> Â Â list_del/list_add to make pte scan fairness.
> Â - Only use max young threshold when soft_try is true.
> Â Â It avoid wrong OOM sideeffect.
> Â - Return SWAP_AGAIN instead successful result if max
> Â Â young threshold exceed. It prevent the pages without clear
> Â Â pte young bit will be deactivated wrongly.
> Â - Add to treat ksm page logic
>
> Many shared and frequently used page don't need deactivate and
> try_to_unamp(). It's pointless while VM pressure is low, the page
> might reactivate soon. it's only makes cpu wasting.
>
> Then, This patch makes to stop pte scan if wipe_page_reference()
> found lots young pte bit.
>
> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx>
> Reviewed-by: Rik van Riel <riel@xxxxxxxxxx>
> ---
> Âinclude/linux/rmap.h | Â 18 ++++++++++++++++++
> Âmm/ksm.c       |  Â4 ++++
> Âmm/rmap.c      Â|  19 +++++++++++++++++++
> Â3 files changed, 41 insertions(+), 0 deletions(-)
>
> diff --git a/include/linux/rmap.h b/include/linux/rmap.h
> index 499972e..ddf2578 100644
> --- a/include/linux/rmap.h
> +++ b/include/linux/rmap.h
> @@ -128,6 +128,24 @@ int wipe_page_reference_one(struct page *page,
> Â Â Â Â Â Â Â Â Â Â Â Â Â Âstruct page_reference_context *refctx,
> Â Â Â Â Â Â Â Â Â Â Â Â Â Âstruct vm_area_struct *vma, unsigned long address);
>
> +#define MAX_YOUNG_BIT_CLEARED 64
> +/*

This idea is good at embedded system which don't have access bit by hardware.

Such system emulates access bit as minor page fault AFAIK.
It means when VM clears young bit, kernel mark page table as non-permission
or something for refaulting.
So when next touch happens that address, kernel can do young bit set again.
It would be rather costly operation than one which have access bit by hardware.

So this idea is good in embedded system.
But 64 is rather big. many embedded system don't have many processes.
So I want to scale this number according to memory size like
inactive_raio for example.

Thanks for good idea and effort. :)

--
Kind regards,
Minchan Kim
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/