Re: [PATCH v7 0/6] ksm: support tracking KSM-placed zero-pages

From: David Hildenbrand
Date: Mon Apr 17 2023 - 04:03:54 EST


On 13.04.23 07:46, yang.yang29@xxxxxxxxxx wrote:
From: xu xin <xu.xin16@xxxxxxxxxx>

The core idea of this patch set is to enable users to perceive the number
of any pages merged by KSM, regardless of whether use_zero_page switch has
been turned on, so that users can know how much free memory increase is
really due to their madvise(MERGEABLE) actions. But the problem is, when
enabling use_zero_pages, all empty pages will be merged with kernel zero
pages instead of with each other as use_zero_pages is disabled, and then
these zero-pages are no longer monitored by KSM.

The motivations to do this is seen at:
https://lore.kernel.org/lkml/202302100915227721315@xxxxxxxxxx/

In one word, we hope to implement the support for KSM-placed zero pages
tracking without affecting the feature of use_zero_pages, so that app
developer can also benefit from knowing the actual KSM profit by getting
KSM-placed zero pages to optimize applications eventually when
/sys/kernel/mm/ksm/use_zero_pages is enabled.


Thanks for the update!

the patch uses pte_mkdirty (related with architecture) to mark KSM-placed
zero pages. Some architecture(like sparc64) treat R/O dirty PTEs as
writable, which will break KSM pages state (wrprotect) and affect

With [1] that should be resolved and we should be able to enable it unconditionally.

Further, ideally this should get based on [2], such that we can include the zeropages in the ksm and per-mm profit calculation.

Last but not least, I realized that we also have to handle the case when khugepaged replaces a shared zeropage by a THP. I think that should be easy by adjusting the counters in the the is_zero_pfn() handling in mm/khugepaged.c:__collapse_huge_page_copy().

the KSM functionality. For safety, we restrict this feature only to the
tested and known-working architechtures (ARM, ARM64, and X86) fow now.

Change log
----------
v6->v7:
This is an all-newed version which is different from v6 which relys on KSM's
rmap_item. The patch series don't rely on rmap_item but pte_dirty, so the
general handling of tracking KSM-placed zero-pages is simplified a lot.

For safety, we restrict this feature only to the tested and known-working
architechtures (ARM, ARM64, and X86) fow now.

Yeah, with [1] this can be further simplified.


I'll be on vacation starting on Thursday for ~1.5 weeks, not sure if I get to review before that. But it's unlikely that we'll make the upcoming merge windows, so I guess we still have time (especially, for [1] and [2] to land)


[1] https://lkml.kernel.org/r/20230411142512.438404-4-david@xxxxxxxxxx
[2] https://lkml.kernel.org/r/20230413233115.1878303-1-shr@xxxxxxxxxxxx

--
Thanks,

David / dhildenb