Re: [PATCH] arch, mm: introduce arch_tlb_gather_mmu_lazy (was: Re: [RESEND PATCH] mm, oom_reaper: gather each vma to prevent) leaking TLB entry

From: Minchan Kim
Date: Mon Nov 13 2017 - 20:46:15 EST

Next message: Guenter Roeck: "Re: [v8, 4/5] x86/xsave: Make XSAVE check the base CPUID features before enabling"
Previous message: Guenter Roeck: "Re: video: fbdev: Convert timers to use timer_setup()"
In reply to: Michal Hocko: "Re: [PATCH] arch, mm: introduce arch_tlb_gather_mmu_lazy (was: Re: [RESEND PATCH] mm, oom_reaper: gather each vma to prevent) leaking TLB entry"
Next in thread: Michal Hocko: "Re: [PATCH] arch, mm: introduce arch_tlb_gather_mmu_lazy (was: Re: [RESEND PATCH] mm, oom_reaper: gather each vma to prevent) leaking TLB entry"

On Mon, Nov 13, 2017 at 10:51:07AM +0100, Michal Hocko wrote:
> On Mon 13-11-17 09:28:33, Minchan Kim wrote:
> [...]
> > Thanks for the patch, Michal.
> > However, it would be nice to do it transparently without asking
> > new flags from users.
> >
> > When I read tlb_gather_mmu's description, fullmm is supposed to
> > be used only if there are no users and the range is the full address space.
> >
> > That means we could do it in the API itself, like this?
> >
> > void arch_tlb_gather_mmu(...)
> >
> > tlb->fullmm = !(start | (end + 1)) && atomic_read(&mm->mm_users) == 0;
>
> I do not have a strong opinion here. The optimization is quite subtle so
> calling it explicitly sounds like a less surprising behavior to me
> longterm. Note that I haven't checked all fullmm users.

Going by the description of tlb_gather_mmu and commit 4d6ddfa9242b,
setting fullmm to true should guarantee that there are *no users* of the
mm_struct, so I think my suggestion is not about optimization but about
keeping the semantic "there should be no one who can access the address
space when the entire address space is destroyed".
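
As a rough illustration of that condition, here is a user-space sketch in C. It is not kernel code: struct toy_mm and toy_fullmm() are hypothetical names, and the plain int counter stands in for atomic_read(&mm->mm_users). It only shows that the whole-range test and the "no users" test have to hold together before fullmm is set.

```c
#include <limits.h>
#include <stdbool.h>

/* Toy stand-in for the kernel's mm_struct; only the user count is modelled. */
struct toy_mm {
    int mm_users;
};

/*
 * Hypothetical helper: decide fullmm from the requested range and the
 * number of remaining users, following the condition quoted above.
 * start == 0 and end == ULONG_MAX (that is, -1) denote the whole address
 * space, because end + 1 wraps to 0 in unsigned arithmetic.
 */
static bool toy_fullmm(const struct toy_mm *mm,
                       unsigned long start, unsigned long end)
{
    bool whole_range = !(start | (end + 1));

    return whole_range && mm->mm_users == 0;
}
```

With this model, a full-range request on an mm that still has users yields fullmm == false, so a caller such as the oom reaper would not get the "nobody can touch this address space" behaviour by accident.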

If you want to be more explicit, we should add a description of
where lazy mode can be used. I think it would have to explain the
internals of some architectures for the user to understand it. I'm not
sure that is worth it when we can do it transparently.

I'm not strongly against your approach, either.

Anyway, I think Wang Nan's patch is already broken.
http://lkml.kernel.org/r/%3C20171107095453.179940-1-wangnan0@xxxxxxxxxx%3E

The reason is that unmap_page_range (i.e., zap_pte_range) can force a
TLB flush and free pages. However, with fullmm wrongly set, the
architecture code for the TLB flush does not flush at all, so other
threads can still write to the freed pages.
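
The failure mode described above can be sketched with a toy model in C. Everything here is hypothetical (struct toy_tlb, toy_arch_flush(), toy_zap_page()); it does not reproduce any real architecture's flush code. It only assumes, as stated above, an architecture flush that returns early when fullmm is set.

```c
#include <stdbool.h>

/* Toy model, not kernel code: one page and one cached translation. */
struct toy_tlb {
    bool fullmm;       /* caller claims the whole mm is being torn down */
    bool stale_entry;  /* another thread still has a cached translation */
    bool page_freed;   /* the backing page went back to the allocator */
};

/* Models an architecture flush that skips all work when fullmm is set. */
static void toy_arch_flush(struct toy_tlb *tlb)
{
    if (tlb->fullmm)
        return;        /* assumes nobody can access the address space */
    tlb->stale_entry = false;
}

/* Models a forced flush followed by freeing the page. */
static void toy_zap_page(struct toy_tlb *tlb)
{
    toy_arch_flush(tlb);
    tlb->page_freed = true;
}

/* True when a thread could still write to a page that is already freed. */
static bool toy_write_to_freed_page_possible(const struct toy_tlb *tlb)
{
    return tlb->page_freed && tlb->stale_entry;
}
```

In this model, calling toy_zap_page() with fullmm set while another thread still holds a translation leaves stale_entry true after the page is freed, which is the window in which a write to a freed page can happen.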

Thanks.
