Re: [PATCH] mm/rmap: remove unnecessary page_table_lock

From: Yajun Deng
Date: Tue Apr 23 2024 - 04:35:18 EST


April 23, 2024 at 4:18 PM, "David Hildenbrand" <david@xxxxxxxxxx> wrote:



>
> On 23.04.24 09:53, Yajun Deng wrote:
>
> >
> > April 22, 2024 at 7:24 PM, "David Hildenbrand" <david@xxxxxxxxxx> wrote:
> >
> > > On 22.04.24 12:52, Yajun Deng wrote:
> > >
> > > > page_table_lock is a lock for page tables, and we don't change the page
> > > > table in __anon_vma_prepare(). As we can see, it works well in
> > > > anon_vma_clone(). They do the same operation.
> > >
> > > We are reusing mm->page_table_lock to serialize, not the *actual* low-level page table locks that really protect PTEs.
> > >
> > > With that locking gone, there would be nothing protecting vma->anon_vma.
> > >
> > > Note that anon_vma_clone() is likely called with the mmap_lock held in write mode, which is not the case for __anon_vma_prepare() ...
> > >
> >
> > Yes, anon_vma_clone() is called with the mmap_lock held. I added mmap_assert_write_locked(dst->vm_mm) to prove it.
> >
> > I also added mmap_assert_write_locked(vma->vm_mm) to __anon_vma_prepare(), and it shows that __anon_vma_prepare() is called with the mmap_lock held as well.
> >
>
> Make sure you actually have lockdep built in and enabled.
>

This is my config.
CONFIG_LOCKDEP=n
CONFIG_DEBUG_VM=y

I did another test.
I put mmap_assert_write_locked(mm) before 'set_bit(MMF_OOM_SKIP, &mm->flags)' in mmap.c, which is outside the lock.
It crashes on boot, so I think mmap_assert_write_locked() works.


> __anon_vma_prepare() is for example called from do_anonymous_page() where we might only hold the mmap_lock in read mode (or not at all IIRC with VMA in read mode).
>
> --
> Cheers,
>
> David / dhildenb
>