Re: [PATCH 5/10] mm/fs: route MADV_REMOVE to FALLOC_FL_PUNCH_HOLE

From: Hugh Dickins
Date: Tue May 22 2012 - 11:12:16 EST


On Mon, 21 May 2012, john stultz wrote:
> On Sat, May 12, 2012 at 5:13 AM, Hugh Dickins <hughd@xxxxxxxxxx> wrote:
> > Now tmpfs supports hole-punching via fallocate(), switch madvise_remove()
> > to use do_fallocate() instead of vmtruncate_range(): which extends
> > madvise(,,MADV_REMOVE) support from tmpfs to ext4, ocfs2 and xfs.
> >
> > There is one more user of vmtruncate_range() in our tree, staging/android's
> > ashmem_shrink(): convert it to use do_fallocate() too (but if its unpinned
> > areas are already unmapped - I don't know - then it would do better to use
> > shmem_truncate_range() directly).
>
> I suspect shmem_truncate_range directly would be the right approach,
> but am not totally sure.
> Arve: Any thoughts?
>
> Hugh: Do you have a git tree with this set available somewhere? I was
> working on my own tmpfs support for FALLOC_FL_PUNCH_HOLE, along with
> my volatile range work, so I'd like to rebase on top of your work
> here.

I don't, no, just the patch series posted.

I had hoped by now to say that it's in linux-next (though it would be
at the daily rebased end, which probably doesn't help you), but not yet.

If shmem_truncate_range() is all you need, then that doesn't depend on
these patches at all - but I expect you are aiming to be more general.

Hugh

>
> thanks
> -john
>
>
> >
> > Based-on-patch-by: Cong Wang <amwang@xxxxxxxxxx>
> > Signed-off-by: Hugh Dickins <hughd@xxxxxxxxxx>
> > ---
> >  drivers/staging/android/ashmem.c |    8 +++++---
> >  mm/madvise.c                     |   15 +++++++--------
> >  2 files changed, 12 insertions(+), 11 deletions(-)
> >
> > --- 3045N.orig/drivers/staging/android/ashmem.c 2012-05-05 10:42:33.564056626 -0700
> > +++ 3045N/drivers/staging/android/ashmem.c      2012-05-05 10:46:25.692062478 -0700
> > @@ -19,6 +19,7 @@
> >  #include <linux/module.h>
> >  #include <linux/file.h>
> >  #include <linux/fs.h>
> > +#include <linux/falloc.h>
> >  #include <linux/miscdevice.h>
> >  #include <linux/security.h>
> >  #include <linux/mm.h>
> > @@ -363,11 +364,12 @@ static int ashmem_shrink(struct shrinker
> >
> >        mutex_lock(&ashmem_mutex);
> >        list_for_each_entry_safe(range, next, &ashmem_lru_list, lru) {
> > -               struct inode *inode = range->asma->file->f_dentry->d_inode;
> >                loff_t start = range->pgstart * PAGE_SIZE;
> > -               loff_t end = (range->pgend + 1) * PAGE_SIZE - 1;
> > +               loff_t end = (range->pgend + 1) * PAGE_SIZE;
> >
> > -               vmtruncate_range(inode, start, end);
> > +               do_fallocate(range->asma->file,
> > +                               FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE,
> > +                               start, end - start);
> >                range->purged = ASHMEM_WAS_PURGED;
> >                lru_del(range);
> >
> > --- 3045N.orig/mm/madvise.c     2012-05-05 10:42:33.572056784 -0700
> > +++ 3045N/mm/madvise.c  2012-05-05 10:46:25.692062478 -0700
> > @@ -11,8 +11,10 @@
> >  #include <linux/mempolicy.h>
> >  #include <linux/page-isolation.h>
> >  #include <linux/hugetlb.h>
> > +#include <linux/falloc.h>
> >  #include <linux/sched.h>
> >  #include <linux/ksm.h>
> > +#include <linux/fs.h>
> >
> >  /*
> >  * Any behaviour which results in changes to the vma->vm_flags needs to
> > @@ -200,8 +202,7 @@ static long madvise_remove(struct vm_are
> >                                struct vm_area_struct **prev,
> >                                unsigned long start, unsigned long end)
> >  {
> > -       struct address_space *mapping;
> > -       loff_t offset, endoff;
> > +       loff_t offset;
> >        int error;
> >
> >        *prev = NULL;   /* tell sys_madvise we drop mmap_sem */
> > @@ -217,16 +218,14 @@ static long madvise_remove(struct vm_are
> >        if ((vma->vm_flags & (VM_SHARED|VM_WRITE)) != (VM_SHARED|VM_WRITE))
> >                return -EACCES;
> >
> > -       mapping = vma->vm_file->f_mapping;
> > -
> >        offset = (loff_t)(start - vma->vm_start)
> >                        + ((loff_t)vma->vm_pgoff << PAGE_SHIFT);
> > -       endoff = (loff_t)(end - vma->vm_start - 1)
> > -                       + ((loff_t)vma->vm_pgoff << PAGE_SHIFT);
> >
> > -       /* vmtruncate_range needs to take i_mutex */
> > +       /* filesystem's fallocate may need to take i_mutex */
> >        up_read(&current->mm->mmap_sem);
> > -       error = vmtruncate_range(mapping->host, offset, endoff);
> > +       error = do_fallocate(vma->vm_file,
> > +                               FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE,
> > +                               offset, end - start);
> >        down_read(&current->mm->mmap_sem);
> >        return error;
> >  }
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> > the body of a message to majordomo@xxxxxxxxxxxxxxx
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> > Please read the FAQ at  http://www.tux.org/lkml/
>