Re: [PATCH] mm: simplify find_vma_prev

From: Andrew Morton
Date: Fri Dec 09 2011 - 15:24:12 EST


On Fri, 9 Dec 2011 15:09:04 -0500
kosaki.motohiro@xxxxxxxxx wrote:

> From: KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx>
>
> commit 297c5eee37 (mm: make the vma list be doubly linked) added
> vm_prev member into vm_area_struct. Therefore we can simplify
> find_vma_prev() by using it. Also, this change help to imporove
> page fault performance becuase it has strong locality of reference.
>
> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx>
> ---
> mm/mmap.c | 34 ++++++----------------------------
> 1 files changed, 6 insertions(+), 28 deletions(-)
>
> diff --git a/mm/mmap.c b/mm/mmap.c
> index eae90af..955750c 100644
> --- a/mm/mmap.c
> +++ b/mm/mmap.c
> @@ -1605,37 +1605,15 @@ EXPORT_SYMBOL(find_vma);
>
> /* Same as find_vma, but also return a pointer to the previous VMA in *pprev. */
> struct vm_area_struct *
> -find_vma_prev(struct mm_struct *mm, unsigned long addr,
> - struct vm_area_struct **pprev)
> +find_vma_prev(struct mm_struct *mm, unsigned long addr, struct vm_area_struct **pprev)
> {
> - struct vm_area_struct *vma = NULL, *prev = NULL;
> - struct rb_node *rb_node;
> - if (!mm)
> - goto out;
> -
> - /* Guard against addr being lower than the first VMA */
> - vma = mm->mmap;
> -
> - /* Go through the RB tree quickly. */
> - rb_node = mm->mm_rb.rb_node;
> -
> - while (rb_node) {
> - struct vm_area_struct *vma_tmp;
> - vma_tmp = rb_entry(rb_node, struct vm_area_struct, vm_rb);
> + struct vm_area_struct *vma;
>
> - if (addr < vma_tmp->vm_end) {
> - rb_node = rb_node->rb_left;
> - } else {
> - prev = vma_tmp;
> - if (!prev->vm_next || (addr < prev->vm_next->vm_end))
> - break;
> - rb_node = rb_node->rb_right;
> - }
> - }
> + vma = find_vma(mm, addr);
> + if (vma)
> + *pprev = vma->vm_prev;
>
> -out:
> - *pprev = prev;
> - return prev ? prev->vm_next : vma;
> + return vma;
> }

This changes the (undocumented, naturally) interface in disturbing ways.

Currently, *pprev will always be written to. With this change, *pprev
will only be written to if find_vma_prev() returns non-NULL.

Looking through the code, this is mostly benign. But it will cause the
CONFIG_STACK_GROWSUP version of find_extend_vma() to use an
uninitialised stack slot in ways which surely will crash the kernel.

So please have a think about that and fix it up. And please add
documentation for find_vma_prev()'s interface so we don't break it next
time.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/