Re: [PATCH v3 1/2] kasan: detect negative size in memory operation function

From: Walter Wu
Date: Mon Nov 11 2019 - 05:12:39 EST


On Mon, 2019-11-11 at 12:29 +0300, Andrey Ryabinin wrote:
>
> On 11/11/19 10:14 AM, Walter Wu wrote:
> > On Sat, 2019-11-09 at 01:31 +0300, Andrey Ryabinin wrote:
> >>
> >> On 11/4/19 5:05 AM, Walter Wu wrote:
> >>
> >>>
> >>> diff --git a/mm/kasan/common.c b/mm/kasan/common.c
> >>> index 6814d6d6a023..4ff67e2fd2db 100644
> >>> --- a/mm/kasan/common.c
> >>> +++ b/mm/kasan/common.c
> >>> @@ -99,10 +99,14 @@ bool __kasan_check_write(const volatile void *p, unsigned int size)
> >>> }
> >>> EXPORT_SYMBOL(__kasan_check_write);
> >>>
> >>> +extern bool report_enabled(void);
> >>> +
> >>> #undef memset
> >>> void *memset(void *addr, int c, size_t len)
> >>> {
> >>> - check_memory_region((unsigned long)addr, len, true, _RET_IP_);
> >>> + if (report_enabled() &&
> >>> + !check_memory_region((unsigned long)addr, len, true, _RET_IP_))
> >>> + return NULL;
> >>>
> >>> return __memset(addr, c, len);
> >>> }
> >>> @@ -110,8 +114,10 @@ void *memset(void *addr, int c, size_t len)
> >>> #undef memmove
> >>> void *memmove(void *dest, const void *src, size_t len)
> >>> {
> >>> - check_memory_region((unsigned long)src, len, false, _RET_IP_);
> >>> - check_memory_region((unsigned long)dest, len, true, _RET_IP_);
> >>> + if (report_enabled() &&
> >>> + (!check_memory_region((unsigned long)src, len, false, _RET_IP_) ||
> >>> + !check_memory_region((unsigned long)dest, len, true, _RET_IP_)))
> >>> + return NULL;
> >>>
> >>> return __memmove(dest, src, len);
> >>> }
> >>> @@ -119,8 +125,10 @@ void *memmove(void *dest, const void *src, size_t len)
> >>> #undef memcpy
> >>> void *memcpy(void *dest, const void *src, size_t len)
> >>> {
> >>> - check_memory_region((unsigned long)src, len, false, _RET_IP_);
> >>> - check_memory_region((unsigned long)dest, len, true, _RET_IP_);
> >>> + if (report_enabled() &&
> >>
> >> report_enabled() checks seems to be useless.
> >>
> >
> > Hi Andrey,
> >
> > If it doesn't have report_enable(), then it will have below the error.
> > We think it should be x86 shadow memory is invalid value before KASAN
> > initialized, it will have some misjudgments to do directly return when
> > it detects invalid shadow value in memset()/memcpy()/memmove(). So we
> > add report_enable() to avoid this happening. but we should only use the
> > condition "current->kasan_depth == 0" to determine if KASAN is
> > initialized. And we try it is pass at x86.
> >
>
> Ok, I see. It just means that check_memory_region() return incorrect result in early stages of boot.
> So, the right way to deal with this would be making kasan_report() to return bool ("false" if no report and "true" if reported)
> and propagate this return value up to check_memory_region().
>
This changes in v4.

>
> >>> diff --git a/mm/kasan/generic_report.c b/mm/kasan/generic_report.c
> >>> index 36c645939bc9..52a92c7db697 100644
> >>> --- a/mm/kasan/generic_report.c
> >>> +++ b/mm/kasan/generic_report.c
> >>> @@ -107,6 +107,24 @@ static const char *get_wild_bug_type(struct kasan_access_info *info)
> >>>
> >>> const char *get_bug_type(struct kasan_access_info *info)
> >>> {
> >>> + /*
> >>> + * If access_size is negative numbers, then it has three reasons
> >>> + * to be defined as heap-out-of-bounds bug type.
> >>> + * 1) Casting negative numbers to size_t would indeed turn up as
> >>> + * a large size_t and its value will be larger than ULONG_MAX/2,
> >>> + * so that this can qualify as out-of-bounds.
> >>> + * 2) If KASAN has new bug type and user-space passes negative size,
> >>> + * then there are duplicate reports. So don't produce new bug type
> >>> + * in order to prevent duplicate reports by some systems
> >>> + * (e.g. syzbot) to report the same bug twice.
> >>> + * 3) When size is negative numbers, it may be passed from user-space.
> >>> + * So we always print heap-out-of-bounds in order to prevent that
> >>> + * kernel-space and user-space have the same bug but have duplicate
> >>> + * reports.
> >>> + */
> >>
> >> Completely fail to understand 2) and 3). 2) talks something about *NOT* producing new bug
> >> type, but at the same time you code actually does that.
> >> 3) says something about user-space which have nothing to do with kasan.
> >>
> > about 2)
> > We originally think the heap-out-of-bounds is similar to
> > heap-buffer-overflow, maybe we should change the bug type to
> > heap-buffer-overflow.
>
> There is no "heap-buffer-overflow".
>
If I remember correctly, "heap-buffer-overflow" is one of existing bug
type in user-space? Or you want to expect to see an existing bug type in
kernel space?

> >
> > about 3)
> > Our idea is just to always print "heap-out-of-bounds" and don't
> > differentiate if the size come from user-space or not.
>
> Still doesn't make sence to me. KASAN doesn't differentiate if the size coming from user-space
> or not. It simply doesn't have any way of knowing from where is the size coming from.

Yes, it don't know where is coming from. so we originally always print
the existing bug type to indicate negative size, or we can remove 3)?