Re: [patch] ipv4: don't warn about skb ack allocation failures

From: Eric Dumazet
Date: Wed Jun 17 2009 - 18:31:28 EST


David Rientjes a écrit :
> On Wed, 17 Jun 2009, Eric Dumazet wrote:
>
>>> Yes, if they are recoverable without any side effects. Otherwise, they
>>> will continue to emit page allocation failure messages which cause users
>>> to waste their time when they recognize a problem of an unknown
>>> seriousness level in both reporting the issue and looking for resulting
>>> corruption. The __GFP_NOWARN annotation suppresses such warnings for
>>> those very reasons.
>> Then why emit the warning at first place ?
>>
>> Once we patch all call sites to use GFP_ATOMIC | __GFP_NOWARN, I bet 99%
>> GFP_ATOMIC allocations in kernel will use it, so we go back to silent mode.
>>
>> If a GFP_ATOMIC call site *cannot* use __GFP_NOWARN, it will either :
>>
>> - call panic()
>> - crash with a nice stack trace because caller was not aware NULL could be
>> returned by kmalloc()
>>
>>
>> Maybe GFP_ATOMIC should include __GFP_NOWARN
>>
>> #define GFP_ATOMIC (__GFP_HIGH)
>> ->
>> #define GFP_ATOMIC (__GFP_HIGH | __GFP_NOWARN)
>>
>
> You must now mask off __GFP_NOWARN in the gfp flags for the allocation if
> you have a GFP_ATOMIC allocation that wants the page allocation failure
> warning messages. That message includes pertinent information with regard
> to the state of the VM that is otherwise unavailable by a BUG_ON() or NULL
> pointer dereference.
>
> For example, I could only diagnose Justin's failure as a harmless page
> allocator warning because I could identify its caller, the gfp mask of the
> allocation attempt, and the memory available. It would not have otherwise
> been possible to find that the system was actually oom.
>
> The general principle is that it is up to the caller to know whether an
> allocation failure is recoverable or not and not up to any VM
> implementation.

My point is that 99% of callers know allocation failures are
recoverable.

Instead of patching 10000 places in kernel, just patch 10 places where
allocations failures are not recoverable and call BUG() or whatever
lovely debugging aid (using __GFP_NOFAIL for example, I dont know)

GFP_NOWARN should be the default, and GFP_WARN_AND_FULL_EXPLANATION the exception.

In the past, only high order page allocations could trigger some trace,
I wonder why current kernels want to warn that an allocation failed,
since kmalloc(sz, GFP_ATOMIC) is allowed to return NULL and always was.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/