Re: [PATCH v2 1/3] nvme-hwmon: Return error on kzalloc failure

From: Keith Busch
Date: Tue Oct 04 2022 - 13:34:00 EST

On Tue, Oct 04, 2022 at 05:50:49PM +0300, Serge Semin wrote:
> >
> > This particular condition for hwmon is not something that prevents us from
> > making forward progress.
> If you consider the hwmon functionality as optional (AFAIU you are),
> then just ignore the return value no matter the reason.

That is not an option. This function does IO, and the controller may not be
usable on the other side of that, which means initialization must abort. We
can't just ignore errors; we just don't need to report errors that don't
prevent forward progress.

> If the problem
> caused the hwmon initialization process to fail turns to be critical
> it will be raised in some other place which is required for the NVME
> driver to work properly. Otherwise the hwmon module initialization may
> still cause the probe procedure to halt, which makes it not optional.
> That's what I meant when was saying about "the function and its
> caller semantics not implying that".
> >
> > > > The
> > > > driver can participate in memory reclaim, so failing on a low memory condition
> > > > can make matters worse.
> > >
> > > Yes it can, so can many other places in the driver utilizing kmalloc
> > > with just GFP_KERNEL flag passed including on the same path as the
> > > nvme_hwmon_init() execution. Kmalloc will make sure the reclaim is
> > > either finished or executed in background anyway in all cases.
> >
> > This path is in the first initialization before we've set up a namespace that
> > can be used as a reclaim destination.
> >
> > > Don't
> > > really see why memory allocation failure is less worse in this case
> > > than in many others in the same driver especially seeing as I said
> >
> > The other initialization kmalloc's are required to make forward progress toward
> > setting up a namespace. This one is not required.
> Anyway what you say seems still contradicting. First you said that the
> hwmon functionality was optional, but the only error being ignored was
> the no-memory one which was very rare and turned to be not ignored in
> the most of the other places.

> Second you got to accept the second
> patch of the series, which introduced a one more kmalloc followed
> right after the first one in the same function nvme_hwmon_init().

My comments on this patch were intended to be applied to all similiarly added