Re: [PATCH] jbd2: preserve original nofs flag during journal restart

From: Michal Hocko
Date: Wed May 17 2017 - 11:24:47 EST


On Wed 17-05-17 16:56:27, Jan Kara wrote:
> On Wed 17-05-17 16:13:57, Michal Hocko wrote:
> > On Wed 17-05-17 05:33:01, Tahsin Erdogan wrote:
> > > When a transaction starts, start_this_handle() saves current
> > > PF_MEMALLOC_NOFS value so that it can be restored at journal stop time.
> > > Journal restart is a special case that calls start_this_handle() without
> > > stopping the transaction. start_this_handle() isn't aware that the
> > > original value is already stored so it overwrites it with current value.
> > >
> > > For instance, a call sequence like below leaves PF_MEMALLOC_NOFS flag set
> > > at the end:
> > >
> > > jbd2_journal_start()
> > > jbd2__journal_restart()
> > > jbd2_journal_stop()
> > >
> > > Make jbd2__journal_restart() restore the original value before calling
> > > start_this_handle().
> > >
> > > Fixes: 81378da64de6 ("jbd2: mark the transaction context with the scope GFP_NOFS context")
> > > Signed-off-by: Tahsin Erdogan <tahsin@xxxxxxxxxx>
> > > ---
> > > fs/jbd2/transaction.c | 1 +
> > > 1 file changed, 1 insertion(+)
> > >
> > > diff --git a/fs/jbd2/transaction.c b/fs/jbd2/transaction.c
> > > index 9ee4832b6f8b..dfd6afebdfeb 100644
> > > --- a/fs/jbd2/transaction.c
> > > +++ b/fs/jbd2/transaction.c
> > > @@ -680,6 +680,7 @@ int jbd2__journal_restart(handle_t *handle, int nblocks, gfp_t gfp_mask)
> > >
> > > rwsem_release(&journal->j_trans_commit_map, 1, _THIS_IP_);
> > > handle->h_buffer_credits = nblocks;
> > > + memalloc_nofs_restore(handle->saved_alloc_context);
> > > ret = start_this_handle(journal, handle, gfp_mask);
> > > return ret;
> > > }
> >
> > I remember Jack has mentioned something about nested transaction back
> > then when reviewing the patch. But I cannot remember or find a pointer
> > to that email. I have a vague recollection that there is a reference
> > counting for those transactions.
> >
> > Anyway, Is this patch really correct? So let's say we are in
> > the transaction context already and then you disable the scope
> > NOFS protection, start_this_handle will allocate before it calls
> > memalloc_nofs_save and that would recurse to the filesystem. If
> > anything wouldn't it be better to simply call memalloc_nofs_save only if
> > we start a new transaction? I thought we were doing that already but the
> > code is so convoluted I have hard time to wrap my head around it.
>
> I was thinking about his as well but the fact is jbd2__journal_restart()
> actually does equivalent of jbd2_journal_stop() for the handle above the
> place where memalloc_nofs_restore() was added so in this sense we really
> miss memalloc_nofs_restore() there... So what Tahsin did makes sense.

OK, I will trust you ;). This deserves a comment then, I believe.

Thanks!
--
Michal Hocko
SUSE Labs