Re: Possible IRQ lock inversion from 2.6.29-Linus-03321-gbe0ea69(2.6.29-git)

From: Jonathan Corbet
Date: Fri Mar 27 2009 - 14:06:40 EST


On Fri, 27 Mar 2009 13:54:35 +0100
Bartlomiej Zolnierkiewicz <bzolnier@xxxxxxxxx> wrote:

> I remember looking a bit more closely into the issue and not seeing
> the problem with the locking (though I could have missed something):
>
> file->f_lock is never taken in hard-irq or soft-irq context and in
> the only place where file->f_lock is taken with fasync_lock hold we're
> protected against IRQs by write_lock_irq().

I do think that the warning is spurious at this time.

> [ Despite not being a problem now I think that changing spin_[un]lock()
> to *_irq() variants for file->f_lock could be (given that it really
> fixes the warning) more viable long-term solution than adding special
> lockdep handling (well, it could be that one day file->f_lock is used
> in soft-irq context and then the irq lock inversion issue will become
> a real one) and shouldn't incurr performance penalty since we hold it
> only for a very brief time. ]

We could do that. When I made the change I'd verified that there were
no users in IRQ context, and I couldn't really see why there should
be. I'd rather avoid adding all those IRQ disables if I can avoid it.

How about, instead, just reversing the order of lock acquisition in
fasync_helper()? That would increase the hold time for f_lock, but I
have a hard time seeing that being a real problem. I'm running with
the following now; all seems well. I'll send it up in a bit if nobody
gripes.

Thanks,

jon

diff --git a/fs/fcntl.c b/fs/fcntl.c
index d865ca6..b9c1a4b 100644
--- a/fs/fcntl.c
+++ b/fs/fcntl.c
@@ -531,6 +531,7 @@ int fasync_helper(int fd, struct file * filp, int on, struct fasync_struct **fap
if (!new)
return -ENOMEM;
}
+ spin_lock(&filp->f_lock); /* outside fasync_lock to keep lockdep happy */
write_lock_irq(&fasync_lock);
for (fp = fapp; (fa = *fp) != NULL; fp = &fa->fa_next) {
if (fa->fa_file == filp) {
@@ -555,14 +556,12 @@ int fasync_helper(int fd, struct file * filp, int on, struct fasync_struct **fap
result = 1;
}
out:
- /* Fix up FASYNC bit while still holding fasync_lock */
- spin_lock(&filp->f_lock);
if (on)
filp->f_flags |= FASYNC;
else
filp->f_flags &= ~FASYNC;
- spin_unlock(&filp->f_lock);
write_unlock_irq(&fasync_lock);
+ spin_unlock(&filp->f_lock);
return result;
}

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/