Re: Fw: 2.4.20pre8aa2 failed assertions in ip_{conntrack,nat}_core.c

From: Bernd Petrovitsch (bernd@gams.at)
Date: Thu Oct 10 2002 - 17:05:10 EST


Martin Josefsson <gandalf@wlug.westbo.se> wrote:
>> [2.] Full description of the problem/report:
>> I'm running 2.4.20pre8aa2 with all the firewalling stuff enabled
>> on a Pentium 75 and get occasionally
>> ---- snip (lines broken to keep it readable) ----
>> Oct 10 20:00:04 fw kernel: ASSERT ip_conntrack_core.c:97
>> &ip_conntrack_lock readlocked
>> Oct 10 20:00:04 fw kernel: ASSERT ip_conntrack_core.c:1067
>> &ip_conntrack_lock not readlocked
>> Oct 10 20:00:04 fw kernel: ASSERT: ip_nat_core.c:841
>> &ip_conntrack_lock not readlocked
>> Oct 10 20:00:04 fw kernel: ASSERT ip_conntrack_core.c:97
>> &ip_conntrack_lock readlocked
>> Oct 10 20:00:04 fw kernel: ASSERT: ip_nat_core.c:841
>> &ip_conntrack_lock not readlocked
>> ---- snip ----
>> Otherwise everything is working fine.
>
>I've attached a small patch that fixes the ip_conntrack_core.c:97 and
>ip_nat_core.c:841 messages, These two messages aren't bugs in the
>locking, it's just the lock-debugging that can't handle "recursive"
>read-locks on the same lock and cpu correctly and thus reports an error
>even though everything is fine.

This explains why the box was actually usable.

>(the patch is made against 2.4.20-pre8-ac2 but it should apply.)

It does. Compiled and boot4ed the patched kernel.

>ASSERT ip_conntrack_core.c:1067 &ip_conntrack_lock not readlocked
>this might be a real locking-bug but it's only used in
>conntrack-handlers so unless you have two expectations happening at the
>same time on diffrent cpu's there's no problem. I'll look into it.

Thanks!

>> [6.] A small shell script or example program which triggers the
>> problem (if possible)
>> Simply http'ing or ftp'ing trigger it occasionally. The symptom is
>> that a particular (TCP-)connection "freezes".
>
>hmm, shouldn't happen. Can you describe it in more detail?

Not really, it just popped up a dozen times within 3 hours within 7
minutes (hmm, God knows what I was doing exactly at that time to
trigger it).
And I actually realized it only because some <tab> completion in
ncftp hang (of course it came out on the console but who is looking
there all the time since that's not the primary PC).
I'll see if it occurs again .....

        Bernd

-- 
Bernd Petrovitsch                              Email : bernd@gams.at
g.a.m.s gmbh                                  Fax : +43 1 205255-900
Prinz-Eugen-Straße 8                    A-1040 Vienna/Austria/Europe
                     LUGA : http://www.luga.at

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Tue Oct 15 2002 - 22:00:39 EST