Re: WARNING: at net/sched/sch_generic.c:219dev_watchdog+0xfe/0x17e() with tg3 network

From: Matt Carlson
Date: Thu Nov 20 2008 - 13:43:34 EST


On Wed, Nov 19, 2008 at 09:37:47PM -0800, Willy Tarreau wrote:
> Hello Matt,
>
> On Wed, Nov 19, 2008 at 07:11:01PM -0800, Matt Carlson wrote:
> > > My tg3 is just PCI-based, no PCIe in this beast. I can send more
> > > info when I turn it on. I don't think that the tg3 driver changes
> > > often, so most likely digging through the changes between 2.6.25
> > > and 2.6.27 should not take much time. I just don't know if I can
> > > reliably reproduce the issue right now.
> >
> > Willy, this problem description sounds a little different than the
> > original report. There was a bug where the driver would wait 2.5
> > seconds for a firmware event that would never get serviced. That
> > fix has already landed in the 2.6.27 tree though.
> >
> > I glanced over the changes between 2.6.25 and 2.6.27.6. There are quite
> > a few changes related to phylib support for an upcoming device, but not
> > so many changes that affect older devices. What device are you using?
>
> I think it's a 5704, but I will check this this morning when I'm at
> work. I also want to try to reliably reproduce the problem. After
> that, I see only 29 patches which differ from the two kernels, it
> should be pretty easy to spot the culprit.

O.K. Let me know how it goes.

Could we clarify something though? In your previous email, you said you
didn't have any problems on pre-2.6.25 kernels. I'm wondering if the
problem goes back further than 2.6.25. From 2.6.24 to 2.6.25, there was a
significant set of flow control changes that took place. I suspect that
might have something to do with Roger's problem, and it may have
something to do with your problem too. So, is it true that 2.6.25 works
for you? If not, can you try disabling flow control and see if that
helps?

> If you think it's a different bug than original report (though I
> really thought it was the same), I'll post my findings in a separate
> thread not to mix investigations.

Right now, I think it is premature to say, so let's continue as if they
were the same problem. We can always break it out into a separate
discussion later.


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/