Re: [PATCH NET-NEXT 0/10] hardware time stamping with new fieldsin shinfo
From: Patrick Ohly
Date: Sat Feb 21 2009 - 04:16:13 EST
On Mon, 2009-02-16 at 07:44 +0000, Patrick Ohly wrote:
> On Mon, 2009-02-16 at 09:16 +0200, David Miller wrote:
> > That TX clone wrt. skb_orphan() issue will need a happier solution.
> >
> > Can you describe that problem in detail? Maybe someone can come
> > up with a way to avoid that stuff.
>
> Bouncing information about a sent packet back to the sender (in
> skb_tstamp_tx()) requires access to the socket via which the packet was
> sent (orig_skb->sk).
>
> This information must be available after the packet was handed to
> ops->ndo_start_xmit() in order to implement the TX time stamping
> software fallback for devices which don't implement hardware time
> stamping (not enabled and/or not implemented at all).
There's an even more fundamental problem: the TX software fallback in
net_dev_hard_start_xmit() has to access the skb after ndo_start_xmit()
has returned successfully.
During stress tests LAD colleagues at Intel ran into kernel panics that
pointed to net_dev_hard_start_xmit() as the culprit. Removing the
current TX software fallback code solved this. My apologies for not
realizing earlier that this code is violating skb ownership rules.
We propose to remove the faulty code and then solve it in a proper way
later on. Hardware time stamping will work fine without it. I will
forward the patch separately.
> I see several ways to solve this:
> * Always create another reference to skb->sk before
> ops->ndo_start_xmit() if TX time stamping is requested.
> Drawback: performance penalty for drivers which support TX time
> stamping or don't call skb_orphan().
[...]
> * Extend the driver API + another reference. Let drivers which
> implement TX time stamping (and thus know when to avoid
> skb_orphan()) signal that. dev_hard_start_xmit() then can avoid
> the "additional sk reference" thing for these drivers. Drawback:
> requires analyzing and potentially flagging all drivers to have
> a real effect (as for bnx2).
Something along these lines with an additional reference to the skb
might work.
--
Best Regards, Patrick Ohly
The content of this message is my personal opinion only and although
I am an employee of Intel, the statements I make here in no way
represent Intel's position on the issue, nor am I authorized to speak
on behalf of Intel on this matter.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/