Re: [Intel-wired-lan] i40e X722 RSS problem with NAT-Traversal IPsec packets

From: Alexander Duyck
Date: Thu May 02 2019 - 13:04:00 EST

Next message: Vidya Sagar: "[PATCH V4 2/2] PCI: dwc: Export APIs to support .remove() implementation"
Previous message: Vidya Sagar: "Re: [PATCH V2 1/2] PCI: dwc: Add API support to de-initialize host"
In reply to: Lennart Sorensen: "Re: [Intel-wired-lan] i40e X722 RSS problem with NAT-Traversal IPsec packets"
Next in thread: Lennart Sorensen: "Re: [Intel-wired-lan] i40e X722 RSS problem with NAT-Traversal IPsec packets"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Thu, May 2, 2019 at 8:11 AM Lennart Sorensen
<lsorense@xxxxxxxxxxxxxxxxxxx> wrote:
>
> On Wed, May 01, 2019 at 03:52:57PM -0700, Alexander Duyck wrote:
> > I'm not sure how RSS will do much for you here. Basically you only
> > have the source IP address as your only source of entropy when it
> > comes to RSS since the destination IP should always be the same if you
> > are performing a server role and terminating packets on the local
> > system and as far as the ports in your example you seem to only be
> > using 4500 for both the source and the destination.
>
> I have thousands of IPsec clients connecting. Simply treating them as
> normal UDP packets would work. The IP address is different, and often
> the port too.

Thanks for the clarification. I just wanted to verify that I know we
have had similar complaints in the past and it turns out those were
only using one set of IP addresses.

> > In your testing are you only looking at a point to point connection
> > between two systems, or do you have multiple systems accessing the
> > system you are testing? I ask as the only way this should do any
> > traffic spreading via RSS would be if the source IPs are different and
> > that would require multiple client systems accessing the server.
>
> I tried changing the client IP address and the RSS hash key. It never
> changed to another queue. Something is broken.

Okay, so if changing the RSS hash key has not effect then it is likely
not being used.

> > In the case of other encapsulation types over UDP, such as VXLAN, I
> > know that a hash value is stored in the UDP source port location
> > instead of the true source port number. This allows the RSS hashing to
> > occur on this extra information which would allow for a greater
> > diversity in hash results. Depending on how you are generating the ESP
> > encapsulation you might look at seeing if it would be possible to have
> > a hash on the inner data used as the UDP source port in the outgoing
> > packets. This would help to resolve this sort of issue.
>
> Well it works on every other network card except this one. Every other
> intel card in the past we have used had no problem doing this right.

The question is what is different about this card, and I don't have an
immediate answer so we would need to do some investigation.

> You want all the packets for a given ipsec tunnel to go to the same queue.
> That is not a problem here. What you don't want is every ipsec packet
> from everyone going to the same queue (always queue 0). So simply
> treating them as UDP packets with a source and destination IP and port
> would work perfectly fine. The X722 isn't doing that. It is always
> assigning a hash value of 0 to these packets.

You had stated in your earlier email that "Other UDP packets are
fine". Perhaps we need to do some further isolation to identify why
the ESP over UDP packets are not being hashed on while other UDP
packets are.

Would it be possible to provide a couple of raw Ethernet frames
instead of IP packets for us to examine? I noticed the two packets you
sent earlier didn't start until the IP header. One possibility would
be that if we had any extra outer headers or trailers added to the
packet that could possibly cause issues since that might either make
the packet not parsable or possibly flag it as some sort of length
error when the size of the packet doesn't match what is reported in
the headers.

One other thing we may want to look at doing is trying to identify the
particular part of the packets that might be causing the hash to not
be generated. One way to do that would be to use something like
netperf to generate packets and send them toward your test system.
Something like the command line below could be used to send packets
that should be similar to the ones you provided earlier:
netperf -H <target IP> -t UDP_STREAM -N -- -P 4500,4500 -m 132

If the packets generated by netperf were not hashed that would tell us
then it may be some sort of issue with how UDP packets are being
parsed, and from there we could narrow things down by modifying port
numbers and changing packet sizes. If that does get hashed then we
need to start looking outside of the IP/UDP header parsing for
possible issues since there is likely something else causing the
issue.

Thanks.

- Alex

Next message: Vidya Sagar: "[PATCH V4 2/2] PCI: dwc: Export APIs to support .remove() implementation"
Previous message: Vidya Sagar: "Re: [PATCH V2 1/2] PCI: dwc: Add API support to de-initialize host"
In reply to: Lennart Sorensen: "Re: [Intel-wired-lan] i40e X722 RSS problem with NAT-Traversal IPsec packets"
Next in thread: Lennart Sorensen: "Re: [Intel-wired-lan] i40e X722 RSS problem with NAT-Traversal IPsec packets"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]