Re: Linux 2.4.1 network (socket) performance

From: Pavel Machek (pavel@suse.cz)
Date: Fri Dec 31 1999 - 20:38:17 EST


Hi!

> Hello, I am trying to find the reason for very, very poor network
> performance with sustained data transfers on Linux 2.4.1. I found
> a work-around, but don't think user-mode code should have to provide
> such work-arounds.
>
> In the following, with Linux 2.4.1, on a dedicated 100/Base
> link:
>
> s = socket connected to DISCARD (null-sink) server.
>
> while(len)
> {
> stat = write(s, buf, min(len, MTU));
> /* Yes, I do check for an error */
> buf += stat;
> len -= stat;
> }
>
> Data length is 0x00010000 bytes.
>
> MTU Average trans rate Fastest trans rate
> ---- ----------------- -----------------
> 65536 0.468 Mb/s 0.902 Mb/s
> 32768 0.684 Mb/s 0.813 Mb/s
> 16384 2.989 Mb/s 3.121 Mb/s
> 8192 5.211 Mb/s 6.160 Mb/s
> 4094 8.212 Mb/s 9.101 Mb/s
> 2048 8.561 Mb/s 9.280 Mb/s
> 1024 7.250 Mb/s 7.500 Mb/s
> 512 4.818 Mb/s 5.107 Mb/s
>
> As you can see, there is a maximum data length that can be
> handled with reasonable speed from a socket. Trying to find
> out what that was, I discovered that the best MTU was 3924.
> I don't know why. It shows:

Looks like that's page_size - epsilon.

> MTU Average trans rate Fastest trans rate
> ---- ----------------- -----------------
> 3924 8.920 Mb/s 9.31 Mb/s

But even this is *not* reasonable speed for 100MBit ethernet!

> If the user's data length is higher than this, there is a 1/100th
> of a second wait between packets. The larger the user's data length,
> the more the data gets chopped up into 1/100th of a second intervals.
>
> It looks as though user data that can't fit into two Ethernet packets
> is queued until the next time-slice on a 100 Hz system. This severely
> hurts sustained data performance. The performance with a single
> 64k data buffer is abysmal. If it gets chopped up into 2048 byte
> blocks in user-space, it's reasonable.
>
> Both machines are Dual Pentium 600 MHz machines with identical eepro100
> Ethernet boards. I substituted, LANCE (Hewlett Packard), and 3COM boards
> (3c59x) with essentially no change.

Strange. Do you have interrupts working okay? [I'm able to get 4Mbit with
ne2000 hooked on timer IRQ, so this is not totally stupid Question.]

> Does this point out a problem? Or should user-mode code be required

Definitely problem.

> to chop up data lengths to something more "reasonable" for the kernel?
> If so, how does the user know what "reasonable" is?

-- 
Philips Velo 1: 1"x4"x8", 300gram, 60, 12MB, 40bogomips, linux, mutt,
details at http://atrey.karlin.mff.cuni.cz/~pavel/velo/index.html.

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Wed Mar 07 2001 - 21:00:10 EST