Impressive hard work, but a Pentium can already do:
movl p[0] to a (1/2 cycle)
adcl b to s (1/2 cycle)
movl p[1] to b (1/2 cycle)
adcl a to s (1/2 cycle)
etc.
to checksum 4 bytes per cycle (roughly), as the code in
arch/i386/lib/checksum.c does. Am I wrong?
The 686 version looks like it might max out the bus... (repeated adcls).
The really interesting case is copy-and-checksum. How fast can you do
that?
-- Jamie
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/faq.html