> My wild guess is that with the "faster" code, the K7 is avoiding loading
> cache lines just to write them out again, and is just writing tons of data.
> The PPC G4 - and perhaps even the G3 - performs a similar trick
> automatically, without special assembly...
X86 has done that since the K5 era.
No the main thing that the mmx copier does is to read and write in 64bit
wide chunks, and then more importantly to prefetch pending data. Thus instead
of stalling on reads there is a continual stream of data from the sdram hitting
the cpu ready for us to write it back out (and the write out is buffered too)
Alan
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
This archive was generated by hypermail 2b29 : Mon May 07 2001 - 21:00:22 EST