Re: [PATCH RFC] [X86] performance improvement for memcpy_64.S by fast string.

From: Pavel Machek
Date: Thu Nov 12 2009 - 18:04:24 EST



> Ling, if you are interested, could you send a user-space test-app to
> this thread that everyone could just compile and run on various older
> boxes, to gather a performance profile of hand-coded versus string ops
> performance?
>
> ( And I think we can make a judgement based on cache-hot performance
> alone - if anything, the string ops will perform comparatively better in
> cache-cold scenarios, so the cache-hot numbers would be a conservative
> estimate. )

Ugh, really? I'd expect cache-cold performance not to be helped at all
(it is limited by memory bandwidth), and you'll get a slowdown from
additional i-cache misses...
Pavel
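
For readers following the thread, here is a minimal sketch of the kind of
user-space test-app requested above. It is not the program that was posted
to the list. All names (copy_loop, copy_string_ops, seconds_per_copy,
copy_is_correct, COPY_BENCH_MAIN) are hypothetical, and the hand-coded loop
is a plain C stand-in rather than the unrolled assembly in memcpy_64.S. It
measures only the cache-hot case, on the assumption of a 4 KB buffer that
stays in L1, so it says nothing about the cache-cold and i-cache effects
questioned above.

```c
/* Sketch of a cache-hot copy micro-benchmark; every name is hypothetical. */
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>
#include <time.h>

typedef void (*copy_fn)(void *, const void *, size_t);

/* Hand-coded variant: move 8 bytes per iteration, then the byte tail. */
static void copy_loop(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    while (n >= 8) {
        uint64_t w;
        memcpy(&w, s, 8);          /* alignment-safe 8-byte load */
        memcpy(d, &w, 8);          /* alignment-safe 8-byte store */
        d += 8; s += 8; n -= 8;
    }
    while (n--)
        *d++ = *s++;
}

/* String-op variant: rep movsq for whole quadwords, rep movsb for the tail.
 * Assumes the direction flag is clear, as the x86-64 ABI requires. */
static void copy_string_ops(void *dst, const void *src, size_t n)
{
#if defined(__x86_64__) && defined(__GNUC__)
    size_t qwords = n / 8;
    size_t tail = n % 8;

    __asm__ volatile("rep movsq"
                     : "+D"(dst), "+S"(src), "+c"(qwords) : : "memory");
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(tail) : : "memory");
#else
    memcpy(dst, src, n);           /* fallback so the sketch builds elsewhere */
#endif
}

/* Average CPU seconds per call of fn over iters back-to-back copies. */
static double seconds_per_copy(copy_fn fn, void *dst, const void *src,
                               size_t n, int iters)
{
    clock_t start = clock();

    for (int i = 0; i < iters; i++)
        fn(dst, src, n);
    return (double)(clock() - start) / CLOCKS_PER_SEC / iters;
}

/* Returns 1 if fn copies n bytes (n <= 4096) of a test pattern correctly. */
static int copy_is_correct(copy_fn fn, size_t n)
{
    static unsigned char src[4096], dst[4096];

    if (n > sizeof(src))
        return 0;
    for (size_t i = 0; i < n; i++)
        src[i] = (unsigned char)(i * 31 + 7);
    memset(dst, 0, sizeof(dst));
    fn(dst, src, n);
    return memcmp(dst, src, n) == 0;
}

#ifdef COPY_BENCH_MAIN
int main(void)
{
    static unsigned char src[4096], dst[4096];  /* small, so it stays hot */

    assert(copy_is_correct(copy_loop, 1003));
    assert(copy_is_correct(copy_string_ops, 1003));
    printf("loop:       %.1f ns/copy\n",
           1e9 * seconds_per_copy(copy_loop, dst, src, sizeof(src), 1000000));
    printf("string ops: %.1f ns/copy\n",
           1e9 * seconds_per_copy(copy_string_ops, dst, src, sizeof(src), 1000000));
    return 0;
}
#endif
```

Build with -DCOPY_BENCH_MAIN to get the timing program. An optimizing
compiler may recognize copy_loop as a memcpy and replace it with a library
call, which would defeat the comparison; with GCC,
-fno-tree-loop-distribute-patterns is one way to try to prevent that, and
checking the generated assembly is the only way to be sure.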

--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html