(no subject)

From: Mala Anand (manand@us.ibm.com)
Date: Thu Sep 19 2002 - 17:56:42 EST

Andrew Morton wrote ...

>It seems that movsl works acceptably with all alignments on AMD
>hardware, although this needs to be checked with more recent machines.

>movsl is a (bad) loss on PII and PIII for all alignments except 8&8.
>Don't know about P4 - I can test that in a day or two.

>I expect that a minimal, 90% solution would be just:

>fancy_copy_to_user(dst, src, count)
>	if (arch_has_sane_movsl || ((dst|src) & 7) == 0)
>		movsl_copy_to_user(dst, src, count);
>	else
>		movl_copy_to_user(dst, src, count);


>#define fancy_copy_to_user copy_to_user

>and we really only need fancy_copy_to_user in a handful of
>places - the bulk copies in networking and filemap.c. For all
>the other call sites it's probably more important to keep the
>code footprint down than it is to squeeze the last few drops out
>of the copy speed.

>Mala Anand has done some work on this. See

><searches> Yes, I have a copy of Mala's patch here which works
>against 2.5.current. Mala's patch will cause quite an expansion
>of kernel size; we would need an implementation which did not
>use inlining. This work was discussed at OLS2002. See

I will move the code from uaccess.h (where it is inline) to usercopy.c
(as an out-of-line routine) and will post it soon. It is on my list of
things to do.
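
For reference, here is a rough user-space sketch of the dispatch Andrew
describes above, written as an ordinary out-of-line function rather than
an inline. It is only an illustration under stated assumptions:
fancy_copy(), use_movsl(), movsl_copy() and movl_copy() are hypothetical
names, memcpy() stands in for the real "rep movsl" and unrolled movl
loops, and the fault handling and return value of the real copy_to_user
are left out entirely.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical flag: nonzero when "rep movsl" is fast at any alignment. */
static int arch_has_sane_movsl = 0;

/* Stand-in for the string-instruction copy; memcpy keeps this portable. */
static void movsl_copy(void *dst, const void *src, size_t count)
{
	memcpy(dst, src, count);
}

/* Stand-in for the movl loop: copy 32-bit words, then the tail bytes. */
static void movl_copy(void *dst, const void *src, size_t count)
{
	unsigned char *d = dst;
	const unsigned char *s = src;

	while (count >= sizeof(uint32_t)) {
		uint32_t w;

		memcpy(&w, s, sizeof(w));	/* alignment-safe load */
		memcpy(d, &w, sizeof(w));	/* alignment-safe store */
		d += sizeof(w);
		s += sizeof(w);
		count -= sizeof(w);
	}
	while (count--)
		*d++ = *s++;
}

/* Returns 1 when the movsl path would be chosen, 0 otherwise. */
int use_movsl(const void *dst, const void *src)
{
	return arch_has_sane_movsl ||
	       ((((uintptr_t)dst | (uintptr_t)src) & 7) == 0);
}

/* Out-of-line: each call site pays for a call, not an inlined body. */
void fancy_copy(void *dst, const void *src, size_t count)
{
	if (use_movsl(dst, src))
		movsl_copy(dst, src, count);
	else
		movl_copy(dst, src, count);
}
```

Because the alignment test lives inside a single routine, the bulk-copy
call sites in networking and filemap.c would each add only a function
call to the kernel image.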


   Mala Anand
   IBM Linux Technology Center - Kernel Performance
   Phone:838-8088; Tie-line:678-8088

