Re: [RFC PATCH] powerpc/32: Switch VDSO to C implementation.
From: Christophe Leroy
Date: Sat Oct 26 2019 - 11:55:14 EST
Le 26/10/2019 Ã 15:55, Andy Lutomirski a ÃcritÂ:
On Tue, Oct 22, 2019 at 6:56 AM Christophe Leroy
<christophe.leroy@xxxxxx> wrote:
The performance is rather disappoiting. That's most likely all
calculation in the C implementation are based on 64 bits math and
converted to 32 bits at the very end. I guess C implementation should
use 32 bits math like the assembly VDSO does as of today.
gettimeofday: vdso: 750 nsec/call
gettimeofday: vdso: 1533 nsec/call
Small improvement (3%) with the proposed change:
gettimeofday: vdso: 1485 nsec/call
By inlining do_hres() I get the following:
gettimeofday: vdso: 1072 nsec/call
A perf report might be informative.
Not sure there is much to learn from perf report:
With the original RFC:
51.83% test_vdso [vdso] [.] do_hres
24.86% test_vdso [vdso] [.] __c_kernel_gettimeofday
7.33% test_vdso [vdso] [.] __kernel_gettimeofday
5.77% test_vdso test_vdso [.] main
1.55% test_vdso [kernel.kallsyms] [k] copy_page
0.67% test_vdso libc-2.23.so [.] _dl_addr
0.55% test_vdso ld-2.23.so [.] do_lookup_x
With vdso_calc_delta() optimised as suggested by Thomas + inlined do_hres():
68.00% test_vdso [vdso] [.] __c_kernel_gettimeofday
12.59% test_vdso [vdso] [.] __kernel_gettimeofday
6.22% test_vdso test_vdso [.] main
2.07% test_vdso [kernel.kallsyms] [k] copy_page
1.04% test_vdso ld-2.23.so [.] _dl_relocate_object
0.89% test_vdso ld-2.23.so [.] do_lookup_x
I've tried 'perf annotate', but I have not found how to tell perf to use
vdso32.so.dbg file for annotate [vdso].
Test app:
#include <dlfcn.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>
#include <sys/mman.h>
#include <sys/time.h>
static int (*gettimeofday_vdso)(struct timeval *tv, struct timezone *tz);
int main(int argc, char **argv)
{
void *handle = dlopen("linux-vdso32.so.1", RTLD_NOW | RTLD_GLOBAL);
struct timeval tv;
struct timezone tz;
int i;
(void)dlerror();
gettimeofday_vdso = dlsym(handle, "__kernel_gettimeofday");
if (dlerror())
gettimeofday_vdso = NULL;
for (i = 0; i < 100000; i++)
gettimeofday_vdso(&tv, &tz);
}
Christophe