Re: [PATCH 1/2] perf tools: allow building for ARM

From: Jamie Iles
Date: Fri Dec 11 2009 - 06:00:41 EST


On Fri, Dec 11, 2009 at 11:38:48AM +0100, Ingo Molnar wrote:
>
> * Jamie Iles <jamie.iles@xxxxxxxxxxxx> wrote:
>
> > On Fri, Dec 11, 2009 at 11:23:16AM +0100, Ingo Molnar wrote:
> > > cpu_relax() looks fine, but rmb() seems not to match the one that can be
> > > found in arch/arm/:
> > >
> > > arch/arm/include/asm/system.h:#define rmb() dmb()
> > > arch/arm/include/asm/system.h:#define rmb() do { if (arch_is_coherent()) dmb(); else barrier(); } while (0)
> > > arch/arm/include/asm/system.h:#define smp_rmb() rmb()
> > >
> > > arch/arm/include/asm/system.h:#define dmb() __asm__ __volatile__ ("dmb" : : : "memory")
> > > arch/arm/include/asm/system.h:#define dmb() __asm__ __volatile__ ("mcr p15, 0, %0, c7, c10, 5" \
> > > arch/arm/include/asm/system.h:#define dmb() __asm__ __volatile__ ("" : : : "memory")
> > > arch/arm/include/asm/system.h:#define dmb() __asm__ __volatile__ ("" : : : "memory")
> >
> > The implementation of the barriers depend on the CPU arch revision
> > which is defined in the kernel config. As the perf tools don't use the
> > kernel config, we don't know here what arch revision we're building
> > for. Perhaps we need a LINUX_ARM_ARCH parameter when building for ARM
> > so we can pick the correct one.
>
> rmb() is used in two places in perf:
>
> tools/perf/builtin-record.c: rmb();
> tools/perf/builtin-top.c: rmb();
>
> to interact with the shared kernel/user ring-buffer. Getting a barrier
> wrong there may cause hickups in recording.
>
> Could you tell me a bit more about this ARM instruction - is the 'DMB'
> instruction used on all SMP ARM cores? Can it be used unconditionally,
> or is the instruction undefined on certain versions? To get the ball
> rolling we could use it unconditionally in the initial patch, but this
> needs to be solved i suspect.
There are a few cases we need to deal with:
- v7 SMP: DMB instruction
- v6 SMP: MCR coprocessor instruction
- v5 and earlier no instructions for barriers.

Looking at the TRM for a v7 core (cortex A9) the MCR instruction that v6 uses
is deprecated but still present. I suspect we could use this to cover the v6
and v7 cores but we wouldn't be able to do soft perf events on v5 or earlier
(which don't have hardware counters).

Jamie
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/