Re: [PATCH v2 4/6] Documentation: arm64: document support for the AMU extension
From: Ionela Voinescu
Date: Thu Jan 30 2020 - 11:45:46 EST
Hi Suzuki,
On Thursday 30 Jan 2020 at 15:04:27 (+0000), Suzuki Kuruppassery Poulose wrote:
> Hi Ionela,
>
> On 18/12/2019 18:26, Ionela Voinescu wrote:
> > The activity monitors extension is an optional extension introduced
> > by the ARMv8.4 CPU architecture.
> >
> > Add initial documentation for the AMUv1 extension:
> > - arm64/amu.txt: AMUv1 documentation
> > - arm64/booting.txt: system registers initialisation
> > - arm64/cpu-feature-registers.txt: visibility to userspace
>
> We have stopped adding "invisible" fields to the list. So, you
> can drop the changes to cpu-feature-registers.txt.
>
> >
> > Signed-off-by: Ionela Voinescu <ionela.voinescu@xxxxxxx>
> > Cc: Catalin Marinas <catalin.marinas@xxxxxxx>
> > Cc: Will Deacon <will@xxxxxxxxxx>
> > Cc: Jonathan Corbet <corbet@xxxxxxx>
> > ---
> > Documentation/arm64/amu.rst | 107 ++++++++++++++++++
> > Documentation/arm64/booting.rst | 14 +++
> > Documentation/arm64/cpu-feature-registers.rst | 2 +
> > Documentation/arm64/index.rst | 1 +
> > 4 files changed, 124 insertions(+)
> > create mode 100644 Documentation/arm64/amu.rst
> >
> > diff --git a/Documentation/arm64/amu.rst b/Documentation/arm64/amu.rst
> > new file mode 100644
> > index 000000000000..62a6635522e1
> > --- /dev/null
> > +++ b/Documentation/arm64/amu.rst
> > @@ -0,0 +1,107 @@
> > +=======================================================
> > +Activity Monitors Unit (AMU) extension in AArch64 Linux
> > +=======================================================
> > +
> > +Author: Ionela Voinescu <ionela.voinescu@xxxxxxx>
> > +
> > +Date: 2019-09-10
> > +
> > +This document briefly describes the provision of Activity Monitors Unit
> > +support in AArch64 Linux.
> > +
> > +
> > +Architecture overview
> > +---------------------
> > +
> > +The activity monitors extension is an optional extension introduced by the
> > +ARMv8.4 CPU architecture.
> > +
> > +The activity monitors unit, implemented in each CPU, provides performance
> > +counters intended for system management use. The AMU extension provides a
> > +system register interface to the counter registers and also supports an
> > +optional external memory-mapped interface.
> > +
> > +Version 1 of the Activity Monitors architecture implements a counter group
> > +of four fixed and architecturally defined 64-bit event counters.
> > + - CPU cycle counter: increments at the frequency of the CPU.
> > + - Constant counter: increments at the fixed frequency of the system
> > + clock.
> > + - Instructions retired: increments with every architecturally executed
> > + instruction.
> > + - Memory stall cycles: counts instruction dispatch stall cycles caused by
> > + misses in the last level cache within the clock domain.
> > +
> > +When in WFI or WFE these counters do not increment.
> > +
> > +The Activity Monitors architecture provides space for up to 16 architected
> > +event counters. Future versions of the architecture may use this space to
> > +implement additional architected event counters.
> > +
> > +Additionally, version 1 implements a counter group of up to 16 auxiliary
> > +64-bit event counters.
> > +
> > +On cold reset all counters reset to 0.
> > +
> > +
> > +Basic support
> > +-------------
> > +
> > +The kernel can safely run a mix of CPUs with and without support for the
> > +activity monitors extension.
>
>
> Therefore, when CONFIG_ARM64_AMU_EXTN is
> > +selected we unconditionally enable the capability to allow any late CPU
> > +(secondary or hotplugged) to detect and use the feature.
> > +
> > +When the feature is detected on a CPU, a per-CPU variable (amu_feat) is
> > +set, but this does not guarantee the correct functionality of the
> > +counters, only the presence of the extension.
>
> nit: I would rather omit the implementation details (esp variable names)
> in the documentation. It may become a pain to keep this in sync with the
> code changes. You could simply mention, "we keep track of the availability
> of the feature" per CPU. If someone wants to figure out
> how, they can always read the code.
>
> > +
> > +Firmware (code running at higher exception levels, e.g. arm-tf) support is
> > +needed to:
> > + - Enable access for lower exception levels (EL2 and EL1) to the AMU
> > + registers.
> > + - Enable the counters. If not enabled these will read as 0.
> > + - Save/restore the counters before/after the CPU is being put/brought up
> > + from the 'off' power state.
> > +
> > +When using kernels that have this configuration enabled but boot with
> > +broken firmware the user may experience panics or lockups when accessing
> > +the counter registers. Even if these symptoms are not observed, the
> > +values returned by the register reads might not correctly reflect reality.
> > +Most commonly, the counters will read as 0, indicating that they are not
> > +enabled. If proper support is not provided in firmware it's best to disable
> > +CONFIG_ARM64_AMU_EXTN.
>
> For the sake of one kernel runs everywhere, do we need some other
> mechanism to disable the AMU. e.g kernel parameter to disable amu
> at runtime ?
>
The reason I've not added this is twofold:
- Even if we add this, it should be in order to disable the use of the
counters for a certain purpose, in this case frequency invariance.
On its own AMU provides the counters but it does not mandate their
use.
- I could add something to disable the use of the core and cycle
counters for frequency invariance at runtime, but I doubt that
anyone would use it. Logically it makes sense to use the counters
order to have a more accurate view of the performance that the CPUs
are actually providing. Therefore, until anyone asks for this, I
thought it's better to keep it simple and not add extra switches,
until there is a use for them.
Does it make sense?
P.S. I'll make all the other changes you've suggested in v3.
Thank you,
Ionela.
> > diff --git a/Documentation/arm64/booting.rst b/Documentation/arm64/booting.rst
> > index 5d78a6f5b0ae..a3f1a47b6f1c 100644
> > --- a/Documentation/arm64/booting.rst
> > +++ b/Documentation/arm64/booting.rst
> > @@ -248,6 +248,20 @@ Before jumping into the kernel, the following conditions must be met:
> > - HCR_EL2.APK (bit 40) must be initialised to 0b1
> > - HCR_EL2.API (bit 41) must be initialised to 0b1
> > + For CPUs with Activity Monitors Unit v1 (AMUv1) extension present:
> > + - If EL3 is present:
> > + CPTR_EL3.TAM (bit 30) must be initialised to 0b0
> > + CPTR_EL2.TAM (bit 30) must be initialised to 0b0
> > + AMCNTENSET0_EL0 must be initialised to 0b1111
> > + AMCNTENSET1_EL0 must be initialised to a platform specific value
> > + having 0b1 set for the corresponding bit for each of the auxiliary
> > + counters present.
> > + - If the kernel is entered at EL1:
> > + AMCNTENSET0_EL0 must be initialised to 0b1111
> > + AMCNTENSET1_EL0 must be initialised to a platform specific value
> > + having 0b1 set for the corresponding bit for each of the auxiliary
> > + counters present.
> > +
> > The requirements described above for CPU mode, caches, MMUs, architected
> > timers, coherency and system registers apply to all CPUs. All CPUs must
> > enter the kernel in the same exception level.
> > diff --git a/Documentation/arm64/cpu-feature-registers.rst b/Documentation/arm64/cpu-feature-registers.rst
> > index b6e44884e3ad..4770ae54032b 100644
> > --- a/Documentation/arm64/cpu-feature-registers.rst
> > +++ b/Documentation/arm64/cpu-feature-registers.rst
> > @@ -150,6 +150,8 @@ infrastructure:
> > +------------------------------+---------+---------+
> > | DIT | [51-48] | y |
> > +------------------------------+---------+---------+
> > + | AMU | [47-44] | n |
> > + +------------------------------+---------+---------+
>
> As mentioned above, please drop it.
>
>
> Suzuki