Re: [PATCH v2 4/6] Documentation: arm64: document support for the AMU extension
From: Suzuki K Poulose
Date: Thu Jan 30 2020 - 13:27:03 EST
Hi Ionela,
On Thu, Jan 30, 2020 at 04:45:42PM +0000, Ionela Voinescu wrote:
> Hi Suzuki,
>
> On Thursday 30 Jan 2020 at 15:04:27 (+0000), Suzuki Kuruppassery Poulose wrote:
> > Hi Ionela,
> >
> > On 18/12/2019 18:26, Ionela Voinescu wrote:
> > > The activity monitors extension is an optional extension introduced
> > > by the ARMv8.4 CPU architecture.
> > >
> > > Add initial documentation for the AMUv1 extension:
> > > - arm64/amu.txt: AMUv1 documentation
> > > - arm64/booting.txt: system registers initialisation
> > > - arm64/cpu-feature-registers.txt: visibility to userspace
> >
> > We have stopped adding "invisible" fields to the list. So, you
> > can drop the changes to cpu-feature-registers.txt.
> >
> > >
> > > Signed-off-by: Ionela Voinescu <ionela.voinescu@xxxxxxx>
> > > Cc: Catalin Marinas <catalin.marinas@xxxxxxx>
> > > Cc: Will Deacon <will@xxxxxxxxxx>
> > > Cc: Jonathan Corbet <corbet@xxxxxxx>
> > > ---
> > > Documentation/arm64/amu.rst | 107 ++++++++++++++++++
> > > Documentation/arm64/booting.rst | 14 +++
> > > Documentation/arm64/cpu-feature-registers.rst | 2 +
> > > Documentation/arm64/index.rst | 1 +
> > > 4 files changed, 124 insertions(+)
> > > create mode 100644 Documentation/arm64/amu.rst
> > >
> > > diff --git a/Documentation/arm64/amu.rst b/Documentation/arm64/amu.rst
> > > new file mode 100644
> > > index 000000000000..62a6635522e1
> > > --- /dev/null
> > > +++ b/Documentation/arm64/amu.rst
> > > @@ -0,0 +1,107 @@
> > > +-------------
> > > +
> > > +The kernel can safely run a mix of CPUs with and without support for the
> > > +activity monitors extension.
> >
> >
> > Therefore, when CONFIG_ARM64_AMU_EXTN is
> > > +selected we unconditionally enable the capability to allow any late CPU
> > > +(secondary or hotplugged) to detect and use the feature.
> > > +
> > > +When the feature is detected on a CPU, a per-CPU variable (amu_feat) is
> > > +set, but this does not guarantee the correct functionality of the
> > > +counters, only the presence of the extension.
> >
> > nit: I would rather omit the implementation details (esp variable names)
> > in the documentation. It may become a pain to keep this in sync with the
> > code changes. You could simply mention, "we keep track of the availability
> > of the feature" per CPU. If someone wants to figure out
> > how, they can always read the code.
> >
> > > +
> > > +Firmware (code running at higher exception levels, e.g. arm-tf) support is
> > > +needed to:
> > > + - Enable access for lower exception levels (EL2 and EL1) to the AMU
> > > + registers.
> > > + - Enable the counters. If not enabled these will read as 0.
> > > + - Save/restore the counters before/after the CPU is being put/brought up
> > > + from the 'off' power state.
> > > +
> > > +When using kernels that have this configuration enabled but boot with
> > > +broken firmware the user may experience panics or lockups when accessing
> > > +the counter registers. Even if these symptoms are not observed, the
> > > +values returned by the register reads might not correctly reflect reality.
> > > +Most commonly, the counters will read as 0, indicating that they are not
> > > +enabled. If proper support is not provided in firmware it's best to disable
> > > +CONFIG_ARM64_AMU_EXTN.
> >
> > For the sake of one kernel runs everywhere, do we need some other
> > mechanism to disable the AMU. e.g kernel parameter to disable amu
> > at runtime ?
> >
>
> The reason I've not added this is twofold:
> - Even if we add this, it should be in order to disable the use of the
> counters for a certain purpose, in this case frequency invariance.
> On its own AMU provides the counters but it does not mandate their
> use.
> - I could add something to disable the use of the core and cycle
> counters for frequency invariance at runtime, but I doubt that
> anyone would use it. Logically it makes sense to use the counters
> order to have a more accurate view of the performance that the CPUs
> are actually providing. Therefore, until anyone asks for this, I
> thought it's better to keep it simple and not add extra switches,
> until there is a use for them.
>
> Does it make sense?
The comment is about addressing someone who must run an "AMU" enabled
kernel ("one kernel") on a system with potentially "broken firmware",
where there is no option to use the system as you mention above,
the firmware could panic. How common is the "broken firmware" ?
Right now there is no way to ensure "firmware" is sane and if
someone detects that firmware is broken, there is no way to
disable the AMU if they are running a standard distro kernel.
A kernel parameter could prevent the AMU capability from
being detected on a broken system and thus make it usable
(without the AMU of course). Now, if the "broken firmware"
is extremely rare, we could simply ignore this case and
ignore the suggestion.
Suzuki
>
> P.S. I'll make all the other changes you've suggested in v3.
>
> Thank you,
> Ionela.
>
>
>
> > > diff --git a/Documentation/arm64/booting.rst b/Documentation/arm64/booting.rst
> > > index 5d78a6f5b0ae..a3f1a47b6f1c 100644
> > > --- a/Documentation/arm64/booting.rst
> > > +++ b/Documentation/arm64/booting.rst
> > > @@ -248,6 +248,20 @@ Before jumping into the kernel, the following conditions must be met:
> > > - HCR_EL2.APK (bit 40) must be initialised to 0b1
> > > - HCR_EL2.API (bit 41) must be initialised to 0b1
> > > + For CPUs with Activity Monitors Unit v1 (AMUv1) extension present:
> > > + - If EL3 is present:
> > > + CPTR_EL3.TAM (bit 30) must be initialised to 0b0
> > > + CPTR_EL2.TAM (bit 30) must be initialised to 0b0
> > > + AMCNTENSET0_EL0 must be initialised to 0b1111
> > > + AMCNTENSET1_EL0 must be initialised to a platform specific value
> > > + having 0b1 set for the corresponding bit for each of the auxiliary
> > > + counters present.
> > > + - If the kernel is entered at EL1:
> > > + AMCNTENSET0_EL0 must be initialised to 0b1111
> > > + AMCNTENSET1_EL0 must be initialised to a platform specific value
> > > + having 0b1 set for the corresponding bit for each of the auxiliary
> > > + counters present.
> > > +
> > > The requirements described above for CPU mode, caches, MMUs, architected
> > > timers, coherency and system registers apply to all CPUs. All CPUs must
> > > enter the kernel in the same exception level.
> > > diff --git a/Documentation/arm64/cpu-feature-registers.rst b/Documentation/arm64/cpu-feature-registers.rst
> > > index b6e44884e3ad..4770ae54032b 100644
> > > --- a/Documentation/arm64/cpu-feature-registers.rst
> > > +++ b/Documentation/arm64/cpu-feature-registers.rst
> > > @@ -150,6 +150,8 @@ infrastructure:
> > > +------------------------------+---------+---------+
> > > | DIT | [51-48] | y |
> > > +------------------------------+---------+---------+
> > > + | AMU | [47-44] | n |
> > > + +------------------------------+---------+---------+
> >
> > As mentioned above, please drop it.
> >
> >
> > Suzuki