Re: [PATCH 2/2] arm64/mm: Enable color zero pages

From: Gavin Shan
Date: Thu Sep 17 2020 - 00:37:15 EST


Hi Robin,

On 9/16/20 8:46 PM, Robin Murphy wrote:
On 2020-09-16 09:28, Will Deacon wrote:
On Wed, Sep 16, 2020 at 01:25:23PM +1000, Gavin Shan wrote:
This enables color zero pages by allocating contigous page frames
for it. The number of pages for this is determined by L1 dCache
(or iCache) size, which is probbed from the hardware.

    * Add cache_total_size() to return L1 dCache (or iCache) size

    * Implement setup_zero_pages(), which is called after the page
      allocator begins to work, to allocate the contigous pages
      needed by color zero page.

    * Reworked ZERO_PAGE() and define __HAVE_COLOR_ZERO_PAGE.

Signed-off-by: Gavin Shan <gshan@xxxxxxxxxx>
---
  arch/arm64/include/asm/cache.h   | 22 ++++++++++++++++++++
  arch/arm64/include/asm/pgtable.h |  9 ++++++--
  arch/arm64/kernel/cacheinfo.c    | 34 +++++++++++++++++++++++++++++++
  arch/arm64/mm/init.c             | 35 ++++++++++++++++++++++++++++++++
  arch/arm64/mm/mmu.c              |  7 -------
  5 files changed, 98 insertions(+), 9 deletions(-)

diff --git a/arch/arm64/include/asm/cache.h b/arch/arm64/include/asm/cache.h
index a4d1b5f771f6..420e9dde2c51 100644
--- a/arch/arm64/include/asm/cache.h
+++ b/arch/arm64/include/asm/cache.h
@@ -39,6 +39,27 @@
  #define CLIDR_LOC(clidr)    (((clidr) >> CLIDR_LOC_SHIFT) & 0x7)
  #define CLIDR_LOUIS(clidr)    (((clidr) >> CLIDR_LOUIS_SHIFT) & 0x7)
+#define CSSELR_TND_SHIFT    4
+#define CSSELR_TND_MASK        (UL(1) << CSSELR_TND_SHIFT)
+#define CSSELR_LEVEL_SHIFT    1
+#define CSSELR_LEVEL_MASK    (UL(7) << CSSELR_LEVEL_SHIFT)
+#define CSSELR_IND_SHIFT    0
+#define CSSERL_IND_MASK        (UL(1) << CSSELR_IND_SHIFT)
+
+#define CCSIDR_64_LS_SHIFT    0
+#define CCSIDR_64_LS_MASK    (UL(7) << CCSIDR_64_LS_SHIFT)
+#define CCSIDR_64_ASSOC_SHIFT    3
+#define CCSIDR_64_ASSOC_MASK    (UL(0x1FFFFF) << CCSIDR_64_ASSOC_SHIFT)
+#define CCSIDR_64_SET_SHIFT    32
+#define CCSIDR_64_SET_MASK    (UL(0xFFFFFF) << CCSIDR_64_SET_SHIFT)
+
+#define CCSIDR_32_LS_SHIFT    0
+#define CCSIDR_32_LS_MASK    (UL(7) << CCSIDR_32_LS_SHIFT)
+#define CCSIDR_32_ASSOC_SHIFT    3
+#define CCSIDR_32_ASSOC_MASK    (UL(0x3FF) << CCSIDR_32_ASSOC_SHIFT)
+#define CCSIDR_32_SET_SHIFT    13
+#define CCSIDR_32_SET_MASK    (UL(0x7FFF) << CCSIDR_32_SET_SHIFT)

I don't think we should be inferring cache structure from these register
values. The Arm ARM helpfully says:

   | You cannot make any inference about the actual sizes of caches based
   | on these parameters.

so we need to take the topology information from elsewhere.

Yes, these represent parameters for the low-level cache maintenance by set/way instructions, and nothing more. There are definitely cases where they do not reflect the underlying cache structure (commit 793acf870ea3 is an obvious first call).


Thanks for your confirming. Yeah, the system registers aren't
reliable since the cache way/set are hard coded to 1/1. As I
suggested in another thread, ACPI (PPTT) table would be correct
place to get such kind of information.

[...]

Cheers,
Gavin