Re: [v5.4 stable] arm: stm32: Regression observed on "no-map" reserved memory region

From: Florian Fainelli
Date: Tue Apr 20 2021 - 12:34:05 EST




On 4/20/2021 9:10 AM, Ard Biesheuvel wrote:
> On Tue, 20 Apr 2021 at 17:54, Rob Herring <robh+dt@xxxxxxxxxx> wrote:
>>
>> On Tue, Apr 20, 2021 at 10:12 AM Alexandre TORGUE
>> <alexandre.torgue@xxxxxxxxxxx> wrote:
>>>
>>>
>>>
>>> On 4/20/21 4:45 PM, Rob Herring wrote:
>>>> On Tue, Apr 20, 2021 at 9:03 AM Alexandre TORGUE
>>>> <alexandre.torgue@xxxxxxxxxxx> wrote:
>>>>>
>>>>> Hi,
>>>>
>>>> Greg or Sasha won't know what to do with this. Not sure who follows
>>>> the stable list either. Quentin sent the patch, but is not the author.
>>>> Given the patch in question is about consistency between EFI memory
>>>> map boot and DT memory map boot, copying EFI knowledgeable folks would
>>>> help (Ard B for starters).
>>>
>>> Ok thanks for the tips. I add Ard in the loop.
>>
>> Sigh. If it was only Ard I was suggesting I would have done that
>> myself. Now everyone on the patch in question and relevant lists are
>> Cc'ed.
>>
>
> Thanks for the cc.
>
>>>
>>> Ard, let me know if other people have to be directly added or if I have
>>> to resend to another mailing list.
>>>
>>> thanks
>>> alex
>>>
>>>>
>>>>>
>>>>> Since v5.4.102 I observe a regression on stm32mp1 platform: "no-map"
>>>>> reserved-memory regions are no more "reserved" and make part of the
>>>>> kernel System RAM. This causes allocation failure for devices which try
>>>>> to take a reserved-memory region.
>>>>>
>>>>> It has been introduced by the following path:
>>>>>
>>>>> "fdt: Properly handle "no-map" field in the memory region
>>>>> [ Upstream commit 86588296acbfb1591e92ba60221e95677ecadb43 ]"
>>>>> which replace memblock_remove by memblock_mark_nomap in no-map case.
>>>>>
>
> Why was this backported? It doesn't look like a bugfix to me.
>
>>>>> Reverting this patch it's fine.
>>>>>
>>>>> I add part of my DT (something is maybe wrong inside):
>>>>>
>>>>> memory@c0000000 {
>>>>> reg = <0xc0000000 0x20000000>;
>>>>> };
>>>>>
>>>>> reserved-memory {
>>>>> #address-cells = <1>;
>>>>> #size-cells = <1>;
>>>>> ranges;
>>>>>
>>>>> gpu_reserved: gpu@d4000000 {
>>>>> reg = <0xd4000000 0x4000000>;
>>>>> no-map;
>>>>> };
>>>>> };
>>>>>
>>>>> Sorry if this issue has already been raised and discussed.
>>>>>
>
> Could you explain why it fails? The region is clearly part of system
> memory, and tagged as no-map, so the patch in itself is not
> unreasonable. However, we obviously have code that relies on how the
> region is represented in /proc/iomem, so it would be helpful to get
> some insight into why this is the case.

I do wonder as well, we have a 32MB "no-map" reserved memory region on
our platforms located at 0xfe000000. Without the offending commit,
/proc/iomem looks like this:

40000000-fdffefff : System RAM
40008000-40ffffff : Kernel code
41e00000-41ef1d77 : Kernel data
100000000-13fffffff : System RAM

and with the patch applied, we have this:

40000000-fdffefff : System RAM
40008000-40ffffff : Kernel code
41e00000-41ef3db7 : Kernel data
fdfff000-ffffffff : System RAM
100000000-13fffffff : System RAM

so we can now see that the region 0xfe000000 - 0xfffffff is also cobbled
up with the preceding region which is a mailbox between Linux and the
secure monitor at 0xfdfff000 and of size 4KB. It seems like there is

The memblock=debug outputs is also different:

[ 0.000000] MEMBLOCK configuration:
[ 0.000000] memory size = 0xfdfff000 reserved size = 0x7ce4d20d
[ 0.000000] memory.cnt = 0x2
[ 0.000000] memory[0x0] [0x00000040000000-0x000000fdffefff],
0xbdfff000 bytes flags: 0x0
[ 0.000000] memory[0x1] [0x00000100000000-0x0000013fffffff],
0x40000000 bytes flags: 0x0
[ 0.000000] reserved.cnt = 0x6
[ 0.000000] reserved[0x0] [0x00000040003000-0x0000004000e494],
0xb495 bytes flags: 0x0
[ 0.000000] reserved[0x1] [0x00000040200000-0x00000041ef1d77],
0x1cf1d78 bytes flags: 0x0
[ 0.000000] reserved[0x2] [0x00000045000000-0x000000450fffff],
0x100000 bytes flags: 0x0
[ 0.000000] reserved[0x3] [0x00000047000000-0x0000004704ffff],
0x50000 bytes flags: 0x0
[ 0.000000] reserved[0x4] [0x000000c2c00000-0x000000fdbfffff],
0x3b000000 bytes flags: 0x0
[ 0.000000] reserved[0x5] [0x00000100000000-0x0000013fffffff],
0x40000000 bytes flags: 0x0

[ 0.000000] MEMBLOCK configuration:
[ 0.000000] memory size = 0x100000000 reserved size = 0x7ca4f24d
[ 0.000000] memory.cnt = 0x3
[ 0.000000] memory[0x0] [0x00000040000000-0x000000fdffefff],
0xbdfff000 bytes flags: 0x0
[ 0.000000] memory[0x1] [0x000000fdfff000-0x000000ffffffff],
0x2001000 bytes flags: 0x4
[ 0.000000] memory[0x2] [0x00000100000000-0x0000013fffffff],
0x40000000 bytes flags: 0x0
[ 0.000000] reserved.cnt = 0x6
[ 0.000000] reserved[0x0] [0x00000040003000-0x0000004000e494],
0xb495 bytes flags: 0x0
[ 0.000000] reserved[0x1] [0x00000040200000-0x00000041ef3db7],
0x1cf3db8 bytes flags: 0x0
[ 0.000000] reserved[0x2] [0x00000045000000-0x000000450fffff],
0x100000 bytes flags: 0x0
[ 0.000000] reserved[0x3] [0x00000047000000-0x0000004704ffff],
0x50000 bytes flags: 0x0
[ 0.000000] reserved[0x4] [0x000000c3000000-0x000000fdbfffff],
0x3ac00000 bytes flags: 0x0
[ 0.000000] reserved[0x5] [0x00000100000000-0x0000013fffffff],
0x40000000 bytes flags: 0x0

in the second case we can clearly see that the 32MB no-map region is now
considered as usable RAM.

Hope this helps.

>
> In any case, the mere fact that this causes a regression should be
> sufficient justification to revert/withdraw it from v5.4, as I don't
> see a reason why it was merged there in the first place. (It has no
> fixes tag or cc:stable)

Agreed, however that means we still need to find out whether a more
recent kernel is also broken, I should be able to tell you that a little
later.
--
Florian