Re: [PATCH v3] mm/hotplug: Only respect mem= parameter during boot stage
From: Michal Hocko
Date: Fri Feb 14 2020 - 05:53:46 EST
On Tue 04-02-20 13:06:43, Baoquan He wrote:
> In commit 357b4da50a62 ("x86: respect memory size limiting via mem=
> parameter") a global varialbe max_mem_size is added to store
> the value parsed from 'mem= ', then checked when memory region is
> added. This truly stops those DIMMs from being added into system memory
> during boot-time.
>
> However, it also limits the later memory hotplug functionality. Any
> DIMM can't be hotplugged any more if its region is beyond the
> max_mem_size. We will get errors like:
>
> [ 216.387164] acpi PNP0C80:02: add_memory failed
> [ 216.389301] acpi PNP0C80:02: acpi_memory_enable_device() error
> [ 216.392187] acpi PNP0C80:02: Enumeration failure
>
> This will cause issue in a known use case where 'mem=' is added to
> the hypervisor. The memory that lies after 'mem=' boundary will be
> assigned to KVM guests. After commit 357b4da50a62 merged, memory
> can't be extended dynamically if system memory on hypervisor is not
> sufficient.
>
> So fix it by also checking if it's during boot-time restricting to add
> memory. Otherwise, skip the restriction.
>
> And also add this use case to document of 'mem=' kernel parameter.
I have to say I am not entirely happy about this change but the breakage
seems to be real so we have to live with that. If there are usecases that
need to restrict the physical memory range for real we would have to add
a new command line parameter.
> Fixes: 357b4da50a62 ("x86: respect memory size limiting via mem= parameter")
> Signed-off-by: Baoquan He <bhe@xxxxxxxxxx>
Acked-by: Michal Hocko <mhocko@xxxxxxxx>
> ---
> v2->v3:
> In discussion of v1 and v2, People have concern about the use case
> related to the code change. So add the use case into patch log and
> document of 'mem=' in kernel-parameters.txt.
>
> Documentation/admin-guide/kernel-parameters.txt | 13 +++++++++++--
> mm/memory_hotplug.c | 8 +++++++-
> 2 files changed, 18 insertions(+), 3 deletions(-)
>
> diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt
> index ddc5ccdd4cd1..b809767e5f74 100644
> --- a/Documentation/admin-guide/kernel-parameters.txt
> +++ b/Documentation/admin-guide/kernel-parameters.txt
> @@ -2533,13 +2533,22 @@
> For details see: Documentation/admin-guide/hw-vuln/mds.rst
>
> mem=nn[KMG] [KNL,BOOT] Force usage of a specific amount of memory
> - Amount of memory to be used when the kernel is not able
> - to see the whole system memory or for test.
> + Amount of memory to be used in cases as follows:
> +
> + 1 for test;
> + 2 when the kernel is not able to see the whole system memory;
> + 3 memory that lies after 'mem=' boundary is excluded from
> + the hypervisor, then assigned to KVM guests.
> +
> [X86] Work as limiting max address. Use together
> with memmap= to avoid physical address space collisions.
> Without memmap= PCI devices could be placed at addresses
> belonging to unused RAM.
>
> + Note that this only takes effects during boot time since
> + in above case 3, memory may need be hot added after boot
> + if system memory of hypervisor is not sufficient.
> +
> mem=nopentium [BUGS=X86-32] Disable usage of 4MB pages for kernel
> memory.
>
> diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c
> index 36d80915ddc2..e6c75ceacf9a 100644
> --- a/mm/memory_hotplug.c
> +++ b/mm/memory_hotplug.c
> @@ -105,7 +105,13 @@ static struct resource *register_memory_resource(u64 start, u64 size)
> unsigned long flags = IORESOURCE_SYSTEM_RAM | IORESOURCE_BUSY;
> char *resource_name = "System RAM";
>
> - if (start + size > max_mem_size)
> + /*
> + * Make sure value parsed from 'mem=' only restricts memory adding
> + * while booting, so that memory hotplug won't be impacted. Please
> + * refer to document of 'mem=' in kernel-parameters.txt for more
> + * details.
> + */
> + if (start + size > max_mem_size && system_state < SYSTEM_RUNNING)
> return ERR_PTR(-E2BIG);
>
> /*
> --
> 2.17.2
--
Michal Hocko
SUSE Labs