Re: [PATCH 3/5] arm: decompressor: define a new zImage tag
From: Lukasz Stelmach
Date: Mon Jun 01 2020 - 16:28:03 EST
It was <2020-06-01 pon 19:25>, when Russell King - ARM Linux admin wrote:
> On Mon, Jun 01, 2020 at 06:19:52PM +0200, Lukasz Stelmach wrote:
>> It was <2020-06-01 pon 15:55>, when Russell King - ARM Linux admin wrote:
>> > On Mon, Jun 01, 2020 at 04:27:52PM +0200, Åukasz Stelmach wrote:
>> >> Add DCSZ tag which holds dynamic memory (stack, bss, malloc pool)
>> >> requirements of the decompressor code.
>> >
>> > Why do we need to know the stack and BSS size, when the userspace
>> > kexec tool doesn't need to know this to perform the same function.
>>
>>
>> kexec-tools load zImage as low in DRAM as possible and rely on two
>> assumptions:
>>
>> + the zImage will copy itself to make enough room for the kernel,
>> + sizeof(zImage+mem) < sizeof(kernel+mem), which is true because
>> of compression.
>>
>> DRAM start
>> + 0x8000
>>
>> zImage |-----------|-----|-------|
>> text+data bss stack
>>
>> text+data bss
>> kernel |---------------------|-------------------|
>>
>>
>> initrd |-initrd-|-dtb-|
>
> This is actually incorrect, because the decompressor will self-
> relocate itself to avoid the area that it is going to decompress
> into.
I described the state right after kexec(8) invocation.
> So, while the decompressor runs, in the above situation it
> ends up as:
>
>
> ram |------------------------------------------------------...
> text+data bss
> kernel |---------------------|-------------------|
> zImage |-----------|-----|-------|
> text+data bss stack+malloc
And I am trying to achieve this state before the decompressor starts so
it won't need to copy itself during boot. The only exception is (as we
discussed under a different patch) when edata_size >= 128-eps MiB because
loading zImage above 128 MiB prevents it from properly detecting
physical memory. In such unlikely case my code behaves like kexec-tools
and loads zImage low. That is why I suggested that passing detailed
information about memory layout to the decompressor would help.
> Where "text+data" is actually smaller than the image size that
> was loaded - the part of the image that performs the relocation
> is discarded (the first chunk of code up to "restart" - 200
> bytes.) The BSS is typically smaller than 200 bytes, so we've
> been able to get away without knowing the actual BSS size so
> far.
>
>
> ram |--|-----------------------------------------|---------...
> |<>| TEXT_OFFSET
> kernel |---------------------|-------------------|
> |<----edata_size----->|<-----bss_size---->|
> |<---------------kernel_size------------->|
> zImage |-----------|-----|-------|
> |<-------len------->| (initial)
> |<----------len------------>| (final)
>
> The "final" len value is what the decompressor prints as the "zImage
> requires" debugging value.
>
> Hence, the size that the decompressed kernel requires is kernel_size.
>
> The size that the decompressor requires is edata_size + len(final).
>
> Now, if you intend to load the kernel to ram + TEXT_OFFSET + edata_size
> then it isn't going to lose the first 200 bytes of code, so as you
> correctly point out, we need to know the BSS size.
Formal note: can we keep using terms zImage and kernel as separate,
where zImage is what is loaded with kexec and kernel is the decompressed
code loaded at TEXT_OFFSET. I believe, it will help us avoid mistakes.
>> >> +struct arm_zimage_tag_dc {
>> >> + struct tag_header hdr;
>> >> + union {
>> >> +#define ZIMAGE_TAG_DECOMP_SIZE ARM_ZIMAGE_MAGIC4
>> >> + struct zimage_decomp_size {
>> >> + __le32 bss_size;
>> >> + __le32 stack_size;
>> >> + __le32 malloc_size;
>> >> + } decomp_size;
>
> You certainly don't need to know all this. All you need to know is
> how much space the decompressor requires after the end of the image,
> encompassing the BSS size, stack size and malloc size, which is one
> value.
I agree. However, since we are not fighting here for every single byte,
I'd rather add them as separate values and make the tag, even if only
slightly, more future-proof.
Kind regards,
--
Åukasz Stelmach
Samsung R&D Institute Poland
Samsung Electronics
Attachment:
signature.asc
Description: PGP signature