Re: How to disable Linux kernel self-extraction (KERNEL_GZIP, KERNEL_BZIP2, â)?

From: Paul Menzel
Date: Tue Apr 24 2018 - 05:09:16 EST


Dear Adam,


Thank you very much to join the discussion.

On 04/24/18 04:08, Adam Borowski wrote:
On Mon, Apr 23, 2018 at 07:02:30PM +0200, Pavel Machek wrote:
I try to decrease boot time, and my system has an SSD and enough space, so
loading 18 instead of 12 MB doesnât make a difference, but the
self-extraction is noticeable. So, I like to disable it.

How long does GZIP extraction take on your hardware?

Itâs hard to measure â at least I didnât find a way to do so â, but counting
from the last GRUB message to the first message of Linux (with `quiet`
removed from the the command line), it takes roughly *two* seconds.

I took a somewhat different approach: I recorded the output from grub+kernel
to ttyrec over serial line, and rigged ttyrec2ansi to output timestamp
difference from the last checkpoint every time an '\e' or '\n' is seen.
'\e' is important, as there's no other marking for when grub stops the
interactive phase and starts the actual boot.

Nice work. Itâd be great, if you shared these patches, so others and I can reproduce it.

(Also, `scripts/extract-vmlinux` needs to be updated for LZ4.)

Turns out that, reading from SSD, grub is way way slower than it should take
normally. The machine is old (AMD Phenom II X6 1055T), SSD is Crucial
CT240M500SSD1.

What firmware does the device (mainboard) have? Legacy BIOS or UEFI (or even coreboot ;-)). Itâs my understanding, that GRUB does not have a native driver with the legacy BIOS and UEFI, and just uses the BIOS calls or the UEFI equivalent.

I also have the zstd patch applied, which adds another data point.

The two "Loading XXX ..." lines come from grub, those timestamped within []
brackets from the kernel, ââare ttyrec timestamps, â is wrapped lines.


zstd:

Loading Linux 4.17.0-rc2-debug-00025-gd426b0ba363d ...â0.739823â
^MLoading initial ramdisk ...â0.402010â
^M[ 0.000000] Linux version 4.17.0-rc2-debug-00025-gd426b0ba363d â
(kilobyte@umbar) (gcc version 7.3.0 (Debian 7.3.0-16)) #1 SMP Mon Apr 23â
10:25:58 CEST 2018^Mâ0.785922â
[ 0.000000] Command line: BOOT_IMAGE=/sys/boot/vmlinuz-4.17.0-rc2-debug-00025-gd426b0ba363dâ
root=UUID=b7c38da9-ae84-4083-a1f8-6d7b4fc33961 ro rootflags=subvol=sys syscall.x32=yâ
console=tty0 console=ttyS0,115200n8 no_console_suspend^Mâ0.020199â

gzip:

Loading Linux 4.17.0-rc2-debug-gz-00025-gd426b0ba363d ...â0.724988â
^MLoading initial ramdisk ...â0.357951â
^M[ 0.000000] Linux version 4.17.0-rc2-debug-gz-00025-gd426b0ba363d â
(kilobyte@umbar) (gcc version 7.3.0 (Debian 7.3.0-16)) #1 SMP Mon Apr 23 â
23:15:07 CEST 2018^Mâ0.777977â
[ 0.000000] Command line: BOOT_IMAGE=/sys/boot/vmlinuz-4.17.0-rc2-debug-gz-00025-gd426b0ba363dâ
root=UUID=b7c38da9-ae84-4083-a1f8-6d7b4fc33961 ro rootflags=subvol=sys syscall.x32=yâ
console=tty0 console=ttyS0,115200n8 no_console_suspend^Mâ0.020117â

lz4:

Loading Linux 4.17.0-rc2-debug-none-00025-gd426b0ba363d ...â0.799969â
^MLoading initial ramdisk ...â0.424959â
^M[ 0.000000] Linux version 4.17.0-rc2-debug-lz4-00025-gd426b0ba363d â
(kilobyte@umbar) (gcc version 7.3.0 (Debian 7.3.0-16)) #1 SMP Tue Apr 24 â
00:34:59 CEST 2018^Mâ0.732925â
[ 0.000000] Command line: BOOT_IMAGE=/sys/boot/vmlinuz-4.17.0-rc2-debug-lz4-00025-gd426b0ba363dâ
root=UUID=b7c38da9-ae84-4083-a1f8-6d7b4fc33961 ro rootflags=subvol=sys syscall.x32=yâ
console=tty0 console=ttyS0,115200n8 no_console_suspend^Mâ0.021019â

zstd again:

Loading Linux 4.17.0-rc2-debug-00025-gd426b0ba363d ...â0.728852â
^MLoading initial ramdisk ...â0.399968â
^M[ 0.000000] Linux version 4.17.0-rc2-debug-00025-gd426b0ba363d â
(kilobyte@umbar) (gcc version 7.3.0 (Debian 7.3.0-16)) #1 SMP Mon Apr 23 â
10:25:58 CEST 2018^Mâ0.786964â
[ 0.000000] Command line: BOOT_IMAGE=/sys/boot/vmlinuz-4.17.0-rc2-debug-00025-gd426b0ba363dâ
root=UUID=b7c38da9-ae84-4083-a1f8-6d7b4fc33961 ro rootflags=subvol=sys syscall.x32=yâ
console=tty0 console=ttyS0,115200n8 no_console_suspend^Mâ0.020071â

lz4 rigged for no compression:

Loading Linux 4.17.0-rc2-debug-none-00025-gd426b0ba363d-dirty ...â0.479834â
^MLoading initial ramdisk ...â2.246985â

Just to be sure. The 2.2 seconds are from loading the 27 MB Linux kernel image, right?

^M[ 0.000000] Linux version 4.17.0-rc2-debug-none-00025-gd426b0ba363d-dirty â
(kilobyte@umbar) (gcc version 7.3.0 (Debian 7.3.0-16)) #5 SMP Tue Apr 24 â
02:57:18 CEST 2018^Mâ0.711949â
[ 0.000000] Command line: BOOT_IMAGE=/sys/boot/vmlinuz-4.17.0-rc2-debug-none-00025-gd426b0ba363d-dirtyâ
root=UUID=b7c38da9-ae84-4083-a1f8-6d7b4fc33961 ro rootflags=subvol=sys syscall.x32=yâ
console=tty0 console=ttyS0,115200n8 no_console_suspend^Mâ0.021902â

Sizes of relevant files:

14826134 initrd.img-4.17.0-rc2-debug-00025-gd426b0ba363d (zstd)
14826352 initrd.img-4.17.0-rc2-debug-gz-00025-gd426b0ba363d
14826909 initrd.img-4.17.0-rc2-debug-lz4-00025-gd426b0ba363d
14826761 initrd.img-4.17.0-rc2-debug-none-00025-gd426b0ba363d-dirty
6567408 vmlinuz-4.17.0-rc2-debug-00025-gd426b0ba363d (zstd)
7230960 vmlinuz-4.17.0-rc2-debug-gz-00025-gd426b0ba363d
8775152 vmlinuz-4.17.0-rc2-debug-lz4-00025-gd426b0ba363d
27821552 vmlinuz-4.17.0-rc2-debug-none-00025-gd426b0ba363d-dirty
(I did not alter initrd compression, which is zstd in all cases).

Does the size of the uncompressed image match the size in `arch/x86/boot/compressed/vmlinux.bin`?

So yes, looks like uncompressed kernel image may be good idea.

Seems like the time to actually read this far bigger file from the disk
using grub's inefficient way, takes longer than the gains from faster
decompression. You can eliminate the decompression step altogether by
avoiding copying, but it still looks like it's not a win.

I've seen u-boot taking ~60 seconds to read from a SD card, too.

I could test on my coreboot systems, if GRUB is faster with the native AHCI driver.

Another surprise is that zstd is a notch _slower_ than gzip (in userspace
it's drastically faster for the same compression ratio), but reduced disk
space is still nice. It's worth investigating why it's not as fast as it
should be.

Maybe that should be done in a separate thread. Iâll split it out.

Actually... Compressors usually have a mode when they store the data
uncompressed. So you should be able to prepare .gz image which is not
really compressed inside, and thus really fast to uncompress.

I can't seem to find any. IIRC xz format can store uncompressed blocks but
the tool doesn't appear to expose this as an option.

Or maybe even better -- there should be some compression algorithms
that are fast enough to uncompress that there should be no
slowdown. Maybe use one of those?

Perhaps my method is totally wrong, but differences in decompression speed
are surprisingly small, dwarfed by whatever else the kernel does between
messages.

I did not test xz, nor ran tests more than once, but it's 4am so these are
things to do tomorrow.

Itâd be interesting to find out, what is happening in the first 700 ms, before the first Linux message is transmitted over serial line. It could be, that the serial line affects the time stamps for example, or takes so long to set up the serial console.


Kind regards,

Paul

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature