Re: [GIT PULL] printk for 5.20
From: Thomas Gleixner
Date: Mon Aug 08 2022 - 21:18:28 EST
Linus,
On Tue, Aug 02 2022 at 20:19, Linus Torvalds wrote:
> On Mon, Aug 1, 2022 at 8:08 AM Petr Mladek <pmladek@xxxxxxxx> wrote:
>>
>> - Completely disable printing on consoles with CONFIG_RT.
> And guys, I want to make it really clear how disappointed I am with
> the printk tree lately. There seems to be some kind of hardline
> religious fervor having taken over to make these kinds of "this is how
> it has to be done, screw any sanity or common sense".
...
> Put another way: not only am I not pulling this, I'm concerned that I
> will not be pulling printk patches in the future either because of
> where these pull requests seem to be trending.
I really have to stand up for the printk maintainers here.
Especially Petr has done an extraordinary job in the past few years.
There have been hickups, but with such a semantically ill defined
mechanism like printk() that's not a surprise at all.
Let me add some historical background here.
In Sept. 2019, i.e. almost 3 years ago, we all - including you - sat
together at Plumbers in Lisbon and agreed that printk() in it's back
then form is a nightmare not only for PREEMPT_RT. We also agreed back
then that seperating out the console writes into individual printk
threads makes sense because it also gets rid of the fully serializing
nature of printk() which exists for historical but not for technical
reasons. Also the magic oops_in_progress heuristics have been declared
to be just duct tape, as the still existing (after 25+ years of Linux)
situation with graphics demonstrates on a daily basis. We all agreed
that a dedicated atomic_write() which also allows to prioritize your
favourite pet pieve of making laptops more debugable by utilizing
persistant storage simpler and what's more important more reliable.
It took whopping 2.5 years to get to the point to reach this seperation
as it required to rewrite the buffers and other infrastructure. This was
very responsible and cautionosly guided by Petr and the other members of
the printk() maintainer team. Along with that went quite some
improvements like realistic timestamping and other details which matter
for dmesg power users.
The printk threads were added in the 5.19 merge window and unfortunately
reverted between 5.19-rc3 and 5.19-rc4 for the very wrong reason:
Some embedded boards failed to boot.
The root cause is missing locking in the init functions of the
related UART drivers.
This is not a problem of threaded printk(), Those are existing bugs
in these drivers which can be triggered on a 5.18 kernel. They are
hard to trigger and nobody cared so far because they were neither
bisectable nor reliably reproducible. The threaded printk() change
made them reproducible and the bisect pointed at the messenger and
not at the root cause.
Nevertheless it was decided to throw away valuable work for no real good
reason. That revert costs me constantly ~2 seconds of boot time on one my
main development machines. Not much, but ~10% and I know that quite some
folks in the fastboot camp give a leg for 10%.
The right decision would have been to offer this by boot or config
parameter and not to throw the baby out with the bathwater.
I know it's my fault because I was AFK that week...
Coming back to the commit in question which made you (rightfully) upset.
I agree that it should never have happened, but OTOH it's a very clear
message of developer frustration to you:
On one hand you can't get tired of "praising" the RT people about their
responsible approach to solve fundamental shortcomigs in the kernel, but
at the same time you're pulling whatever new fancy "technology" which
comes around the corner and then makes my and your inbox full of
security issues. That's obviously not a problem, right?
Neither is it a problem that all of these "feature" developers can
rightfully ignore PREEMPT_RT and offload all the resulting problems to
the RT people, right?
I might be wrong, but you seem to have a fundamental misunderstanding of
the scope and importance of PREEEMPT_RT:
> Oh, I agree that it probably is a pretty small community.
Pull the rug under that "pretty small community" and the industrial
world is comming to a grinding halt. That grinding halt will affect your
power grid, your water supply and quite some other essential things
which are listed under Civil Infrastructure. Plus the new fangled edge
computing, modern cars ....
There is a reason that at least some major industry players have
invested into the RT project for the last couple of years. All major
distro players have commercially supported RT offerings for years and
they are surely not doing this just for fun.
There are freeriders as always. Space*, car*, distro* ,,,,
> And I also think that people who are really into RT are basically
> always going to have extra patches anyway - I think the bulk of the
> core stuff has made it upstream, but not *all* has made it.
At the point of this writing the only outstanding issue is printk, which
prevents PREEMPT_RT from being enabled in your tree on x86[64] and
ARM[64].
This still does not justify the commit in question, but clearly points
out that there is an attitude problem.
Which attitude problem?
The attitude problem that PREEMPT_RT is just a niche, but the larger
kernel community is happy to consume the benefits of that allegded
niche without giving anything back.
Think about that.
Thanks,
Thomas