Re: [patch 5/5] clocksource: Rewrite watchdog code completely
From: Borislav Petkov
Date: Mon Mar 09 2026 - 11:56:16 EST
On Sat, Jan 24, 2026 at 12:18:01AM +0100, Thomas Gleixner wrote:
Patch 2 and 4 should say "Don't" in the subject title. :)
> The clocksource watchdog code has over time reached the state of an
> unpenetrable maze of duct tape and staples. The original design, which was
"impenetrable"
> made in the context of systems far smaller than today, is based on the
> assumption that the to be monitored clocksource (TSC) can be trivially
> compared against a known to be stable clocksource (HPET/ACPI-PM timer).
>
> Over the years it turned out that this approach has major flaws:
>
> - Long delays between watchdog invocations can result in wrap arounds
> of the reference clocksource
>
> - Scalability of the reference clocksource readout can degrade on large
> multi-socket systems due to interconnect congestion
>
> This was addressed with various heuristics which degraded the accurracy of
"accuracy"
But those are just nitpicks.
The overall thing sounds good to me, I haven't done any indepth review but
from a cursory look, it looks nice.
Testing looks good too on one machine so
Tested-by: Borislav Petkov (AMD) <bp@xxxxxxxxx>
I say, let's queue it now so that it can get wider testing.
Thx.
--
Regards/Gruss,
Boris.
https://people.kernel.org/tglx/notes-about-netiquette