Re: [RFC][PATCH 1/2] sched: Extended scheduler time slice
From: Linus Torvalds
Date: Sat Feb 01 2025 - 18:19:01 EST
On Sat, 1 Feb 2025 at 15:08, Steven Rostedt <rostedt@xxxxxxxxxxx> wrote:
>
> No it doesn't need to be yield. That just seemed like the obvious
> system call to use. But any system call would force a schedule. We
> could just optimize yield() though.
No, "optimizing" yield is not on the table.
Why? Because it has *semantics* that people depend on because it has
historical meaning. Things like "move the process to the end of the
scheduling queue of that priority".
That may sound like a good thing, but it's ABSOLUTELY NOT what you
should actually do unless you know *exactly* what the system behavior
is.
For example, maybe the reason the kernel set NEED_RESCHED_LAZY is that
a higher-priority process is ready to run. You haven't used up your
time slice yet, but something more important needs the CPU.
If you call "sched_yield()", sure, you'll run that higher priority
thing. So far so good.
But you *also* are literally telling the scheduler to put you at the
back of the queue, despite the fact that maybe you are still in line
to be run for *your* priority level. So now your performance will
absolutely suck, because you just told the scheduler that you are not
important, and other processes in your priority level should get
priority.
So no. "yield()" does not mean "just reschedule". It rally means
"yield my position in the scheduling queue".
You are literally better off using absolutely *ANY* other system call.
The fact that you are confused about this kind of very basic issue
does imply that this patch should absolutely be handled by somebody
who knows the scheduler better.
Linus