Re: [PATCH] kernel/hung_task.c: disable on suspend

From: Vitaly Kuznetsov
Date: Fri Sep 21 2018 - 11:18:32 EST


"Rafael J. Wysocki" <rafael@xxxxxxxxxx> writes:

> On Fri, Sep 21, 2018 at 3:11 PM Vitaly Kuznetsov <vkuznets@xxxxxxxxxx> wrote:
>>
>> Oleg Nesterov <oleg@xxxxxxxxxx> writes:
>>
>> > On 09/17, Rafael J. Wysocki wrote:
>> >>
>> >> On Fri, Sep 14, 2018 at 6:21 PM Oleg Nesterov <oleg@xxxxxxxxxx> wrote:
>> >> >
>> >> > > > Since you are adding the notifier anyway, what about designing it to make
>> >> > > > the thread wait on _PREPARE until the notifier kicks it again on exit
>> >> > > > fron suspend/hibernation?
>> >> >
>> >> > Well. I agree that freezable kthreads are not nice, but it seems you are
>> >> > going to add another questionable interface ;)
>> >>
>> >> Why would it be questionable?
>> >>
>> >> The watchdog needs to be disarmed somehow before tasks are frozen and
>> >> re-armed after they have been thawed or it may report false-positives
>> >> on the way out. PM notifiers can be used for that.
>> >
>> > Or watchdog() can simply use set_freezable/freezing interface we already
>> > have, without additional complications.
>> >
>> > Yes, this is not "before tasks are frozen", but probably should work?
>> >
>> > OK, I won't argue.
>>
>> I was hoping you and Rafael will come to an agreement but the discussion
>> just died ... so where do we stand on this? I see the following options:
>>
>> 1) The v1 patch is good, no freezing/disabling/parking required.
>
> This would work IMO, but it also is somewhat wasteful to run this
> thread when "suspended", because it doesn't do anything then.
>
> Maybe you could simply force "timeout" to be zero for the "suspension"
> time? That should effectively make the thread sleep then, right?
>

Right, we'll have to zero both interval and timeout (if we want to keep
the current logic picking the minimum of two). The following patch seems
to be working for me, I'll send it out as v2 if there're no objections:

diff --git a/kernel/hung_task.c b/kernel/hung_task.c
index b9132d1269ef..ac6e8c9306bd 100644
--- a/kernel/hung_task.c
+++ b/kernel/hung_task.c
@@ -15,6 +15,7 @@
#include <linux/lockdep.h>
#include <linux/export.h>
#include <linux/sysctl.h>
+#include <linux/suspend.h>
#include <linux/utsname.h>
#include <linux/sched/signal.h>
#include <linux/sched/debug.h>
@@ -242,12 +243,14 @@ void reset_hung_task_detector(void)
}
EXPORT_SYMBOL_GPL(reset_hung_task_detector);

+static unsigned long hung_last_checked;
+
/*
* kthread which checks for tasks stuck in D state
*/
static int watchdog(void *dummy)
{
- unsigned long hung_last_checked = jiffies;
+ hung_last_checked = jiffies;

set_user_nice(current, 0);

@@ -272,9 +275,40 @@ static int watchdog(void *dummy)
return 0;
}

+static int hungtask_pm_notify(struct notifier_block *self,
+ unsigned long action, void *hcpu)
+{
+ static unsigned long saved_timeout, saved_interval;
+
+ switch (action) {
+ case PM_SUSPEND_PREPARE:
+ case PM_HIBERNATION_PREPARE:
+ saved_timeout = sysctl_hung_task_timeout_secs;
+ saved_interval = sysctl_hung_task_check_interval_secs;
+ sysctl_hung_task_timeout_secs = 0;
+ sysctl_hung_task_check_interval_secs = 0;
+ wake_up_process(watchdog_task);
+ break;
+ case PM_POST_SUSPEND:
+ case PM_POST_HIBERNATION:
+ sysctl_hung_task_timeout_secs = saved_timeout;
+ sysctl_hung_task_check_interval_secs = saved_interval;
+ hung_last_checked = jiffies;
+ wake_up_process(watchdog_task);
+ break;
+ default:
+ break;
+ }
+ return NOTIFY_OK;
+}
+
static int __init hung_task_init(void)
{
atomic_notifier_chain_register(&panic_notifier_list, &panic_block);
+
+ /* Disable hung task detector on suspend */
+ pm_notifier(hungtask_pm_notify, 0);
+
watchdog_task = kthread_run(watchdog, NULL, "khungtaskd");

return 0;

--
Vitaly