time freeze under 2.4.9

From: Firenza
Date: Wed Nov 19 2003 - 05:07:01 EST



Hello,

Pretext: 4-way server runs on plain vanilla redhat 2.4.9-21
kernel for almost 2 years without any problems. Two weeks ago,
applications start to behave "strangely", which is traced
back to the fact that time "froze" on that server.

# strace sleep 1
...
nanosleep({1, 0}, <unfinished ...> (never returns)

date shows the freezing time, hwclock shows that the hardware
clock is still working and no suspicious kernel messages are
found in the logs.

Discouragingly, the same thing happened to a different server
(same kernel, very similar hardware) a couple of days later.
Additionally that server is not in use, which makes it less
likely that this behaviour is caused by a freshly introduced
application.

Rebooting fixed the problem, but the same behaviour surfaced
again for the first server a couple of hours ago.

What circumstances could cause this behaviour?

Is there a way to further pinpoint the cause?

cheers,
Firenza

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/