Sorry for following up on my own post, I have a little extra
information.
I started a g++ job to try to force the machine to write out some dirty
buffers before I reboot. g++ now hangs along with two sync's, bdflush
and the postgres process.
This is from top:
PID USER PRI NI SIZE RSS SHARE WCHAN STAT %CPU %MEM TIME COMMAN
6 root 9 0 0 0 0 raid1_all DW 0.0 0.0 0:26 bdflush
1140 joe 9 0 71564 39M 0 wait_on_b D 0.0 32.3 1:04 cc1plus
1007 root 9 0 72 4 4 wait_on_b D 0.0 0.0 0:00 sync
10023 postgres 9 0 368 4 4 wait_on_b D 0.0 0.0 0:00 postmas
Seems like something's rotten with bdflush and raid1_all (-something).
There is one (software) RAID1 on four SCSI disks in the machine, perhaps RAID1
has a misfeature when more than just two disks are in the mirror ?
Anyway, this machine is going down now - I can't wait anymore, sorry. I wonder
how many file writes didn't make it...
-- ................................................................ : jakob@unthought.net : And I see the elder races, : :.........................: putrid forms of man : : Jakob Østergaard : See him rise and claim the earth, : : OZ9ABN : his downfall is at hand. : :.........................:............{Konkhra}...............: - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
This archive was generated by hypermail 2b29 : Sun Sep 23 2001 - 21:00:26 EST