Re: bdflush and postgres stuck in D state

From: Jakob Østergaard (jakob@unthought.net)
Date: Tue Sep 18 2001 - 12:30:23 EST


Sorry for following up on my own post, I have a little extra
information.

I started a g++ job to try to force the machine to write out some dirty
buffers before I reboot. g++ now hangs along with two sync's, bdflush
and the postgres process.

This is from top:

  PID USER PRI NI SIZE RSS SHARE WCHAN STAT %CPU %MEM TIME COMMAN
    6 root 9 0 0 0 0 raid1_all DW 0.0 0.0 0:26 bdflush
 1140 joe 9 0 71564 39M 0 wait_on_b D 0.0 32.3 1:04 cc1plus
 1007 root 9 0 72 4 4 wait_on_b D 0.0 0.0 0:00 sync
10023 postgres 9 0 368 4 4 wait_on_b D 0.0 0.0 0:00 postmas

Seems like something's rotten with bdflush and raid1_all (-something).

There is one (software) RAID1 on four SCSI disks in the machine, perhaps RAID1
has a misfeature when more than just two disks are in the mirror ?

Anyway, this machine is going down now - I can't wait anymore, sorry. I wonder
how many file writes didn't make it...

-- 
................................................................
:   jakob@unthought.net   : And I see the elder races,         :
:.........................: putrid forms of man                :
:   Jakob Østergaard      : See him rise and claim the earth,  :
:        OZ9ABN           : his downfall is at hand.           :
:.........................:............{Konkhra}...............:
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sun Sep 23 2001 - 21:00:26 EST