Re: Lots of con-current I/O = resets SATA link? (2.6.25.10)
From: Mattias Wadenstein
Date:  Mon Jul 07 2008 - 05:53:34 EST
On Sat, 5 Jul 2008, Justin Piszcz wrote:
On Sat, 5 Jul 2008, Robert Hancock wrote:
Justin Piszcz wrote:
Can you post your dmesg from bootup with the controller/drive detection?
So you've got 6 drives in the machine. Intel chipsets normally seem pretty 
robust with AHCI.
Are you certain that your machine has enough power to run all those drives 
properly? We've seen in a number of cases that power fluctuations or noise 
can cause these kinds of errors.
I have a 650watt PSU (nice antec one) and the power draw of the box is 
~148watts w/ veliciraptors, ~250 when fully load all 4 cores + all 12 disks 
writing.  I have turned off the irqbalance daemon and I am going to see if 
the problem re-occurs.
Looking at the sum wattage number is really misleading for this. You need 
to dig out the specs for how many amps it can provide on the different 
voltages (5 and 12 volts). In particular, many modern PSUs have several 
separate 12V rails, where one (or more, some have the 12V supply split 
into 3 or 4 parts!) is used for CPU and GFX card power and usually only 
one is available for disks.
You can also have plenty of 12V left but run out of 5V, or the other way 
around. I've spent quite some time trying to find a PSU that would handle 
18 disks without costing too much. The splitting of the 12V power into 
separate rails and a general lack of 5V compared to what the disks need 
according to their specs just made it difficult, and I ended up bonding 
two PSUs together (linking the ground together with some custom cabling) 
to get a stable machine again.
/Mattias Wadenstein
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/