PCIe performance on nForcePro 2200/CK804

From: J.A. MagallÃn
Date: Thu Feb 04 2010 - 19:50:25 EST


Hi all...

I have a mobo with this chipset, the board in question is a SuperMicro H8DCE:

http://www.supermicro.com/Aplus/motherboard/Opteron/nForce/H8DCE.cfm

It has a GeForce GTX 260. The PCIe slot is supposed to be x16.
I use CUDA to do some calculations, and the transfer speed from host
memory to device is something like 1.3 GB/s (measured just timing a
host->device copy).

The same card on an Intel ICH10 chipset (Asus P5Q-PRO board) gives
about 4Gb/s.
One oldy 8800GTX on ICH8 (P965) gives 2.5 Gb/s (Intel DP965LT).

Is the nForce chipset so bad, or is it a kernel misconfiguration/bug ?
Or hardware/BIOS is not setup correctly ?

I am using 2.6.33-rc6 for x86-64. The kernel, gcc, CUDA, and even the test
binary is the same on all boxes (shared $HOME)...

Any ideas ?

cicely:~# lspci -vv -s 01:00.0
01:00.0 VGA compatible controller: nVidia Corporation GT200 [GeForce GTX 260] (rev a1) (prog-if 00 [VGA controller])
Subsystem: ASUSTeK Computer Inc. Device 82e3
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 19
Region 0: Memory at be000000 (32-bit, non-prefetchable) [size=16M]
Region 1: Memory at c0000000 (64-bit, prefetchable) [size=256M]
Region 3: Memory at bc000000 (64-bit, non-prefetchable) [size=32M]
Region 5: I/O ports at 8c00 [size=128]
[virtual] Expansion ROM at bfb80000 [disabled] [size=512K]
Capabilities: [60] Power Management version 3
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Address: 0000000000000000 Data: 0000
Capabilities: [78] Express (v1) Endpoint, MSI 00
DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s <512ns, L1 <4us
ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
MaxPayload 128 bytes, MaxReadReq 512 bytes
DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
LnkCap: Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Latency L0 <512ns, L1 <1us
ClockPM- Surprise- LLActRep- BwNot-
LnkCtl: ASPM Disabled; RCB 128 bytes Disabled- Retrain- CommClk-
ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
LnkSta: Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk- DLActive- BWMgmt- ABWMgmt-
Kernel driver in use: nvidia
Kernel modules: nvidia-current, nvidiafb, nouveau

TIA

--
J.A. Magallon <jamagallon()ono!com> \ Software is like sex:
\ It's better when it's free
Mandriva Linux release 2010.1 (Cooker) for x86_64
Linux 2.6.32.3-desktop-0.rc2.1mnb (gcc 4.4.2 ) SMP
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/