Well, I adjusted some settings in BIOS for PCIE power and changed the power management to disabled, and power to "full" and another one to "performance".
Also QD4 w/64 Threads was this test.
If I change QD to 128 like the IOMeter Intel test performance actually drops for that QD compared to 4, and the rest stay the same.
It seems I'm hitting a 900Mb/s (7.2Gbit) limit here. Which would appear to be PCIE 3.0 single lane performance. Maybe this is why my other PCIE slots aren't working, maybe the board has a problem with PCIE lanes? Or, the tests I"m doing aren't stressing this drive enough, I couldn't get IOMeter to near mimic Intel's test with it.. maybe someone has an intel-config file they can share I could use
The Intel 750 400gb actually performed slightly better sequential read and write than the P3700.
FWIW: The actual power usage didn't change @ idle changing PCIE settings. 165w @ Idle with ESXI 5.5 & 1 VM (Win 2012), with 1 SSD, and 1 P3700, and 128gb RAM, and 2 E5-2683 v3s.