If you're nowhere near maximum theoretical throughput of 40gbe, don't expect that 25gbe is going to magically result in better performance. I've been able to achieve 850MB/s on 10gbe, and 3.4GB/s on 40gbe (multi stream aggregate), all at default 1500mtu, with no tuning beyond optimizing block sizes. And that's real disk io, not raw network iperf, where I'm actually constrained by the disks not the network.
I don't think the problem is 40gbe technology itself.... You need to do some tuning.
I don't think the problem is 40gbe technology itself.... You need to do some tuning.