This is only a speed training testing without any accuracy or tuning involved. I only want
to test and compare the V100s and P100s in terms of crunching speed. In this testing, I used
1281167 training images and 50000 validation images (ILSVRC2012) and NV-caffe for deep
learning framework. On...
Deep learning for fun and new *opportunities :)
Supermicro 4028GR-TXR system
Intel E5 2698 V4 x 2
Samsung 16GB 2400 x 24
Nvidia P100-SXM2 GPU x 8
BIOS 3/22/17
IPMI 3.52
Software stack: Centos 7.2 x64, Nvidia driver 375.20, Cuda_8.0.44, Caffe(AlexNet/GoogleNet), Nccl...
System configurations used for the testing:
Motherboard: X10DRG-HT 1.02
BIOS: 7/20/16
CPU: E5-2667 v3 @ 3.20GHz x 2
MEM: 8GB SK Hynix x 8
GPU: Nvidia Tesla P40 x 2 (CPU2 slot 4 location)
OS: Centos 7.2 64 bit
Driver: Nvidia 367.44
Cuda Toolkit: 8.0.44
MPI: MPICH-3.0
Compiler: GCC version 4.8.5...
GitHub - NVIDIA/nccl: Optimized primitives for collective multi-GPU communication
Fast Multi-GPU collectives with NCCL
System Configuration:
Motherboard: X10DRG-O / Product Name: SYS-4028GR-TR
BIOS: 7/27/2016
IPMI: 3.44
CPU: E5 2689 3.1Ghz V4 x 2
Memory: Samsung 16GB x 24
GPU: Tesla M40 x 8...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.