Tesla V100 VS. P100 (deep learning speed testing)

dhenzjhen

Member
Sep 14, 2016
38
55
18
San Jose, California
This is only a speed training testing without any accuracy or tuning involved. I only want
to test and compare the V100s and P100s in terms of crunching speed. In this testing, I used
1281167 training images and 50000 validation images (ILSVRC2012) and NV-caffe for deep
learning framework. On the networks, I used AlexNet, GoogleNET and CaffeNET with different
batch sizes and 300,000 iterations runtime on all the tests.

System: Supermicro SYS-4028GR-TXRT
Motherboard: X10DG0-T
CPU: E5 2699V4 x 2
MEM: 32GB Micron x 12
BIOS: 5/25/17
GPU: Nvidia Tesla V100 SXM2 x 8 | P100 SXM2 x 8
OS: Ubuntu 16.04 x64
Driver: 384.81
CUDA: version 9




-dhenzjhen@SMCI
 
Last edited:

hifijames

Member
Dec 26, 2017
32
7
8
60
That's some serious horsepower you got there! Wish I could have that kind of toys to play with:)
I am a total noob here, so please forgive my ignorance. How involved will it be to switch one framework to another, say from Caffe to Tensorflow?
 

LukeP

Member
Feb 12, 2017
174
18
18
40
any followup why V100 with its 110Tflops is so slow? was that benchmark not tuned for V100?