If you really need to check by yourself, here is the A100 whitepaper https://images.nvidia.com/aem-dam/e...ter/nvidia-ampere-architecture-whitepaper.pdfidk anymore, i'm sure you are right. its 4am.
The tensor flops are page 36
If you really need to check by yourself, here is the A100 whitepaper https://images.nvidia.com/aem-dam/e...ter/nvidia-ampere-architecture-whitepaper.pdfidk anymore, i'm sure you are right. its 4am.
I benchmarked bf16 /w fp32 acc on DRIVE A100 to 250TF with cublasLt. I’m betting the real A100 does better (it should do 312TF because it has 108 SM instead of 96 for the Drive version)No, i'm fine - idk what i'm reading anymore. I have supermicro papers saying that number is with sparsity, another one saying its not and vice versa...
I only used the 2x8-pin VGA connectors and left the EPS one empty.For those with the 3x8 pin adapter card, are the correct cables 2 x 8 pin PCIe connectors and 1 x 8 pin EPS connector? I thought the EPS one was supposed to be compatible with the CPU cables but it seems like it only works for fits for the cable end that plugs into the power supply. Any products/adapters used? Thanks
Did you try the nvlink for qs version, does it working?They seem to be pretty certain about the traces missing, I actually asked about whether its possible to get it working some how, their feedback was they have had success by migrating the whole chip to a PCIe doner. I assume they have tested the circuits out on the SXM2 board if they have gone to that regard, but thats all the info I have.
The GPU itself needs to be modified not the board. I found a modder who can do it, minimum quantity is 50 GPUs.Did you try the nvlink for qs version, does it working?
I think it was like around 300 with short spikes 320, 350. I can run a test in a day or two if this is important to you. If so, reply here.Anybody have any numbers on what the minimum max power draw is? Like, if you were to lock the graphics clocks to 1140MHz and run model training or something else intensive, what's the power draw? Preemptive thank you to anyone willing to share the numbers, considering buying one but I'm a little concerned about the 400W+ power draw.
Thanks, would appreciate that!I think it was like around 300 with short spikes 320, 350. I can run a test in a day or two if this is important to you. If so, reply here.
300-340. Maybe if you hit a perfect shape it gets to 350. This is on an SXM2 board. Might be different on an SXM2 adapter.Thanks, would appreciate that!
I don't know if it's really necessary to do that. With three fans and five heat pipes on an open rack, the temperature of the PG199 inference model can be controlled below 70 degrees. Will this affect the secondary sale?Found an interesting photo of delidded PG199 on Xianyu 闲鱼 - 闲不住?上闲鱼! It seems pretty straightforward and not so many tools needed to do so.
When inferencing this card is constrained by memory bandwidth and power consumption rarely goes higher than 250w. If you also do some other math on this card than the only option you have without delid is to lower the core clocks. When hitting 450w and more the card tends to fall off the PCIe bus to protect itself. One of the possible reason for this is overheating no matter how many pipes you use outside of IHS because TIM between IHS and core/memory have insufficient thermal conductivity. Replacing it with indium-gallium alloy could win you hundred of MHz.I don't know if it's really necessary to do that. With three fans and five heat pipes on an open rack, the temperature of the PG199 inference model can be controlled below 70 degrees. Will this affect the secondary sale?
I saw your post on xianyu.I just haven't paid attention for a few months, and almost all the low-priced PG199s on eBay are sold out. Maybe more people are discovering the trick of using this computing card.