Nvidia announces PCI-e version of Tesla P100
Nvidia has announced PCI-e versions of the Tesla P100 accelerator. Nvidia already unveiled the HPC card with Pascal GPU in April, but that version had a Mezzanine connector developed by the manufacturer itself.
Nvidia will make two versions of the PCI-e variant of the P100 available at the end of this year: one with 16GB hbm2 and one with 12GB. The cards do not have the nvlink interconnect: Nvidia has developed its own Mezzanine connector for that interface with which P100 cards can communicate with each other.
The cards have slightly lower boost clock speeds than the Mezzanine version and the TDP is also lower: 250W instead of 300W. The 12GB version of the pci-e-P100 also has a 3072-bit wide memory bus, compared to the 4096-bit interface of the other two cards. Like the P100 card that Nvidia announced in April, it concerns accelerators for high performance computing such as supercomputers with a Pascal GPU and high bandwidth memory of the second generation.
Nvidia announced the P100 cards at the International Supercomputing Conference in Frankfurt.
Tesla P100 (Mezzanine) |
Tesla P100 (16GB) |
Tesla P100 (12GB) |
Tesla M40 | |
Stream Processors | 3584 | 3584 | 3584 | 3072 |
Core clock speed | 1328MHz | ? | ? | 948MHz |
Boost clock speed | 1480MHz | 1300MHz | 1300MHz | 1114MHz |
Memory clock speed | 1.4Gbit/s HBM2 | 1.4Gbit/s HBM2 | 1.4Gbit/s HBM2 | 6Gbit/s gddr5 |
Memory bus | 4096-bit | 4096-bit | 3072-bit | 384-bit |
Memory bandwidth | 720GB/sec | 720GB/sec | 540GB/sec | 288GB/sec |
Memory amount | 16GB | 16GB | 12GB | 12GB |
semi-precision | 21.2 tflops | 18.7 tflops | 18.7 tflops | 6.8 tflops |
Single Precision | 10.6 tflops | 9.3 tflops | 9.3 tflops | 6.8 tflops |
Double Precision | 5.3 tflops (1/2 rate) |
4.7 tflops (1/2 rate) |
4.7 tflops (1/2 rate) |
213 gflops (1/32 rate) |
GPU | GP100 (610mm2) |
GP100 (610mm2) |
GP100 (610mm2) |
GM200 |
Transistors | 15.3 billion | 15.3 billion | 15.3 billion | 8 billion |
Tdp | 300W | 250W | 250W | 250W |
form factor | mezzanine | pci-e | pci-e | pci-e |
Cooling | AFTER | passive | passive | passive |
Process | tsmc 16nm finfet | tsmc 16nm finfet | tsmc 16nm finfet | tsmc 28nm |
Architecture | Pascal | Pascal | Pascal | Maxwell 2 |
Table sourced from Anandtech