|Release date||December 7, 2017|
|Fabrication process||TSMC 12 nm (FinFET)|
|Successor||Ampere (consumer, professional)|
Volta is the codename for a GPU microarchitecture developed by Nvidia, succeeding Pascal. It was first announced on a roadmap in March 2013, although the first product was not announced until May 2017. The architecture is named after the 18th–19th century Italian chemist and physicist Alessandro Volta. It was Nvidia's first architecture to feature Tensor Cores, specially designed cores that deliver higher deep learning performance than regular CUDA cores. The architecture is produced with TSMC's 12 nm FinFET process. The Ampere microarchitecture is the successor to Volta.
The first graphics card to use it was the datacenter Tesla V100, which shipped, for example, as part of the Nvidia DGX-1 system. Volta has also been used in the Quadro GV100 and Titan V. There were no mainstream GeForce graphics cards based on Volta.
Architectural improvements of the Volta architecture include the following:
- CUDA Compute Capability 7.0
- concurrent execution of integer and floating point operations
- TSMC's 12 nm FinFET process, allowing 21.1 billion transistors
- High Bandwidth Memory 2 (HBM2)
- NVLink 2.0: a high-bandwidth bus between the CPU and GPU, and between multiple GPUs. It allows much higher transfer speeds than those achievable with PCI Express and is estimated to provide 25 Gbit/s per lane. (Disabled on the Titan V.)
- Tensor cores: a tensor core is a unit that multiplies two 4×4 FP16 matrices and then adds a third FP16 or FP32 matrix to the result using fused multiply–add operations, producing an FP32 result that can optionally be demoted to FP16. Tensor cores are intended to speed up the training of neural networks. Volta's Tensor cores are first generation, while Ampere has third-generation Tensor cores.
- PureVideo Feature Set I hardware video decoding
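The tensor core operation described in the list above, D = A × B + C with FP16 inputs and FP32 accumulation, can be emulated in software to make the precision behaviour concrete. The following is a minimal sketch, not a description of the hardware: the function names (`to_fp16`, `to_fp32`, `tensor_core_mma`) are hypothetical, and rounding the accumulator to FP32 after every addition is an assumption, since the exact internal rounding of the hardware is not stated in the text.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def tensor_core_mma(a, b, c, out_fp16=False):
    """Emulate D = A x B + C for 4x4 matrices given as nested lists.

    A and B are rounded to FP16 before use; C is taken as FP32.
    Assumption: the running sum is rounded to FP32 after each addition.
    """
    n = 4
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            acc = to_fp32(c[i][j])
            for k in range(n):
                # The product of two FP16 values is exactly representable in FP32.
                prod = to_fp16(a[i][k]) * to_fp16(b[k][j])
                acc = to_fp32(acc + prod)
            # Optionally demote the FP32 result to FP16.
            d[i][j] = to_fp16(acc) if out_fp16 else acc
    return d


if __name__ == "__main__":
    a = [[0.1 * (i + 1)] * 4 for i in range(4)]
    b = [[0.5] * 4 for _ in range(4)]
    c = [[1.0] * 4 for _ in range(4)]
    for row in tensor_core_mma(a, b, c):
        print(row)
```

Passing `out_fp16=True` shows the effect of the optional demotion: values that fit in FP32 but not in FP16 are rounded to the nearest half-precision value.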
At its annual GPU Technology Conference keynote on May 10, 2017, Nvidia officially announced the Volta microarchitecture along with the Tesla V100. The Volta GV100 GPU is built on a 12 nm process and uses HBM2 memory with 900 GB/s of bandwidth.
Nvidia officially announced the Quadro GV100 on March 27, 2018.
|Model||Launch||Code name(s)||Fab||Bus interface||Core config||SM||Clock speeds||Fillrate||Memory||Processing power (GFLOPS)||TDP||NVLink support||Launch price|
|Nvidia Titan V||December 7, 2017||GV100-400-A1||TSMC 12 nm||21.1||815||PCIe 3.0 ×16||5120:320:96||640||80||6||4.5||1200||1455||1700||139.7||465.6||12||652.8||HBM2||3072||12288 (14899)||6144 (7450)||24576 (29798)||250||No||$2,999|
|Nvidia Quadro GV100||March 27, 2018||GV100||5120:320:128||6||1132||1628||1696||208.4||521||32||868.4||4096||11592 (16671)||5796 (8335)||23183 (33341)||Yes||$8,999|
|Nvidia Titan V CEO Edition||June 21, 2018||1200||1455||1700||186.2||465.6||870.4||12288 (14899)||6144 (7450)||24576 (29798)||N/A|
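Several figures in the Titan V row can be reproduced from the core configuration and clock speeds. The sketch below is illustrative and rests on assumptions about how the columns are to be read: the core config 5120:320:96 is taken as shader cores : texture units : render output units, fillrates are taken at the 1455 MHz boost clock, each core is counted as 2 floating-point operations per clock (one fused multiply–add), and double and half precision are taken as 1/2 and 2 times the single-precision rate. The function names are hypothetical.

```python
def fillrate_g_per_s(units, clock_mhz):
    """Units per clock times clock in MHz, expressed in giga-operations per second."""
    return units * clock_mhz / 1000.0


def memory_bandwidth_gb_s(bus_width_bits, data_rate_mt_s):
    """Bus width in bytes times transfer rate in MT/s, expressed in GB/s."""
    return bus_width_bits / 8 * data_rate_mt_s / 1000.0


def fp32_gflops(cuda_cores, clock_mhz):
    """Assumes 2 floating-point operations per core per clock (one FMA)."""
    return 2 * cuda_cores * clock_mhz / 1000.0


if __name__ == "__main__":
    # Titan V values as read from the table row (column meanings assumed).
    cores, tmus, rops = 5120, 320, 96
    base_mhz, boost_mhz, mem_mt_s, bus_bits = 1200, 1455, 1700, 3072

    print("Pixel fillrate (GP/s):", round(fillrate_g_per_s(rops, boost_mhz), 1))
    print("Texture fillrate (GT/s):", round(fillrate_g_per_s(tmus, boost_mhz), 1))
    print("Memory bandwidth (GB/s):", round(memory_bandwidth_gb_s(bus_bits, mem_mt_s), 1))
    single = fp32_gflops(cores, base_mhz)
    print("Single / double / half (GFLOPS):", single, single / 2, single * 2)
```

Under these assumptions the results agree with the row: 139.7 and 465.6 for the fillrates, 652.8 GB/s for memory bandwidth, and 12288, 6144 and 24576 GFLOPS at the base clock.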
Volta GPUs are used for GPGPU compute in the Summit and Sierra supercomputers. The GPUs connect to the POWER9 CPUs via NVLink 2.0, which supports cache coherency and thereby improves GPGPU performance.
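The per-lane NVLink 2.0 rate quoted in the feature list (25 Gbit/s) can be turned into a rough link-level comparison with PCI Express. This is a back-of-the-envelope sketch: the lane count per link (8 per direction) and the number of links on a V100 (6) are assumptions not stated in the text above, and the PCIe 3.0 figure uses the standard 8 GT/s rate with 128b/130b encoding. All names are hypothetical.

```python
NVLINK2_GBIT_PER_LANE = 25   # from the text
LANES_PER_DIRECTION = 8      # assumption: lanes per link, per direction
LINKS_PER_GPU = 6            # assumption: links on a Tesla V100


def nvlink_link_gb_s(gbit_per_lane=NVLINK2_GBIT_PER_LANE, lanes=LANES_PER_DIRECTION):
    """One-direction bandwidth of a single link in GB/s."""
    return gbit_per_lane * lanes / 8


def nvlink_total_gb_s(links=LINKS_PER_GPU):
    """Aggregate bidirectional bandwidth across all links in GB/s."""
    return 2 * links * nvlink_link_gb_s()


def pcie3_x16_gb_s():
    """One-direction PCIe 3.0 x16 bandwidth: 8 GT/s, 16 lanes, 128b/130b encoding."""
    return 8 * 16 * (128 / 130) / 8


if __name__ == "__main__":
    print("NVLink 2.0, one link, one direction (GB/s):", nvlink_link_gb_s())
    print("NVLink 2.0, all links, both directions (GB/s):", nvlink_total_gb_s())
    print("PCIe 3.0 x16, one direction (GB/s):", round(pcie3_x16_gb_s(), 2))
```

Under these assumptions a single NVLink 2.0 link already exceeds a full PCIe 3.0 ×16 slot in one direction, which is consistent with the text's claim of much higher transfer speeds.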