High Bandwidth Memory
High Bandwidth Memory (HBM) is a high-performance RAM interface for 3D-stacked DRAM from AMD and Hynix. It is used in conjunction with high-performance graphics accelerators and network devices.[1] The first devices to use HBM were the AMD Fiji GPUs.[2][3]
High Bandwidth Memory was adopted by JEDEC as an industry standard in October 2013.[4] The second generation, HBM2, was accepted by JEDEC in January 2016.[5]
Technology
HBM achieves higher bandwidth than DDR4 or GDDR5 while using less power and a substantially smaller form factor.[6] This is achieved by stacking up to eight DRAM dies, including an optional base die with a memory controller, which are interconnected by through-silicon vias (TSVs) and microbumps. The HBM technology is similar in principle to, but incompatible with, the Hybrid Memory Cube interface developed by Micron Technology.[7]
The HBM memory bus is very wide in comparison with other DRAM memories such as DDR4 or GDDR5. An HBM stack of four DRAM dies (4-Hi) has two 128-bit channels per die, for a total of 8 channels and an overall width of 1024 bits. A graphics card/GPU with four 4-Hi HBM stacks would therefore have a memory bus 4096 bits wide. In comparison, the bus width of GDDR memories is 32 bits per channel, with 16 channels for a graphics card with a 512-bit memory interface.[8] HBM supports up to 4 GB per package.
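As a worked example of the arithmetic above (an illustrative Python sketch; the figures are the ones cited in this section, not part of any HBM software interface):

    # Bus-width arithmetic for the HBM configurations described above.
    channel_width_bits = 128     # one HBM channel is 128 bits wide
    channels_per_die = 2         # two 128-bit channels per DRAM die
    dies_per_stack = 4           # a "4-Hi" stack

    channels_per_stack = channels_per_die * dies_per_stack        # 8 channels
    stack_width_bits = channels_per_stack * channel_width_bits    # 1024 bits

    stacks_per_gpu = 4
    gpu_bus_width_bits = stacks_per_gpu * stack_width_bits        # 4096 bits

    print(stack_width_bits, gpu_bus_width_bits)  # -> 1024 4096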
Interface
The HBM DRAM is tightly coupled to the host compute die with a distributed interface. The interface is divided into channels that are completely independent of one another and not necessarily synchronous to each other. The HBM DRAM uses a wide-interface architecture to achieve high-speed, low-power operation, with a 500 MHz differential clock, CK_t/CK_c. Commands are registered at the rising edge of CK_t (the falling edge of CK_c). Each channel interface maintains a 128-bit data bus operating at DDR data rates. HBM supports transfer rates of 1 GT/s per pin (transferring 1 bit per transfer), yielding an overall package bandwidth of 128 GB/s.[9]
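The 128 GB/s figure follows directly from the interface width and the per-pin transfer rate. A minimal sketch of the calculation in Python (illustrative only; the function name is invented for this example):

    def package_bandwidth_gbps(width_bits: int, rate_gtps: float) -> float:
        """Peak package bandwidth in GB/s: width (bits) x per-pin rate (GT/s) / 8 bits per byte."""
        return width_bits * rate_gtps / 8

    print(package_bandwidth_gbps(1024, 1.0))  # HBM1: 128.0 GB/s per package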
HBM 2
The second generation of High Bandwidth Memory, HBM 2, also specifies up to eight dies per stack and doubles the per-pin transfer rate to 2 GT/s. Retaining the 1024-bit-wide access, HBM2 is able to reach 256 GB/s memory bandwidth per package. The HBM2 spec allows up to 8 GB per package. HBM2 is predicted to be especially useful for performance-sensitive consumer applications such as virtual reality.[10]
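Applying the same width times rate arithmetic to HBM2's doubled pin rate reproduces the quoted figure (again, purely an illustrative check):

    # 1024-bit interface at 2 GT/s per pin, 8 bits per byte
    print(1024 * 2.0 / 8)  # HBM2: 256.0 GB/s per package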
On January 19, 2016, Samsung announced early mass production of HBM2, at up to 4 GB per stack.[11][12] SK Hynix also announced availability of 4 GB stacks in August 2016.[13]
HBM 3
A third generation of High Bandwidth Memory, HBM 3, was announced in 2016.[14][15] HBM3 is expected to offer increased memory capacity, greater bandwidth, lower voltage, and lower costs. The increased capacity is expected to come from greater density per die and more dies per stack. Bandwidth is expected to be up to 512 GB/s. No release date has been announced, though Samsung expects volume production by 2020.
History
The development of High Bandwidth Memory began at AMD in 2008 to solve the problem of the ever-increasing power usage and form factor of computer memory. Among other things, AMD developed procedures to solve the die-stacking problems, with a team led by Senior AMD Fellow Bryan Black. AMD enlisted partners from the memory industry (SK Hynix), the interposer industry (UMC), and the packaging industry (Amkor Technology and ASE) to help realize its vision of HBM.[16] High-volume manufacturing began at a Hynix facility in Icheon, Korea, in 2015.
HBM was adopted as industry standard JESD235 by JEDEC in October 2013, following a proposal by AMD and SK Hynix in 2010.[4] The first chip to utilize HBM was AMD's Fiji, released in June 2015 and powering the AMD Radeon R9 Fury X.[17][2][18]
HBM2 was accepted by JEDEC as standard JESD235a in January 2016.[5] The first GPU chip to utilize HBM2 was the Nvidia Tesla P100, officially announced in April 2016.[19][20]
Future
At Hot Chips in August 2016, both Samsung and Hynix announced next-generation HBM memory technologies.[21][22] Both companies announced high-performance products expected to have increased density, increased bandwidth, and lower power. Samsung also announced a lower-cost version of HBM under development targeting mass markets. Removing the buffer die and decreasing the number of TSVs lowers cost, though at the expense of a decreased overall bandwidth (200 GB/s).
See also
- Stacked DRAM
- Chip stack multi-chip modules
- Hybrid Memory Cube, a stacked memory standard from Micron (2011)
References
- ^ ISSCC 2014 Trends page 118 "High-Bandwidth DRAM"
- ^ a b Smith, Ryan (2 July 2015). "The AMD Radeon R9 Fury X Review". Anandtech. Retrieved 1 August 2016.
- ^ Morgan, Timothy Prickett (March 25, 2014). "Future Nvidia 'Pascal' GPUs Pack 3D Memory, Homegrown Interconnect". EnterpriseTech. Retrieved 26 August 2014. "Nvidia will be adopting the High Bandwidth Memory (HBM) variant of stacked DRAM that was developed by AMD and Hynix."
- ^ a b HIGH BANDWIDTH MEMORY (HBM) DRAM (JESD235), JEDEC, October 2013
- ^ a b "JESD235a: High Bandwidth Memory 2". 2016-01-12.
- ^ HBM: Memory Solution for Bandwidth-Hungry Processors, Joonyoung Kim and Younsu Kim, SK hynix // Hot Chips 26, August 2014
- ^ Where Are DRAM Interfaces Headed? // EETimes, 4/18/2014 "The Hybrid Memory Cube (HMC) and a competing technology called High-Bandwidth Memory (HBM) are aimed at computing and networking applications. These approaches stack multiple DRAM chips atop a logic chip."
- ^ Highlights of the High-Bandwidth Memory (HBM) Standard. Mike O'Connor, Sr. Research Scientist, Nvidia // The Memory Forum, June 14, 2014
- ^ "High-Bandwidth Memory (HBM)" (PDF). AMD. 2015-01-01. Retrieved 2016-08-10.
- ^ Valich, Theo. "NVIDIA Unveils Pascal GPU: 16GB of memory, 1TB/s Bandwidth". VR World. Retrieved 2016-01-24.
- ^ "Samsung begins mass producing world's fastest DRAM based on newest High Bandwidth Memory (HBM) interface". Samsung Newsroom. https://news.samsung.com/global/samsung-begins-mass-producing-worlds-fastest-dram-based-on-newest-high-bandwidth-memory-hbm-interface
- ^ "Samsung announces mass production of next-generation HBM2 memory". ExtremeTech. http://www.extremetech.com/extreme/221473-samsung-announces-mass-production-of-next-generation-hbm2-memory
- ^ Shilov, Anton (1 August 2016). "SK Hynix Adds HBM2 to Catalog". Anandtech. Retrieved 1 August 2016.
- ^ Walton, Mark (23 August 2016). "HBM3: Cheaper, up to 64GB on-package, and terabytes-per-second bandwidth". Ars Technica. Retrieved 3 February 2017.
- ^ Ferriera, Bruno (23 August 2016). "HBM3 and GDDR6 emerge fresh from the oven of Hot Chips". Tech Report. Retrieved 3 February 2017.
- ^ High-Bandwidth Memory (HBM) from AMD: Making Beautiful Memory. AMD.
- ^ Smith, Ryan (19 May 2015). "AMD HBM Deep Dive". Anandtech. Retrieved 1 August 2016.
- ^ AMD Ushers in a New Era of PC Gaming including World's First Graphics Family with Revolutionary HBM Technology. AMD.
- ^ Smith, Ryan (5 April 2016). "Nvidia announces Tesla P100 Accelerator". Anandtech. Retrieved 1 August 2016.
- ^ "Tesla P100". Nvidia. http://www.nvidia.com/object/tesla-p100.html
- ^ Smith, Ryan (23 August 2016). "Hot Chips 2016: Memory Vendors Discuss Ideas for Future Memory Tech - DDR5, Cheap HBM & More". Anandtech. Retrieved 23 August 2016.
- ^ Walton, Mark (23 August 2016). "HBM3: Cheaper, up to 64GB on-package, and terabytes-per-second bandwidth". Ars Technica. Retrieved 23 August 2016.
External links
- HIGH BANDWIDTH MEMORY (HBM) DRAM (JESD235), JEDEC, October 2013
- 25.2 A 1.2V 8Gb 8-channel 128GB/s high-bandwidth memory (HBM) stacked DRAM with effective microbump I/O test methods using 29 nm process and TSV. D.U. Lee, SK hynix, ISSCC 2014 doi:10.1109/ISSCC.2014.6757501