POWER9
General information | |
---|---|
Launched | 2017 |
Designed by | IBM |
Common manufacturer | |
Performance | |
Max. CPU clock rate | 4 GHz[1] |
Cache | |
L1 cache | 32+32 KB per core[1] |
L2 cache | 512 KB per core[1] |
L3 cache | 120 MB per chip[1] |
L4 cache | via Centaur[1] |
Architecture and classification | |
Technology node | 14 nm |
Instruction set | Power Architecture (Power ISA v.3.0) |
Physical specifications | |
Cores | |
History | |
Predecessor | POWER8 |
POWER, PowerPC, and Power ISA architectures |
---|
NXP (formerly Freescale and Motorola) |
IBM |
|
IBM/Nintendo |
Other |
Related links |
Cancelled in gray, historic in italic |
POWER9 are a family of superscalar symmetric multiprocessors based on the Power Architecture, and introduced in August 2016, at the Hot Chips conference.[2] The POWER9 based processors will be manufactured using a 14 nm FinFET process,[3] and will come in at least four versions; 12- and 24-core versions for scale out and 12- and 24-core versions for scale up applications,[3] and possibly more since the POWER9 architecture is open for licensing and modification by the OpenPOWER Foundation members.[4]
Systems using POWER9 are slated to be available in 2017.[2]
Variants
POWER9 devices will be targeting two different markets, the scale-out and scale-up markets, each variant come in 12- and 24-core versions:[1]
- IBM POWER9 SO – scale-out version, optimized for dual socket computers with up to 120 GB/s bandwidth to directly attached DDR4 memory[1][3][5] (targeted for release in 2017)
- IBM POWER9 SU – scale-up version, optimized for four sockets or more, large NUMA machines with up to 230 GB/s bandwidth to buffered memory[1][5]
Design
The POWER9 core comes in two variants, one is four-way multithreading called SMT4 and one eight-way called SMT8.[1] The SMT4- and SMT8-cores are quite similar, in that they consist of a number of so called slices fed by common schedulers. A slice is a rudimentary 64-bit single threaded processing core with load store unit (LSU), integer unit (ALU) and a vector scalar unit (VSU, doing SIMD and floating point). A super-slice is the combination of two slices. An SMT4-core consists of a 32 KB L1 cache, a 32 KB L1 data cache, an instruction fetch unit (IFU) and a instruction sequencing unit (ISU) which feeds two super-slices. An SMT8-core has two sets of L1 caches and, IFUs and ISUs to feed four super-slices. The result is that the 12-core and 24-core versions of POWER9 each consist of the same amount of slices, i.e. 96 each and the same amount of L1 cache.
A POWER9 core, whether SMT4 and SMT8, has a 12-stage pipeline (five stage shorter than its predecessor, the POWER8) but aims to retain the clock frequency of around 4 GHz.[1] It will be the first to incorporate elements of the Power ISA v.3.0 that was released in December 2015, including the VSX-3 instructions[6] The POWER9 design is made to be modular and used in more processor variants and used for licensing, on a different fabrication process than IBM's.[5] On chip are co-processors for compression and cryptography, as well as a large low-latency eDRAM L3 cache.[3]
I/O
A lot of facilities are on-chip for helping with massive off-chip I/O performance:
- The SO version has integrated DDR4 controllers for directly attached RAM, while the SU will use the off-chip Centaur architecture introduced with POWER8 to include high performance eDRAM L4 cache and memory controllers for DDR4 RAM.[1][3]
- The Blueink facility running NVLink v.2 interconnects for close attachment of graphics co-processors from Nvidia and Open CAPI accelerators.[7]
- General purpose PCIe v.4 connections for attaching regular ASICs, FPGAs and other periferals as well as CAPI 2.0 and CAPI 1.0 devices designed for POWER8.
- Multiprocessor (Symmetric multiprocessor system) links to connect other POWER9 processors in on the same motherboard, or in other closely attached enclosures.
Supercomputers
The United States Department of Energy together with Oak Ridge National Laboratory and Lawrence Livermore National Laboratory have contracted IBM and Nvidia to build two supercomputers, the Summit and the Sierra, that will be based on POWER9 processors coupled with Nvidia's Volta GPUs. These systems are slated to go online in 2017.[8][9][10]
See also
References
- ^ a b c d e f g h i j k Big Blue Aims For The Sky With Power9
- ^ a b c IBM's blazing Power9 chip is coming, and here's what you need to know
- ^ a b c d e f OpenPOWER and the Roadmap Ahead
- ^ Power9: Google gives Intel a chip-flip migraine, IBM tries to lures big biz
- ^ a b c The Prospects For A Power9 Revolution
- ^ Add full Power ISA 3.0 / POWER9 binutils support
- ^ Nvidia's NVLink 2.0 will first appear in Power9 servers next year
- ^ NVIDIA Volta, IBM POWER9 Land Contracts For New US Government Supercomputers
- ^ ORNL Summit home page
- ^ Lawrence Livermore signs contract with IBM