Puma (microarchitecture)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Puma - Family 16h (2nd-gen)
Produced From mid-2014 to present
Common manufacturer(s)
Max. CPU clock rate 1.35 GHz to 2.5 GHz
Min. feature size 28 nm
Instruction set AMD64 (x86-64)
Cores 2–4
L1 cache 64 KB per core[1]
L2 cache 1 MB to 2 MB shared
Socket(s)
Predecessor Jaguar - Family 16h
GPU Radeon Rx: 128 cores, 300-800 Mhz
Core name(s)
  • Beema
  • Mullins
Brand name(s)
This article is about the microarchitecture. For the 2008 mobile platform, see AMD mobile platform § Puma platform (2008).

The Puma Family 16h is a low-power microarchitecture by AMD for its APUs. It succeeds the Jaguar as a second-generation version, targets the same market, and belongs to the same AMD architecture Family 16h. The Beema line of processors are aimed at low-power notebooks, and Mullins are targeting the tablet sector.

Design[edit]

The Puma cores use the same microarchitecture as Jaguar, and inherits the design:

Instruction set support[edit]

Like Jaguar, the Puma core has support for the following instruction sets and instructions: MMX, SSE, SSE2, SSE3, SSSE3, SSE4a, SSE4.1, SSE4.2, AVX, F16C, CLMUL, AES, BMI1, MOVBE (Move Big-Endian instruction), XSAVE/XSAVEOPT, ABM (POPCNT/LZCNT), and AMD-V.[1]

Features and ASICs[edit]

Features of AMD Accelerated Processing Units
Brand Llano Trinity Richland Kaveri Carrizo Bristol Ridge Raven Ridge    Desna,
Ontario,
Zacate
Kabini,
Temash
Beema,
Mullins
Carrizo-L Stoney Ridge
Platform Desktop, Mobile Mobile, Desktop Desktop, Mobile Ultra-mobile
Released Aug 2011 Oct 2012 Jun 2013 Jan 2014 Jun 2015 Jun 2016 Mar 2017 Jan 2011 May 2013 Q2 2014 May 2015 June 2016
Fab. (nm) GlobalFoundries 32 SOI 28 14 TSMC 40 28
Die size (mm2) 228 246 245 244.62 250.04 TBA 75 (+ 28 FCH) ~107 TBA 125
Socket FM1, FS1 FM2, FS1+, FP2 FM2+, FP3 FP4, FM2+ AM4, FP4 AM4 FT1 AM1, FT3 FT3b FP4 TBA
CPU architecture AMD 10h Piledriver Steamroller Excavator Zen Bobcat Jaguar Puma Puma+[3] Excavator
Memory support DDR3-1866
DDR3-1600
DDR3-1333
DDR3-2133
DDR3-1866
DDR3-1600
DDR3-1333
DDR4-2400
DDR4-2133
DDR4-1866
DDR4-1600
DDR3L-1333
DDR3L-1066
DDR3L-1866
DDR3L-1600
DDR3L-1333
DDR3L-1066
DDR3L-1866
DDR3L-1600
DDR3L-1333
Up to
DDR4-2133
3D engine[a] TeraScale 2
(VLIW5)
TeraScale 3
(VLIW4)
GCN 2nd Generation
(Mantle, HSA)
GCN 3rd
(Mantle, HSA)
GCN 4th
(Mantle, HSA)
TeraScale 2
(VLIW5)
GCN 2nd GCN 3rd
Up to 400:20:8 Up to 384:24:6 Up to 512:32:8 Up to 768:48:12 80:8:4 128:8:4 Up to 192:?:?
IOMMUv1 IOMMUv2 IOMMUv1[4] TBA TBA
Unified Video Decoder UVD 3 UVD 4.2 UVD 6 TBA UVD 3 UVD 4 UVD 4.2 UVD 6 UVD 6.1
Video Coding Engine N/A VCE 1.0 VCE 2.0 VCE 3.1 TBA N/A VCE 2.0 VCE 3.1
GPU power saving PowerPlay PowerTune N/A Enduro PowerTune
Max. displays[b] 2–3 2–4 2–4 3 4 TBA 2 TBA TBA
TrueAudio N/A [6] N/A[4] TBA
FreeSync N/A N/A TBA
/drm/radeon[7][8][9] N/A N/A
/drm/amd/amdgpu[10] N/A Experimental N/A Experimental
  1. ^ Unified shaders : texture mapping units : render output units
  2. ^ To feed more than two displays, the additional panels must have native DisplayPort support.[5] Alternatively active DisplayPort-to-DVI/HDMI/VGA adapters can be employed.

Improvements over Jaguar[edit]

  • 19% CPU core leakage reduction at 1.2V[11]
  • 38% GPU leakage reduction
  • 500 mW reduction in memory controller power
  • 200 mW reduction in display interface power
  • Chassis temperature aware turbo boost[12]
  • Selective boosting according to application needs (intelligent boost)
  • Support for ARM TrustZone via integrated Cortex-A5 processor
  • Support for DDR3L-1866 memory[13]

Processors[edit]

Desktop/Mobile (Beema)[edit]

Family Model Socket CPU GPU TDP Memory
Cores Frequency Max. Turbo L2 Cache Model Config. Max. Freq.
A8 6410 Socket FT3b 4 2.00 GHz 2.4 GHz 2 MB Radeon R5 128:?:? 800 MHz 15 W DDR3L-1866
A6 6310 1.80 GHz Radeon R4 800 MHz
A4 6250J 2.00 GHz N/A Radeon R3 600 MHz 25 W DDR3L-1600
A4 6210 1.80 GHz Radeon R3 600 MHz 15 W
E2 6110 1.50 GHz Radeon R2 500 MHz
E1 6010 2 1.35 GHz 1 MB 350 MHz 10 W DDR3L-1333

Tablet (Mullins)[edit]

Family Model CPU GPU Power Memory
Cores Frequency Max. Turbo L2 Cache Model Config. Max. Freq. TDP SDP
A10 Micro 6700T 4 1.2 GHz 2.2 GHz 2 MB Radeon R6 128:?:? 500 MHz 4.5 W 2.8 W DDR3L-1333
A6 Micro 6500T 1.8 GHz Radeon R4 401 MHz
A4 Micro 6400T 1.0 GHz 1.6 GHz Radeon R3 350 MHz
E1 Micro 6200T 2 1.4 GHz 1 MB Radeon R2 300 MHz 3.95 W DDR3L-1066

References[edit]

  1. ^ a b "Software Optimization Guide for Family 16h Processors". AMD. Retrieved August 3, 2013. 
  2. ^ "AMD launches new Beema, Mullins SoCs". ExtremeTech. 2014-04-29. Retrieved 2014-05-02. 
  3. ^ "AMD Mobile “Carrizo” Family of APUs Designed to Deliver Significant Leap in Performance, Energy Efficiency in 2015" (Press release). 2014-11-20. Retrieved 2015-02-16. 
  4. ^ a b Thomas De Maesschalck (2013-11-14). "AMD teases Mullins and Beema tablet/convertibles APU". Retrieved 2015-02-24. 
  5. ^ "How do I connect three or More Monitors to an AMD Radeon™ HD 5000, HD 6000, and HD 7000 Series Graphics Card?". AMD. Retrieved 2014-12-08. 
  6. ^ "A technical look at AMD’s Kaveri architecture". Semi Accurate. Retrieved 6 July 2014. 
  7. ^ Airlie, David (2009-11-26). "DisplayPort supported by KMS driver mainlined into Linux kernel 2.6.33". Retrieved 2016-01-16. 
  8. ^ Airlie, David (2009-11-26). "DisplayPort supported by KMS driver mainlined into Linux kernel 2.6.33". Retrieved 2014-07-02. 
  9. ^ "Radeon feature matrix". freedesktop.org. Retrieved 2016-01-10. 
  10. ^ Deucher, Alexander (2015-09-16). "XDC2015: AMDGPU" (PDF). Retrieved 2016-01-16. 
  11. ^ Shimpi, Anand. "AMD Beema/Mullins Architecture & Performance Preview". AnandTech. Retrieved 29 April 2014. 
  12. ^ Shimpi, Anand. "New Turbo Boost, The Lineup and Trustzone". AnandTech. Retrieved 29 April 2014. 
  13. ^ Woligroski, Don. "Meet The Mullins And Beema Tablet APUs". Toms Hardware. Retrieved 29 April 2014. 

External links[edit]