Intel Core (microarchitecture)

From Wikipedia, the free encyclopedia

  (Redirected from Intel Core Microarchitecture)
Jump to: navigation, search

The Intel Core microarchitecture (previously known as the Intel Next-Generation Micro-Architecture, or NGMA) is a multi-core processor microarchitecture unveiled by Intel in Q1 2006. It is based around an updated version of the Yonah core and could be considered the latest iteration of the Intel P6 microarchitecture, which traces its history back to the 1995 Pentium Pro. The extreme power consumption of NetBurst-based processors and the resulting inability to effectively increase clock speed was the primary reason Intel abandoned the NetBurst architecture. The Intel Core Microarchitecture was designed by the Intel Israel (IDC) team that previously designed the Pentium M mobile processor.

The architecture features lower power usage than before and is competitive with AMD in heat production.[citation needed] It has multiple cores and hardware virtualization support (marketed as Intel VT), as well as Intel 64 and SSSE3.

The first processors that used this architecture were code-named Merom, Conroe, and Woodcrest; Merom is for mobile computing, Conroe is for desktop systems, and Woodcrest is for servers and workstations. While architecturally identical, the three processor lines differ in the socket used, bus speed, and power consumption. Mainstream Core-based processors are branded Pentium Dual Core and low end branded Celeron; server and workstation Core-based processors are branded Xeon, while desktop and mobile Core-based processors are branded as Core 2. Despite their name, processors sold as Intel Core do not actually use the Core microarchitecture.

Intel CPU core roadmaps from NetBurst and Pentium M to Sandy Bridge. The Core family of processors are those with the light green background.

Contents

[edit] Technology

Intel Core 2 microarchitecture.

The Intel Core Microarchitecture is designed from the ground up, but similar to the Pentium M microarchitecture in design philosophy. The pipeline is 14 stages long — less than half of Prescott's, a signature feature of wide order execution cores. Core's execution unit is 4 issues wide, compared to the 3-issue cores of P6, P-M (Banias, Dothan, and Yonah), and NetBurst microarchitectures. The new architecture is a dual core design with linked L1 cache and shared L2 cache engineered for maximum performance per watt and improved scalability.

One new technology included in the design is Macro-Ops Fusion, which combines two x86 instructions into a single micro-operation. For example, a common code sequence like a compare followed by a conditional jump would become a single micro-op.

Other new technologies include 1 cycle throughput (2 cycles previously) of all 128-bit SSE instructions and a new power saving design. All components will run at minimum speed, ramping up speed dynamically as needed (similar to AMD's Cool'n'Quiet power-saving technology, as well as Intel's own SpeedStep technology from earlier mobile processors). This allows the chip to produce less heat, and consume as little power as possible.

For most Woodcrest CPUs, the front side bus (FSB) runs at 1333 MT/s; however, this is scaled down to 1066 MT/s for lower end 1.60 and 1.86 GHz variants.[1][2] The Merom mobile variant was initially targeted to run at a FSB of 667 MT/s while the second wave of Meroms, supporting 800 MT/s FSB, were released as part of the Santa Rosa platform with a different socket in May 2007. The desktop-oriented Conroe began with models having an FSB of 800 MT/s or 1066 MT/s with a 1333 MT/s line officially launched on July 22, 2007.

In 2007, AMD released a series of videos claiming that the FSB will prove to be the weak link for Intel, as the Core microarchitecture uses a shared bus, unlike their own HyperTransport.[citation needed] While not so critical in the mobile and desktop segments, this might be the handicap which will prevent Woodcrest-MP from taking the performance lead from AMD Opteron on systems with more than 2 cores per socket. Intel attempted to alleviate this problem by the use of advanced prefetchers and memory disambiguation which try to hide main-memory-access latency. However, this is mitigated to some degree by the use of a separate front-side bus for each physical CPU package.

The power consumption of these new processors is extremely low—average use energy consumption is to be in the 1-2 watt range in ultra low voltage variants, with thermal design powers (TDPs) of 65 watts for Conroe and most Woodcrests, 80 watts for the 3.0 GHz Woodcrest, and 40 watts for the low-voltage Woodcrest. However, this is subject to change. In comparison, an AMD Opteron 875HE processor consumes 55 watts, while the new Energy Efficient Socket AM2 line fits in the 35 watt thermal envelope (specified a different way so not directly comparable). Merom, the mobile variant, is listed at 35 watts TDP for standard versions and 5 watts TDP for Ultra Low Voltage (ULV) versions.

Previously, Intel warned that it would now focus on power efficiency, rather than raw performance. However, at IDF in the spring of 2006, Intel advertised both. Some of the promised numbers are:

  • 20% more performance for Merom at the same power level (compared to Core Duo)
  • 40% more performance for Conroe at 40% less power (compared to Pentium D)
  • 80% more performance for Woodcrest at 35% less power (compared to the original dual-core Xeon)

[edit] Steppings

The Core microarchitecture uses a number of steppings, which unlike previous microarchitectures not only represent incremental improvements but also different sets of features like cache size and low power modes. Most of these steppings are used across brands, typically by disabling some of the features and limiting clock frequencies on low-end chips.

Steppings with a reduced cache size use a separate naming scheme, which means that the releases are no longer in alphabetic order. Additional steppings have been used in internal and engineering samples, but are not listed in the tables.

Many of the high-end Core 2 and Xeon processors use Multi-Chip Modules of two or three chips in order to get larger cache sizes or more than two cores.

[edit] Steppings using 65 nm process

Mobile (Merom) Desktop (Conroe) Desktop (Kentsfield) Server (Woodcrest, Clovertown, Tigerton)
Stepping Released Area CPUID L2 cache Max. clock Celeron Pentium Core 2 Celeron Pentium Core 2 Xeon Core 2 Xeon Xeon
B2 Jul 2006 143 mm² 06F6 4 MiB 2.93 GHz M5xx T5000 T7000 L7000 E6000 X6000 3000 5100
B3 Nov 2006 143 mm² 06F7 4 MiB 3.00 GHz Q6000 QX6000 3200 5300
L2 Jan 2007 111 mm² 06F2 2 MiB 2.13 GHz T5000 U7000 E2000 E4000 E6000 3000
E1 May 2007 143 mm² 06FA 4 MiB 2.80 GHz M5xx T7000 L7000 X7000
G0 Apr 2007 143 mm² 06FB 4 MiB 3.00 GHz M5xx T7000 L7000 X7000 E2000 E4000 E6000 3000 Q6000 QX6000 3200 5100 5300 7200 7300
G2 Mar 2009 143 mm² 06FB 4 MiB 2.16 GHz M5xx T7000 L7000
M0 Jul 2007 111 mm² 06FD 2 MiB 2.40 GHz 5xx T1000 T2000 T3000 T5000 T7000 U7000 E1000 E2000 E4000
A1 Jun 2007 81 mm² 10661 1 MiB 2.20 GHz M5xx U2000 220 4x0

Steppings B2/B3, E1 and G0 of model 15 (cpuid 06fx) processors are evolutionary steps of the standard Merom/Conroe die with 4 MiB L2 cache, with the short-lived E1 stepping only being used in mobile processors. Stepping L2 and M0 are the "Allendale" chips with just 2 MiB L2 cache, reducing production cost and power consumption for low-end processors.

The G0 and M0 steppings improve idle power consumption in C1E state and add the C2E state in desktop processors. In mobile processors, all of which support C1 through C4 idle states, steppings E1, G0, and M0 add support for the Mobile Intel 965 Express (Santa Rosa) platform with Socket P, while the earlier B2 and L2 steppings only appear for the Socket M based Mobile Intel 945 Express (Napa refresh) platform.

The model 22 stepping A1 (cpuid 10661h) marks a significant design change, with just a single core and 1 MiB L2 cache further reducing the power consumption and manufacturing cost for the low-end. Like the earlier steppings, A1 is not used with the Mobile Intel 965 Express platform.

Steppings G0, M0 and A1 mostly replaced all older steppings in 2008. In 2009, a new stepping G2 was introduced to replace the original stepping B2[3].

[edit] Steppings using 45 nm process

Mobile (Penryn) Desktop (Wolfdale) Desktop (Yorkfield) Server (Harpertown) Server (Dunnington)
Stepping Released Area CPUID L2 cache Max. clock Celeron Pentium Core 2 Pentium Core 2 Xeon Core 2 Xeon Xeon
C0 Nov 2007 107 mm² 10676 6 MiB 3.00 GHz E8000 P7000 T8000 T9000 P9000 SP9000 SL9000 X9000 E8000 E3100 QX9000 5200 5400
M0 Mar 2008 82 mm² 10676 3 MiB 2.40 GHz 7xx SU3000 P7000 P8000 T8000 SU9000 E5000 E2000 E7000
C1 Mar 2008 107 mm² 10677 6 MiB 3.20 GHz Q9000 QX9000 X3300
M1 Mar 2008 82 mm² 10677 3 MiB 2.50 GHz Q8000 Q9000 X3300
E0 Aug 2008 107 mm² 1067A 6 MiB 3.33 GHz T9000 P9000 SP9000 SL9000 Q9000 QX9000 E8000 E3100 Q9000 Q9000S QX9000 X3300 5200 5400
R0 Aug 2008 82 mm² 1067A 3 MiB 2.93 GHz 7xx 900 T4000 SU3000 T6000 P8000 SU9000 E5000 E7000 Q8000 Q8000S Q9000 Q9000S X3300
A1 Sep 2008 503 mm² 106D1 3 MiB 2.67 GHz 7400

In the model 23 (cpuid 01067xh), Intel started marketing stepping with full (6 MiB) and reduced (3 MiB) L2 cache at the same time, and giving them identical cpuid values. All steppings have the new SSE4.1 instructions. Stepping C1/M1 was a bug fix version of C0/M0 specifically for quad core processors and only used in those. Stepping E0/R0 adds two new instructions (XSAVE/XRSTOR) and replaces all earlier steppings.

In mobile processors, stepping C0/M0 is only used in the Intel Mobile 965 Express (Santa Rosa refresh) platform, whereas stepping E0/R0 supports the later Intel Mobile 4 Express chipsets with the Montavina platform.

Model 30 stepping A1 (cpuid 106d1h) adds an L3 cache as well as six instead of the usual two cores, which leads to an unusually large die size of 503 mm².[4] As of February 2008, it has only found its way into the very high-end Xeon 7400 series (Dunnington).

[edit] Current processors

[edit] Laptops

Codename Processor name Processor no Architecture No of cores Clock speed FSB L2 cache
Merom-1024 Celeron M 520, 530 65 nm 1 1.6, 1.73 GHz 533 MT/s 1 MiB
Merom-L 520, 530, 540, 550, 560 1.6, 1.73, 1.86, 2, 2.13 GHz
Celeron M ULV 523 933 MHz 533 MT/s 1 MiB
Merom-L Core 2 Solo ULV U2100, U2200 65 nm 1 1.06, 1.2 GHz 533 MT/s 1 MiB
Merom-1M Pentium Dual Core T2310, T2330, T2370, T2390 65 nm 2 1.46, 1.6, 1.73, 1.86 GHz 533 MT/s 1 MiB
Merom-2M Core 2 Duo ULV U7500, U7600, U7700 65 nm 2 1.06, 1.2, 1.33 GHz 533 MT/s 2 MiB
Core 2 Duo T5300 1.73 GHz 533 MT/s
T5250, T5450, T5550, T5750, T5850 1.5, 1.67, 1.83, 2.0, 2.16 GHz 667 MT/s
T5500, T5600 1.67, 1.83 GHz
T5270, T5470, T7100, T7250 1.4, 1.6, 1.8, 2.0 GHz 800 MT/s
Merom Core 2 Duo LV L7200, L7400 65 nm 2 1.33, 1.5 GHz 667 MT/s 4 MiB
L7300, L7500, L7700 1.4, 1.6, 1.8 GHz 800 MT/s
Core 2 Duo T5200 1.6 GHz 533 MT/s 2 MiB
T5500, T5600 1.67, 1.83 GHz 667 MT/s
T7200, T7400, T7600 2, 2.16, 2.33 GHz 4 MiB
T7300, T7500, T7700, T7800 2, 2.2, 2.4, 2.6 GHz 800 MT/s
Penryn-3M Core 2 Duo T8100, T8300 45 nm 2 2.1, 2.4 GHz 800 MT/s 3 MiB
Penryn Core 2 Duo MV P8400, P8600 45 nm 2 2.26, 2.4 GHz 1066 MT/s 3 MiB
P9500 2.53 GHz 6 MiB
Core 2 Duo T9300, T9500 2.5, 2.6 GHz 800 MT/s 6 MiB
T9400, T9600 2.53, 2.8 GHz 1066 MT/s
Merom XE Core 2 Extreme X7800, X7900 65 nm 2 2.6, 2.8 GHz 800 MT/s 4 MiB
Penryn XE Core 2 Extreme X9000 45 nm 2 2.8 GHz 800 MT/s 6 MiB

[edit] Desktops

Codename Processor name Processor no Architecture No of cores Clock speed FSB L2 cache Virtualization
Conroe-L Celeron 220, 420, 430, 440, 450 65 nm 1 1.2, 1.6, 1.8, 2, 2.2 GHz 533/800 MT/s 512 kiB No
Allendale E1200, E1400, E1500 2 1.6, 2.0, 2.2 GHz
Allendale Pentium Dual Core E2140, E2160, E2180, E2200, E2220 65 nm 2 1.6, 1.8, 2, 2.2, 2.4 GHz 800 MT/s 1 MiB
Core 2 Duo E4300, E4400, E4500, E4600, E4700 1.8, 2, 2.2, 2.4, 2.6 GHz 2 MiB
E6300, E6400 1.86, 2.13 GHz 1066 MT/s Yes
Conroe Core 2 Duo E6300, E6400 65 nm 2 1.86, 2.13 GHz 1066 MT/s 2 MiB
E6320, E6420 4 MiB
E6600, E6700 2.40, 2.67 GHz
E6540 2.33 1333 MT/s
E6550, E6750, E6850 2.33, 2.67, 3 GHz
Wolfdale Pentium Dual Core E5200 45 nm 2 2.5 GHz 800 MT/s 2 MiB No
E5300, E5400 2.6, 2.7 GHz
Core 2 Duo E7200, E7300, E7400, E7500 2.53, 2.66, 2.8, 2.93 GHz 1066 MT/s 3 MiB
E8190 2.67 1333 MT/s 6 MiB
E8200, E8300, E8400, E8500, E8600 2.67, 2.83, 3, 3.16, 3.33 GHz Yes
Kentsfield Core 2 Quad Q6600, Q6700 65 nm 4 2.4, 2.67 GHz 1066 MT/s 8 MiB
Yorkfield Core 2 Quad Q8200, Q8200S, Q8300 45 nm 4 2.33, 2.5 GHz 1333 MT/s 4 MiB No
Q9300, Q9400, Q9400S 2.5, 2.66 GHz 6 MiB Yes
Q9450, Q9550, Q9550S, Q9650 2.67, 2.83, 3 GHz 12 MiB
Conroe XE Core 2 Extreme X6800 65 nm 2 2.93 GHz 1066 MT/s 4 MiB
Kentsfield XE Core 2 Extreme QX6700, QX6800 65 nm 4 2.67, 2.93 GHz 1066 MT/s 8 MiB
QX6850 3 GHz 1333 MT/s
Yorkfield XE Core 2 Extreme QX9650 45 nm 4 3 GHz 1333 MT/s 12 MiB
QX9770, QX9775 3.2 GHz 1600 MT/s

[edit] Servers and workstations

Codename Processor name Processor no Architecture No of cores Clock speed FSB L2 cache
Allendale Dual-Core Xeon 3040, 3050 65 nm 2 1.86, 2.13 GHz 1066 MT/s 2 MiB
Conroe Dual-Core Xeon 3040, 3050 65 nm 2 1.86, 2.13 GHz 1066 MT/s 2 MiB
3060, 3070 2.4, 2.67 GHz 4 MiB
3065, 3075, 3085 2.33, 2.67, 3 GHz 1333 MT/s
Wolfdale Dual-Core Xeon E3110 45 nm 2 3.0 GHz 1333 MT/s 6 MiB
Woodcrest Dual-Core Xeon LV 5128, 5138 65 nm 2 1.86, 2.13 GHz 1066 MT/s 4 MiB
5148 2.33 GHz 1333 MT/s
Dual-Core Xeon 5110, 5120 1.6, 1.86 GHz 1066 MT/s
5130, 5140, 5150, 5160 2, 2.33, 2.67, 3 GHz 1333 MT/s
Wolfdale-DP Dual-Core Xeon E5205 45 nm 2 1.86 GHz, 1066 MT/s 6 MiB
E5220, E5240 2.33, 3, 3.16 GHz 1333 MT/s
X5260 3.33 GHz
X5272 3.4 GHz 1600 MT/s
Kentsfield Quad-Core Xeon X3210, X3220, X3230 65 nm 4 2.13, 2.4, 2.67 GHz 1066 MT/s 8 MiB
Clovertown Quad-Core Xeon LV L5310, L5320 65 nm 4 1.6, 1.86 GHz 1066 MT/s 8 MiB
L5335 2 GHz 1333 MT/s
Quad-Core Xeon E5310, E5320 1.6, 1.86 GHz 1066 MT/s
E5330, E5340, E5350 2.13, 2.4, 2.67 GHz
E5335, E5345, X5355, X5365 2, 2.33, 2.67, 3 GHz 1333 MT/s
Harpertown Quad-Core Xeon E5405, E5410, E5420, E5430 45 nm 4 2, 2.33, 2.5, 2.67 GHz 1333 MT/s 12 MiB
E5440, E5450, X5450, X5460 2.83, 3, 3, 3.16 GHz
E5462, E5472, X5472, X5482 2.8, 3, 3, 3.2 GHz 1600 MT/s
Harpertown LV Quad-Core Xeon LV L5410, L5420, L5430 45 nm 4 2.33, 2.5, 2.67 GHz 1333 MT/s 12 MiB
Tigerton-DC Dual-Core Xeon E7210, E7220 65 nm 2 2.4, 2.93 GHz 1066 MT/s 4 MiB
Tigerton Quad-Core Xeon LV L7345 65 nm 4 1.86 GHz 1066 MT/s 8 MiB
Quad-Core Xeon E7310, E7320 1.6, 2.13 GHz 4 MiB
E7330 2.4 GHz 6 MiB
E7340, X7350 2.4, 2.93 GHz 8 MiB

[edit] Future processors

[edit] Laptops

Codename Processor name Processor no Architecture No of cores Clock speed FSB L2 cache
Merom-L Celeron 575, 587 65 nm 1 2, 2.17 GHz 667 MT/s 1 MiB
Penryn Celeron ULV (SFF) 723 45 nm 2 1.2 GHz 800 MT/s 1 MiB
Penryn-3M Core 2 Solo ULV (SFF) U3300 45 nm 1 1.2 GHz 800 MT/s 3 MiB
Penryn-3M Core 2 Duo ULV (SFF) SU9300, SU9400 45 nm 2 1.2, 1.4 GHz 800 MT/s 3 MiB
Penryn Core 2 Duo LV (SFF) SL9300, SL9400 1.6, 1.86 GHz 1066 MT/s 6 MiB
Core 2 Duo MV (SFF) SP9300, SP9400 2.26, 2.4 GHz
Penryn XE Core 2 Extreme X9100 45 nm 2 3.06 GHz 1066 MT/s 6 MiB
QX9300 4 2.53 GHz 12 MiB

[edit] Servers and workstations

Codename Processor name Processor no Architecture No of cores Clock speed FSB L2 cache L3 cache
Wolfdale-DP Dual-Core Xeon X5270 45 nm 2 3.5 GHz 1333 MT/s 6 MiB No
Dual-Core Xeon LV L5240, L5250 45 nm 2 2.93, 3.16 GHz 1333 MT/s 6 MiB
Yorkfield Quad-Core Xeon X3320, X3330 45 nm 4 2.5, 2.67 GHz 1333 MT/s 6 MiB
X3370 3 GHz 12 MiB
Harpertown LV Quad-Core Xeon LV L5440, L5450 45 nm 4 3, 3.16 GHz 1333 MT/s 12 MiB
Dunnington Six Core Xeon E7450, X7460 45 nm 6 2.4, 2.67 GHz 1066 MT/s 9 MiB 12/16 MiB
Six Core Xeon LV L7455 45 nm 6 2.13 GHz 12 MiB

[edit] See also

[edit] References

Personal tools