Click

Saturday, April 3, 2010

Why Pine Trail Isn't Much Faster Than the First Atom

Unlocking the Phenom II X2 555: 3.2GHz Quad-Core for $99


Three days before Apple revolutio...wait, that didn't happen. I kid, I kid.
On Monday AMD updated just about all of its processor families with new chips in response to Clarkdale. We got the Athlon II X2 255, Athlon II X3 440, Phenom II X2 555 BE, Phenom II X4 635 and Phenom II X4 910e. All of the chips are in Bench, so if you want to know how they compare have a look - or check out our review.
There are two things I left out of that review that I felt needed following up on. First, let's take the Phenom II X2 555 BE.
If you read my take on the 555 you'll know that I don't really believe it's worth the price. Most users will be better off with a Core i3 530. There is just one exception I failed to mention: some Phenom II X2s can be turned into a Phenom II X4.
The technique is nothing new. Using any AMD chipset motherboard with a SB710 or SB750 South Bridge and proper BIOS support you'll have a feature called Advanced Clock Calibration (ACC). AMD introduced this feature back in 2008 as a way to improve overclocking on Phenom processors by sacrificing some sort of corner case stability for real world frequency headroom.
The Phenom II X2 is nothing more than a Phenom II X4 with two cores disabled. Originally these cores were disabled because of low yields, but over time yields on quad-core Phenom IIs should be high enough to negate the need for a Phenom II X2. This is most likely why AMD removed the Phenom II X2 from its official price list. It's also why the stranger Phenom II derivatives are also absent from AMD's price list. All that's left are Phenom II X4s pretty much.

A Phenom II X4 900 series die: 258mm2, 4-cores and a 6MB L3 cache. Also the basis for the Phenom II X2.
And herein lies the problem for companies that rely on die harvesting for their product line. Initially, the Phenom II X2 is a great way of using defective Phenom II X4 die. Once yields improve however, you've now created a market for these Phenom II X2s and have to basically sell a full-blown Phenom II X4 at a cheaper price to meet that demand. You could create a new die that's a dual-core Phenom II, but that's expensive and pulls engineers away from more exciting projects like Bulldozer. Often times it's easier to just disable two cores and sell the chip for cheaper than you'd like. At the same time you can do your best to discourage your customers from ordering too many. Remove it off the official price list, charge a little more for it, and direct people at a cheaper native alternative - like the Athlon II X2.

The Athlon II X2 die. Two cores are all you get.
AMD's sticky situation is your gain however. While I can't guarantee that all Phenom II X2s can be converted into quad-core chips, I'd say that your chances are probably pretty good at this point if you get a new enough chip. As with any sort of out-of-spec operation, proceed at your own risk. You may risk ending up with nothing more than a dual-core processor or an unstable quad-core. In my case however, my Phenom II X2 555 BE's extra two cores were easily unlocked.
My Socket-AM3 testbed uses Gigabyte's GA-MA790FXT-UD5P motherboard. In its BIOS there's an option for Advanced Clock Calibration. All you need to do is set EC Firmware Selection to Hybrid, and ACC to Auto:
Patiently waiting and a self-initiated reboot later and my CPU was identified as a Phenom II X4 B55 BE. Four cores running at 3.2GHz, just like a Phenom II X4 955 but for $99.
The chip also performs just like a 3.2GHz quad-core Phenom II, because it is one at this point:
Processorx264 HD 1st Passx25 HD 2nd Pass
AMD Phenom II X4 96572.1 fps22.2 fps
AMD Phenom II X4 B5570.6 fps21.1 fps
AMD Phenom II X2 55545.2 fps10.9 fps

Overclocking is affected. With only two cores active my Phenom II X2 555 BE could run at 3.8GHz without any additional voltage. With four cores active, that number drops down to 3.6GHz.

My Phenom II X2 555 BE, with all four cores unlocked, and running at 3.6GHz.
If you're ok with the possibility of this not working at all, a Phenom II X2 555 BE with all four cores active is the absolute best value you can get for $99. AMD would like to charge you $160 for the opportunity, but you can put the savings towards a better video card or a shiny new SSD.

New Westmere Details Emerge: Power Efficiency and 4/6 Core Plans


CPUCodenameManufacturing ProcessCoresTransistor CountDie Size
Westmere 6CGulftown32nm61.17B240mm2
Nehalem 4CBloomfield45nm4731M263mm2
Nehalem 4CLynnfield45nm4774M296mm2
Westmere 2CClarkdale32nm2384M81mm2

We’ll find out soon enough.

AMD Reveals More Llano Details at ISSCC: 32nm, Power Gating, 4-cores, Turbo?

After cashing Intel’s check and appearing more competitive than expected against Clarkdale 2010 is like a fresh start for AMD. The news gets better.
Late last year AMD said that before the end of 2010 it would be sampling its first APU (Accelerated Processing Unit) - codenamed Llano. Today AMD is announcing that the first Llano samples, built on Global Foundries 32nm high-k + metal gate, SOI process will be sampling to partners in the first half of this year.

GF's 32nm SOI High-K + MG process will be used with Llano
For those not in the know, Llano is AMD’s first hybrid CPU-GPU with on-die graphics. The graphics core is a derivative of AMD’s DirectX 11 Evergreen lineup (the same lineage as the Radeon HD 5970, 5870, 5850, 5670, 5570, 5450, etc...).
Llano will go up against Sandy Bridge, which seems to have been pushed back to 2011 for volume availability according to Intel’s internal roadmaps. While Sandy Bridge will have graphics on-die, it will still only be DX10 class - AMD will have the feature-set advantage as far as graphics is concerned.

Llano's Features
Today we learn a bit more about the CPU side of Llano. The first chip will be a quad-core processor plus on-die graphics. Each core is Phenom II derived, but there’s no shared L3 cache. So Llano cores look a lot like Athlon II cores. I’m hearing that they may have some architectural tweaks, so performance could be better than present-day Athlon IIs.
At 32nm each core (minus L2 cache) is only 9.69 mm^2 and is made up of over 35M transistors. Each core is paired with its own 1MB L2 cache, meaning the quad-core processor will have a total of 4MB of L2 on-die. AMD expects Llano to run at above 3GHz, which should be more than possible at 32nm given that we’re already at close to 3GHz with the 45nm Athlon II X4.

AMD’s First Power Gated CPU

With Nehalem Intel introduced power gating, a technique that allows a core to be near-completely powered down minimizing leakage current when inactive. This not only reduces idle power but it also enables Intel to use extra TDP to turbo up active cores.
Llano uses power gating as well as a Digital APM Module. AMD doesn’t go into much detail on the digital APM module but I’m guessing we’ll see the same sort of turbo-like functionality out of Llano, including graphics turbo.
AMD also pointed out that Llano uses a “power aware clock grid design”. I couldn’t get much more information out of AMD on this one, other than its expecting a ~2x reduction in clock switching power. Simply distributing the clock to all parts of a modern day microprocessor can take up quite a bit of power, any improvements in efficiency there are very important.
I’ll keep digging to see if I can get any more details on this aspect of Llano.

Final Words

Llano will obviously require a new socket. All AMD is saying is that OEMs will be shipping systems in 2011. It’s unclear if we’ll see anything in the channel before then, but with sampling in the coming months it appears that AMD could be ready for Sandy Bridge when it arrives next year.
AMD isn’t qualifying its 2011 statement with an indication of what quarter to expect systems. Given that the first samples are going out now, I’d expect to see Llano sometime in the first half of 2011 but that’s purely conjecture on my part. Sandy Bridge is scheduled to ship in volume in the first quarter of 2011.
The big questions going forward are 1) how much AMD and Intel are going to scale up its graphics performance on these chips, and 2) how important DX11 support will be to the upcoming APU race.

Friday, April 2, 2010

AMD Reveals More Llano Details at ISSCC: 32nm, Power Gating, 4-cores, Turbo

After cashing Intel’s check and appearing more competitive than expected against Clarkdale 2010 is like a fresh start for AMD. The news gets better.




Late last year AMD said that before the end of 2010 it would be sampling its first APU (Accelerated Processing Unit) - codenamed Llano. Today AMD is announcing that the first Llano samples, built on Global Foundries 32nm high-k + metal gate, SOI process will be sampling to partners in the first half of this year.





GF's 32nm SOI High-K + MG process will be used with Llano



For those not in the know, Llano is AMD’s first hybrid CPU-GPU with on-die graphics. The graphics core is a derivative of AMD’s DirectX 11 Evergreen lineup (the same lineage as the Radeon HD 5970, 5870, 5850, 5670, 5570, 5450, etc...).







Llano will go up against Sandy Bridge, which seems to have been pushed back to 2011 for volume availability according to Intel’s internal roadmaps. While Sandy Bridge will have graphics on-die, it will still only be DX10 class - AMD will have the feature-set advantage as far as graphics is concerned.





Llano's Features



Today we learn a bit more about the CPU side of Llano. The first chip will be a quad-core processor plus on-die graphics. Each core is Phenom II derived, but there’s no shared L3 cache. So Llano cores look a lot like Athlon II cores. I’m hearing that they may have some architectural tweaks, so performance could be better than present-day Athlon IIs.







At 32nm each core (minus L2 cache) is only 9.69 mm^2 and is made up of over 35M transistors. Each core is paired with its own 1MB L2 cache, meaning the quad-core processor will have a total of 4MB of L2 on-die. AMD expects Llano to run at above 3GHz, which should be more than possible at 32nm given that we’re already at close to 3GHz with the 45nm Athlon II X4.







AMD’s First Power Gated CPU

With Nehalem Intel introduced power gating, a technique that allows a core to be near-completely powered down minimizing leakage current when inactive. This not only reduces idle power but it also enables Intel to use extra TDP to turbo up active cores.







Llano uses power gating as well as a Digital APM Module. AMD doesn’t go into much detail on the digital APM module but I’m guessing we’ll see the same sort of turbo-like functionality out of Llano, including graphics turbo.







AMD also pointed out that Llano uses a “power aware clock grid design”. I couldn’t get much more information out of AMD on this one, other than its expecting a ~2x reduction in clock switching power. Simply distributing the clock to all parts of a modern day microprocessor can take up quite a bit of power, any improvements in efficiency there are very important.











I’ll keep digging to see if I can get any more details on this aspect of Llano.



Final Words

Llano will obviously require a new socket. All AMD is saying is that OEMs will be shipping systems in 2011. It’s unclear if we’ll see anything in the channel before then, but with sampling in the coming months it appears that AMD could be ready for Sandy Bridge when it arrives next year.



AMD isn’t qualifying its 2011 statement with an indication of what quarter to expect systems. Given that the first samples are going out now, I’d expect to see Llano sometime in the first half of 2011 but that’s purely conjecture on my part. Sandy Bridge is scheduled to ship in volume in the first quarter of 2011.







The big questions going forward are 1) how much AMD and Intel are going to scale up its graphics performance on these chips, and 2) how important DX11 support will be to the upcoming APU race.

Thursday, April 1, 2010

AMD's 890GX Chipset - Same Graphics, Better South Bridge


AMD's 890GX Chipset - Same Graphics, Better South Bridge

No news isn’t always good news. AMD has been purposefully vague on the graphics specifications of its 8-series chipsets. Today we know why. The AMD 890GX has the exact same graphics horsepower as the 790GX:

AMD 890GXAMD 790GXAMD 785G
CPUAMD Socket-AM3AMD Socket-AM3/AM2+AMD Socket-AM3/AM2+
Manufacturing Process55nm55nm55nm
PCI Express24 PCIe 2.0 lanes22 PCIe 2.0 lanes22 PCIe 2.0 lanes
GraphicsRadeon HD 4290 (DirectX 10.1)Radeon HD 3300 (DirectX 10.0)Radeon HD 4200 (DirectX 10.1)
Core Clock700MHz700MHz500MHz
Shader Processors8 (5-way)8 (5-way)
Full H.264/VC-1/MPEG-2 HW DecodeYes (UVD2)Yes (UVD)Yes (UVD2)
8-channel LPCMNoNoNo
South BridgeSB850SB750SB750
USB14 USB 2.0 ports12 USB 2.0 ports12 USB 2.0 ports
SATA6 SATA 6Gbps ports6 SATA 3Gbps ports6 SATA 3Gbps ports

In fact, it’s virtually the same graphics core as the 790GX and 785G built on the same 55nm process.


AMD 890GX North Bridge (left) vs. AMD 790GX North Bridge (right) - nothing has changed


SidePort memory on a motherboard

For those of you who aren’t familiar with AMD’s integrated graphics, it’s a 40 SP implementation of the RV6xx core running at 700MHz. The chipset supports optional SidePort memory. It’s an optional dedicated frame buffer that provides additional low latency bandwidth to the integrated graphics core. If your application demands more memory, the GPU can still use the CPU’s memory controller and system memory.

There haven’t been any improvements to the audio side of the chipset either. The 890GX still supports 2-channel LPCM or 5.1 Dolby Digital/DTS bitstreaming over HDMI. If you want more, AMD will happily sell you a Radeon HD 5450 to plug in to your new board.

AMD continues to offer full hardware H.264/MPEG-2/VC-1 video decode acceleration. The UVD2 engine responsible for the video acceleration remains unchanged from the 785G and 790GX.


Boards based on the 890GX will be well equipped with video outputs like this ASUS M4A89GTD Pro/USB3

Overall system performance remains unchanged. The 890GX performs no differently than the 790FX/GX chipsets:

AMD 890GXAMD 790GX
x264 HD Encode - Average FPS26.6 fps26.6 fps
7-zip Benchmark5962 MIPS5917 MIPS

With nothing new on the integrated graphics front, the 890GX relies mostly on its new South Bridge, the SB850, to excite. AMD first doubled the bandwidth between its North and South Bridges. Then, it added a native 6Gbps SATA controller with 6 ports, a Gigabit Ethernet MAC and two PCIe 2.0 lanes. The SB850 doesn’t offer any native USB 3 support, so we’ll still see motherboard manufacturers rely on NEC’s external USB 3 controller. The new South Bridge does support two more USB 2.0 ports as well, bringing the total up to 14.

AMD SB850AMD SB750
NB-SB Link2GB/s Each Direction1GB/s Each Direction
Additional PCIeTwo PCIe 2.0 x1 LanesNone
USB14 USB 2.0 ports12 USB 2.0 ports
SATA6 SATA 6Gbps ports6 SATA 3Gbps ports
PATA2 Channels2 Channels
HD Audio InterfaceYesYes
Integrated Gigabit Ethernet MACYesNo

The first 890GX motherboards will be starting at $130. ASUS' M4A89GTD Pro will sell for $145, add another $10 if you want USB 3.0 support.


ASUS M4A89GTD Pro/USB3

I'd expect Gigabyte's pricing to be somewhere in line with that as well.


Gigabyte's GA-890GPA-UD3H

Ready for Integration

The 8-series of chipsets will be AMD’s last incarnation of traditional integrated graphics chipsets. Sometime next year we’ll get Llano, AMD’s first APU with a CPU and GPU on the same die.


AMD's Llano 32nm CPU/GPU due in 2011

At 55nm, the 890GX North Bridge is tiny. At 32nm the North Bridge would be about 1/3 the size. Now you can see why it makes sense to bring this on-die. The South Bridge is even smaller:


AMD SB750 (left) vs. AMD SB850 (right).

We’re not too far away from having nearly all of this technology integrated into the CPU.

The Test

Motherboard:ASUS M4A89GTD Pro/USB3 (AMD 890GX)
Intel DH55TC (Intel H55)
Gigabyte's GA-890GPA-UD3H (AMD 890GX)
Chipset Drivers:Intel 9.1.1.1020 (Intel)
AMD Catalyst 10.3
Hard Disk:Corsair P256 SSD
Memory:Corsair DDR3-1333 2 x 2GB (7-7-7-20)
Video Card:ATI Radeon HD 5450
Video Drivers:AMD Catalyst 10.3
Desktop Resolution:1920 x 1200
OS:Windows Vista 7 64-bit