The iPhone 5's A6 SoC: Not A15 or A9, a Custom Apple Core Instead
by Anand Lal Shimpi on September 15, 2012 2:15 PM EST- Posted in
- Smartphones
- Apple
- Mobile
- SoCs
- iPhone 5
When Apple announced the iPhone 5, Phil Schiller officially announced what had leaked several days earlier: the phone is powered by Apple's new A6 SoC.
As always, Apple didn't announce clock speeds, CPU microarchitecture, memory bandwidth or GPU details. It did however give us an indication of expected CPU performance:
Prior to the announcement we speculated the iPhone 5's SoC would simply be a higher clocked version of the 32nm A5r2 used in the iPad 2,4. After all, Apple seems to like saving major architecture shifts for the iPad.
However, just prior to the announcement I received some information pointing to a move away from the ARM Cortex A9 used in the A5. Given Apple's reliance on fully licensed ARM cores in the past, the expected performance gains and unpublishable information that started all of this I concluded Apple's A6 SoC likely featured two ARM Cortex A15 cores.
It turns out I was wrong. But pleasantly surprised.
The A6 is the first Apple SoC to use its own ARMv7 based processor design. The CPU core(s) aren't based on a vanilla A9 or A15 design from ARM IP, but instead are something of Apple's own creation.
Hints in Xcode 4.5
The iPhone 5 will ship with and only run iOS 6.0. To coincide with the launch of iOS 6.0, Apple has seeded developers with a newer version of its development tools. Xcode 4.5 makes two major changes: it drops support for the ARMv6 ISA (used by the ARM11 core in the iPhone 2G and iPhone 3G), keeps support for ARMv7 (used by modern ARM cores) and it adds support for a new architecture target designed to support the new A6 SoC: armv7s.
What's the main difference between the armv7 and armv7s architecture targets for the LLVM C compiler? The presence of VFPv4 support. The armv7s target supports it, the v7 target doesn't. Why does this matter?
Only the Cortex A5, A7 and A15 support the VFPv4 extensions to the ARMv7-A ISA. The Cortex A8 and A9 top out at VFPv3. If you want to get really specific, the Cortex A5 and A7 implement a 16 register VFPv4 FPU, while the A15 features a 32 register implementation. The point is, if your architecture supports VFPv4 then it isn't a Cortex A8 or A9.
It's pretty easy to dismiss the A5 and A7 as neither of those architectures is significantly faster than the Cortex A9 used in Apple's A5. The obvious conclusion then is Apple implemented a pair of A15s in its A6 SoC.
For unpublishable reasons, I knew the A6 SoC wasn't based on ARM's Cortex A9, but I immediately assumed that the only other option was the Cortex A15. I foolishly cast aside the other major possibility: an Apple developed ARMv7 processor core.
Balancing Battery Life and Performance
There are two types of ARM licensees: those who license a specific processor core (e.g. Cortex A8, A9, A15), and those who license an ARM instruction set architecture for custom implementation (e.g. ARMv7 ISA). For a long time it's been known that Apple has both types of licenses. Qualcomm is in a similar situation; it licenses individual ARM cores for use in some SoCs (e.g. the MSM8x25/Snapdragon S4 Play uses ARM Cortex A5s) as well as licenses the ARM instruction set for use by its own processors (e.g. Scorpion/Krait implement in the ARMv7 ISA).
For a while now I'd heard that Apple was working on its own ARM based CPU core, but last I heard Apple was having issues making it work. I assumed that it was too early for Apple's own design to be ready. It turns out that it's not. Based on a lot of digging over the past couple of days, and conversations with the right people, I've confirmed that Apple's A6 SoC is based on Apple's own ARM based CPU core and not the Cortex A15.
Implementing VFPv4 tells us that this isn't simply another Cortex A9 design targeted at higher clocks. If I had to guess, I would assume Apple did something similar to Qualcomm this generation: go wider without going substantially deeper. Remember Qualcomm moved from a dual-issue mostly in-order architecture to a three-wide out-of-order machine with Krait. ARM went from two-wide OoO to three-wide OoO but in the process also heavily pursued clock speed by dramatically increasing the depth of the machine.
The deeper machine plus much wider front end and execution engines drives both power and performance up. Rumor has it that the original design goal for ARM's Cortex A15 was servers, and it's only through big.LITTLE (or other clever techniques) that the A15 would be suitable for smartphones. Given Apple's intense focus on power consumption, skipping the A15 would make sense but performance still had to improve.
Why not just run the Cortex A9 cores from Apple's A5 at higher frequencies? It's tempting, after all that's what many others have done in the space, but sub-optimal from a design perspective. As we learned during the Pentium 4 days, simply relying on frequency scaling to deliver generational performance improvements results in reduced power efficiency over the long run.
To push frequency you have to push voltage, which has an exponential impact on power consumption. Running your cores as close as possible to their minimum voltage is ideal for battery life. The right approach to scaling CPU performance is a combination of increasing architectural efficiency (instructions executed per clock goes up), multithreading and conservative frequency scaling. Remember that in 2005 Intel hit 3.73GHz with the Pentium Extreme Edition. Seven years later Intel's fastest client CPU only runs at 3.5GHz (3.9GHz with turbo) but has four times the cores and up to 3x the single threaded performance. Architecture, not just frequency, must improve over time.
At its keynote, Apple promised longer battery life and 2x better CPU performance. It's clear that the A6 moved to 32nm but it's impossible to extract 2x better performance from the same CPU architecture while improving battery life over only a single process node shrink.
Despite all of this, had it not been for some external confirmation, I would've probably settled on a pair of higher clocked A9s as the likely option for the A6. In fact, higher clocked A9s was what we originally claimed would be in the iPhone 5 in our NFC post.
I should probably give Apple's CPU team more credit in the future.
The bad news is I have no details on the design of Apple's custom core. Despite Apple's willingness to spend on die area, I believe an A15/Krait class CPU core is a likely target. Slightly wider front end, more execution resources, more flexible OoO execution engine, deeper buffers, bigger windows, etc... Support for VFPv4 guarantees a bigger core size than the Cortex A9, it only makes sense that Apple would push the envelope everywhere else as well. I'm particularly interested in frequency targets and whether there's any clever dynamic clock work happening. Someone needs to run Geekbench on an iPhone 5 pronto.
I also have no indication how many cores there are. I am assuming two but Apple was careful not to report core count (as it has in the past). We'll get more details as we get our hands on devices in a week. I'm really interested to see what happens once Chipworks and UBM go to town on the A6.
163 Comments
View All Comments
Death666Angel - Sunday, September 16, 2012 - link
That seems to be true. But I don't see how that influences his conclusion that Cortex A7 is not enough of a step up in performance.From where I'm standing, A15 is still a possibility, but highly unlikely given the time frame we are talking about. And I have no reason not to trust Anand's word that it isn't A15.
Honestly, I don't care that much either way right now. I'm not going to buy an iPhone and I will be eating up any hard facts about the SoC regardless of whether its a new design of Apple's take on an A15. It's interesting in either case. :D
MrSpadge - Sunday, September 16, 2012 - link
Not sure it has been said before.. but overall power consumption doesn't scale exponentially with voltage. Some leakage may scale exponentially, but that's only a small part. Basic dynamic switching consumption scales with V^2, overall it's about V^3 for an entire modern chip - if I remember something I read during the last 1 or 2 years correctly. I think it was on Anandtech ;)>I should probably give Apple's CPU team more credit in the future.
That's only fair - however, if they don't tell anyone about the new design, they can't expect to be given credit.
KPOM - Sunday, September 16, 2012 - link
It's a dual-core 1.0GHz model. It scored 1601 vs. 629 for the iPhone 4S and 1560 for the S3.http://forums.macrumors.com/showthread.php?t=14432...
It's pretty impressive.
shank2001 - Sunday, September 16, 2012 - link
And considering that geekbench utilizes all cores, and the S3 is a quad core device, the real world single threaded performance gap will be MASSIVE!Hardly anything running on a phone whether iOS or android is multithreaded.
akdj - Sunday, September 16, 2012 - link
Exactly! Half the clock speed...half the RAM, and half the cores, yet just as 'quick' with what I'm sure will be a significant savings in 'power' accomplishing the same efficiency, speed and pure--end user enjoymentThese speeds from Geekbench are very impressive. Comparing to the actual dual core SIII sold in the US, there's an even larger 'gap' in performance as the international version is the only one sold with 4 cores. This is an engineering marvel and a big surprise. Apple is now ready and executing their own chip architecture. Married with designing their own OS, even bigger gains in perceived performance will be noticable. I'm a developer on both platforms and own the 3GS, 4 and 4S...as well as the Galaxy Note, Nexus and GSIII. My daily drivers are the 4s and GNote (the latter strictly for business apps and functionality). Regardless of the apps I use that are ambidextrous (both Android and iOS), the 4s, to this day, are more fluid them their counter part 'Droid app. I'm blown away by the processing power in these mobile devices...but even more excited by what Apple continues to achieve using less power, less speed, and less memory.
My hope is Google will follow Apple's lead wrestling control of their OS back from the manufacturers and providers...not allowing the 'bloat-ware' and skins that are dragging out the update/upgrade process and bogging down system performance. ICS and Jelly Bean are excellent updates but the fluidity and efficiency of iOS and the SDK from Apple make development a much easier endeavor. As well, knowing when and what you're users/customers are using as far as the OS build is invaluable.
This is a phenomenal engineering improvement. A huge pat on the back to the chefs (engineers) at Apple. Thinner, lighter, faster...better screen and color gamut, twice the speed (of even the 'new' iPad), screaming graphics AND as good or better battery life? Holy. Shit! Accolades to Apple. That's true innovation.
Jeremy
name99 - Monday, September 17, 2012 - link
You don't have to guess, you can compare the single vs multi-core benchmarks line by line. They're a mixed bag --- A6 is usually ahead, but not always.BUT A6 is running at 1 GHz vs 1.4GHz for SGS III, which probably means A6 is burning half the power or so. It also suggests that if Apple ever get desperate enough that they need a quick speed bump to compete in the market, they can do so, boosting either the speed or adding an additional core or two, whereas Samsung has less of this sort of flexibility.
Finally Apple, just like with iPad3, are obsessed with memory bandwidth --- this device has about 1.5 to 2x what SGS III offers, depending on the benchmark. I wonder how much of this reflects company memory of the past. Back in the PPC days, this was our biggest flaw compared to Intel, the thing that made me, as an Apple SW engineer focussed on optimization and performance, weep in frustration --- we had better CPU cores, as long as you stayed in L1, cache, but our memory systems were so crippled that the core never had a chance to shine.
We seem to be living through the mirror image of those days --- now the Apple HW engineers aren't going to make that mistake again, while the competition hope that synthetic (ie core-only) benchmarks will hide their memory system real-world limitations.
RogerShepherd - Sunday, September 16, 2012 - link
If you look at the breakdown of the Geekbench scores, it's in line with a Cortex A15 being used http://bloggershepherd.blogspot.co.uk/2012/09/more...Death666Angel - Sunday, September 16, 2012 - link
Interesting. Also, disgusting forum posts over there. Yikes.tipoo - Sunday, September 16, 2012 - link
Geekbench also got the frequency, 1GHz. Even more impressive that it's faster than Krait at 1.5GHz. But then, Krait is what, a a year or so old.Didn't think I'd feel it, but the mobile SoC wars have gotten more interesting than the desktop CPU ones. At least there's leapfrogging going on here.
Brayan Schroeder - Sunday, September 16, 2012 - link
Ok, but the Apple A6 is better than ARM Cortex A15? Thanks.