CPU MT Performance: A Real Monster

What’s more interesting than ST performance, is MT performance. With 8 performance cores and 2 efficiency cores, this is now the largest iteration of Apple Silicon we’ve seen.

As a prelude into the scores, I wanted to remark some things on the previous smaller M1 chip. The 4+4 setup on the M1 actually resulted that a significant chunk of the MT performance being enabled by the E-cores, with the SPECint score in particular seeing a +33% performance boost versus just the 4 P-cores of the system. Because the new M1 Pro and Max have 2 less E-cores, just assuming linear scaling, the theoretical peak of the M1 Pro/Max should be +62% over the M1. Of course, the new chips should behave better than linear, due to the better memory subsystem.

In the detailed scores I’m showcasing the full 8+2 scores of the new chips, and later we’ll talk about the 8 P scores in context. I hadn’t run the MT scores of the new Fortran compiler set on the M1 and some numbers will be missing from the charts because of that reason.

SPECint2017 Rate-N Estimated Scores

Looking at the data – there’s very evident changes to Apple’s performance positioning with the new 10-core CPU. Although, yes, Apple does have 2 additional cores versus the 8-core 11980HK or the 5980HS, the performance advantages of Apple’s silicon is far ahead of either competitor in most workloads. Again, to reiterate, we’re comparing the M1 Max against Intel’s best of the best, and also nearly AMD’s best (The 5980HX has a 45W TDP).

The one workload standing out to me the most was 502.gcc_r, where the M1 Max nearly doubles the M1 score, and lands in +69% ahead of the 11980HK. We’re seeing similar mind-boggling performance deltas in other workloads, memory bound tests such as mcf and omnetpp are evidently in Apple’s forte. A few of the workloads, mostly more core-bound or L2 resident, have less advantages, or sometimes even fall behind AMD’s CPUs.

SPECfp2017 Rate-N Estimated Scores

The fp2017 suite has more workloads that are more memory-bound, and it’s here where the M1 Max is absolutely absurd. The workloads that put the most memory pressure and stress the DRAM the most, such as 503.bwaves, 519.lbm, 549.fotonik3d and 554.roms, have all multiple factors of performance advantages compared to the best Intel and AMD have to offer.

The performance differences here are just insane, and really showcase just how far ahead Apple’s memory subsystem is in its ability to allow the CPUs to scale to such degree in memory-bound workloads.

Even workloads which are more execution bound, such as 511.porvray or 538.imagick, are – albeit not as dramatically, still very much clearly in favour of the M1 Max, achieving significantly better performance at drastically lower power.

We noted how the M1 Max CPUs are not able to fully take advantage of the DRAM bandwidth of the chip, and as of writing we didn’t measure the M1 Pro, but imagine that design not to score much lower than the M1 Max here. We can’t help but ask ourselves how much better the CPUs would score if the cluster and fabric would allow them to fully utilise the memory.

SPEC2017 Rate-N Estimated Total

In the aggregate scores – there’s two sides. On the SPECint work suite, the M1 Max lies +37% ahead of the best competition, it’s a very clear win here and given the power levels and TDPs, the performance per watt advantages is clear. The M1 Max is also able to outperform desktop chips such as the 11900K, or AMD’s 5800X.

In the SPECfp suite, the M1 Max is in its own category of silicon with no comparison in the market. It completely demolishes any laptop contender, showcasing 2.2x performance of the second-best laptop chip. The M1 Max even manages to outperform the 16-core 5950X – a chip whose package power is at 142W, with rest of system even quite above that. It’s an absolutely absurd comparison and a situation we haven’t seen the likes of.

We also ran the chip with just the 8 performance cores active, as expected, the scores are a little lower at -7-9%, the 2 E-cores here represent a much smaller percentage of the total MT performance than on the M1.

Apple’s stark advantage in specific workloads here do make us ask the question how this translates into application and use-cases. We’ve never seen such a design before, so it’s not exactly clear where things would land, but I think Apple has been rather clear that their focus with these designs is catering to the content creation crowd, the power users who use the large productivity applications, be it in video editing, audio mastering, or code compiling. These are all areas where the microarchitectural characteristics of the M1 Pro/Max would shine and are likely vastly outperform any other system out there.

CPU ST Performance: Not Much Change from M1 GPU Performance: 2-4x For Productivity, Mixed Gaming
Comments Locked

493 Comments

View All Comments

  • sthambi - Wednesday, November 3, 2021 - link

    Hi Anand, I stumbled across your blog post, and I enjoyed reading it. I'm a professional video editor, photographer. Ordered the 32 core, 64GB, M1 Pro Max for $3900. I'm upgrading from the iMac 5k, late 2015 model. I personally feel like am overkilling my configuration. I don't want to look back 2 years from now, and feel like I lost 4k, and now apple doubled again. Do you think I really need this much heavy configuration to use premiere pro cc, max 5k video editing, and canon raw images, and simultaneous creative cloud application running? what would you recommend, which can help me save money and not compromise on the performance? is my decision of going full configuration bad?
  • MykeM - Sunday, November 14, 2021 - link

    Read the byline (the names under the headlines). The site’s namesake- Anand- left a few years ago. He no longer writes here. The people replacing him are every bit as capable but none of them are actually named Anand.
  • razer555 - Thursday, November 4, 2021 - link

    https://www.youtube.com/watch?v=OMgCsvcMIaQ
    https://www.youtube.com/watch?v=mN8ve8Hp4I4

    Anandtech, your tests about the graphic seems wrong.
  • Sheepshot - Sunday, November 7, 2021 - link

    Anand tech = Apple shills.

    M1 beats both the M1Pro and max in power efficiency. Draws 50% of watt butt provides almost 65-70% of the performance in most relevant benches.
  • Hrunga_Zmuda - Sunday, November 7, 2021 - link

    Shills?

    The 90s called and want your insult back.
  • evernessince - Wednesday, November 10, 2021 - link

    HWUB just did a review of the M1 pro in actual applications and performance is good but not nearly as impressive as Anand suggests. These chips are competitive with laptop chips but you certainly don't need to bust out server class components as suggested in the article. Performance is very good in certain areas and in others it's very poor. Most of the time it's about as good as X86 laptop chips. GPU is decent but given the price, you can get much much more performance on X86 at a much lower price.
  • Motti.shneor - Sunday, November 14, 2021 - link

    I think I heard M1 Max and M1 Pro have different number of CPU cores? Here you say they're identical?

    Also, I keep asking myself why a tech visionary like yourself doesn't see the "big picture" and the bold transitional step in computing taken here.

    For me, that sheer "horse power" means very little - I'm using a 1st generation M1 Mac-Mini, with medium configuration, beside an i9 MacBookPro from 2020 - and the Mini is SO MUCH BETTER in each and every way and meaning (except of course for the terrible bugs and deteriorating quality and bad behavior at boot time, the EFI and such)

    As a power user, and Mac/iOS software engineer/tech-lead for over 35 years, and with my pack of 400 applications installed, some native, some emulated, and my 0.5TB library of photos and 0.5TB library of music.... well, with all this, I can testify that MacMini "feels" 5 times faster than the MBP, in most everything I DO. Maybe it'll fail on benchmarks, but I couldn't care less. Rebuild a project? snap. Export a video while rescaling it? Immediate! heavy image conversion? no time. Launch a heavy app? before you know it. It FEELS very very fast, and that's 1st generation.

    What I think IS IMPRTANT and not being said by anyone, is that the whole mode of computing goes back from "general purpose" into "specialized hardware". You can no longer appreciate a computer by its linear-programming CPU cycles, and if you do - you just get a completely wrong evaluation.

    Moreover - you CANT just "port some general C code from somewhere" and expect it to run fast. You MUST be using system APIs at SOME LEVEL, that will dispatch your work onto specialized hardware, so you gain from all those monstrous engines under the hood. If you will just compile some neural-networks engine or drag it over in python or something, it'll crawl and it will suck. But if you use Vision Framework from Apple, you'll have jaw-dropping performance. You MUST build software FOR the M1, to have the software shine. This is a paradigm shift, that contradicts everything we've seen in the last 40 years (moving from custom hardware into general-purpose computing devices).

    If history is to repeat itself like so many times in the past - soon enough all the competitors in the Computing arena will be forced into similar changes, so not to lose market share - and we'll have a very strange market, much harder to compare - because the Apple guys will always bring in Apple software highly optimized to use the hardware, and the "other" guys will pull their "specialized" software for their special processors... I

    I am quite thrilled, and I really want to have one of them M1 Max machines, just to feel them a little.

    Despite the long threads underneath, I think Gaming is not even secondary in the list of important aspects - And I also predict that Game makers will skip the Mac in the future just like today. It's not because they don't like it, but because of tradition, and because of the high priced entry point of Powerful Mac computers. Still - Corporate-America is buying MBPs like mad, and they'll keep doing that in the coming 3-5 years.
  • stevenLu - Tuesday, December 7, 2021 - link

    I am a fan of Apple technology. And I am only glad to read news about their development and some new technologies. Another new technology is used by Lucid Reality Labs. You can read more on their website https://lucidrealitylabs.com/blog/5-vr-headsets-ma...
  • Cloakstar - Tuesday, December 21, 2021 - link

    One reason this M1 Max performs so well is that even though the CPU is in control, the memory hierarchy is more GPU+CPU than the typical CPU+GPU, so the typical APU memory bottleneck is gone. :D AMD APUs, for example, are highly memory bound, doubling in performance when you go form 1 stick of RAM to 4 sticks with bank+channel interleaving.
  • wr3@k0n - Friday, December 24, 2021 - link

    For a $3499 no shit it's competing with server grade, if it comes with that price. Though PC still ends up being cheaper and infinitely more repairable and upgradeable. This article doesn't address many of the drawbacks of the Apple ecosystem and it will take more than "close to PC" performance.

Log in

Don't have an account? Sign up now