Apple’s AI Strategy: Apple Datacenters, On-device, Cloud, And More
When to run on-device, in Apple datacenters, or in the cloud with OpenAI, Deal economics
Nvidia continues to ramp their production to service the world’s insatiable demand for GPUs, and yet, our Accelerator Model’s extensive checks show Apple’s purchases of GPUs are quite miniscule. In fact, they aren’t even a top 10 customer. Furthermore, while all eyes are on WWDC, Apple’s only announcing AI there, not shipping. The question on everyone’s mind is… what the heck is Apple doing in AI?
Mark Gurman laid out the features Apple is announcing at WWDC. Furthermore there’s a variety of rumors floating around from others, so let’s get to the bottom of what’s really happening, how, and what Apple can do.
The first thing is that multiple sources have reported that Apple is ramping up production of its M-series processors this year to record volumes. This is primarily Apple’s M2 Ultra SKUs which is 2 M2 Max SoCs stitched together with what Apple calls “UltraFusion.” Note that Apple’s M3 Ultra was cancelled.
Ultrafusion is Apple’s marketing name for using a local silicon interconnect (bridge die) to connect the two M2 Max chips in a package. The two chips are exposed as a single chip to many layers of software. M2 Ultra utilizes TSMC’s InFO-LSI packaging technology. This is a similar concept as TSMC’s CoWoS-L that is being adopted by Nvidia’s Blackwell and future accelerators down the road to make large chips. The only major difference between Apple and Nvidia’s approaches are that InFO is chip-first vs CoWoS-L is chip-last process flow, and that they are using different types of memory.
What’s curious about Apple’s production increases is that there is nothing on the demand front to support a sudden increase in M2 Ultra shipments. M2 Ultra is only used in high-end Mac Studio and Mac Pro products. There has been no meaningful refresh of these products in a year, and there are no plans for one any time soon. Furthermore, no new products are shipping with it either.
High end desktop PCs and Macs remains quite sluggish compared to the peak covid demand, yet 2024 production of the chips that would purportedly power these high end Macs is going to be significantly higher than the last few years even though there is nothing to suggest there will be consumer demand to soak up all these units.
How do we reconcile this?
Apple’s Own AI Servers
This additional production of the M2 Ultras is consistent with recent reporting out from the WSJ and Bloomberg about Apple using their own silicon in their own Datacenters for serving AI to Apple users.
Furthermore, Apple has extensive expansion plans for their own datacenter infrastructure. We are tracking 7 different datacenter sites with over 30 buildings for Apple as well as their planned buildout. Their total capacity is doubling in a relatively short period of time.
Above is Apple’s soon to be largest datacenter site. Currently they only have 1 datacenter there, but many will be coming up next year. Our datacenter model has more details on the upcoming Apple datacenters.
Apple’s Infrastructure Team
The other indication that Cupertino is serious about their AI hardware and infrastructure strategy is they made a number of major hires a few months ago. This includes Sumit Gupta who joined to lead cloud infrastructure at Apple in March. He’s an impressive hire. He was at Nvidia from 2007 to 2015, and involved in the beginning of Nvidia's foray into accelerated computing. After working on AI at IBM, he then joined Google’s AI infrastructure team in 2021 and eventually was the product manager for all Google infrastructure including the Google TPU and Arm based datacenter CPUs.
He’s been heavily involved in AI hardware at Nvidia and Google who are both the best in the business and are the only companies that are deploying AI infrastructure at scale today. This is the perfect hire.
Given this context, let’s look at what Apple is doing with their current and future in-house chips and external chips. We will look into Apple’s grudge with Nvidia. We will also discuss what Apple can run on-device, in cloud, and when they have to go to external service provider based AI. The economics around the deal differ from the $20B Google search deal, but not in the way you’d think. We will also discuss how Apple can offer this to customers and grow revenue.