At a glance: Apple-Google Gemini deal for Siri. Announced January 12, 2026. $1 billion/year. Custom 1.2T parameter model. 2 billion devices. DOJ scrutiny. Phase 1: iOS 26.4 (spring 2026). Phase 2: iOS 27 / iPhone 18 (September 2026). Part of our AI Models & Companies reviews.


The Headline

On January 12, 2026, Apple and Google published a joint statement announcing a multi-year partnership: Google Gemini will power the next generation of Apple Foundation Models, a rebuilt Siri, and future Apple Intelligence features across the entire iOS/macOS ecosystem.

Apple is reportedly paying $1 billion per year for access to a custom-built Gemini model with approximately 1.2 trillion parameters — eight times larger than Apple’s existing cloud inference models and hundreds of times larger than its on-device models. That model will run on Apple’s Private Cloud Compute infrastructure, meaning Gemini handles the intelligence layer while Apple retains control of the data pipeline.

The deal does not replace OpenAI’s existing arrangement — Apple’s “Ask ChatGPT” integration in iOS 18/19 remains in place. But the Google partnership is structurally different. This isn’t a user-facing option you choose from a menu. Gemini becomes the backbone of Siri itself.


What Apple Is Admitting

The most important sentence in the joint statement is one neither company wrote explicitly: Apple cannot build a competitive frontier model.

Apple is the world’s most valuable company. It employs thousands of ML researchers. It has custom silicon — the Neural Engine on every A-series and M-series chip — optimized for inference. It has Private Cloud Compute, one of the most sophisticated privacy-preserving inference architectures in the industry. It has two billion active devices generating more user interaction data than nearly any company on earth.

And it still couldn’t close the gap.

The story of how that happened matters. Apple’s AI strategy through 2023-2024 was built around on-device execution: small, efficient models that ran locally, preserved privacy, and worked offline. That strategy was coherent and defensible — until the ChatGPT moment made it clear that users wanted intelligence that required frontier-scale compute, not just efficiency. Apple’s ~3 billion parameter on-device models and ~150 billion parameter cloud models simply didn’t perform at the level users had come to expect from GPT-4, Gemini Ultra, and Claude.

By January 2026, the decision was apparently made: license what you can’t build, build what you can. Apple keeps the device layer, the privacy infrastructure, the user interface, and the trust relationship. Google provides the model.


The Scale of the Distribution

To understand why this deal matters to Google, consider the distribution math.

Google has approximately 400 million monthly active Gemini users across its own products. Apple has approximately two billion active devices — iPhones, iPads, Macs, Apple Watches, HomePods. Every one of those devices, under iOS 27 and macOS 27, will route Siri’s intelligence through a Gemini-backed backend.

That’s the largest single distribution pipeline for any AI model in existence. By comparison, Google’s own deployment of Gemini through Search, Workspace, and the Gemini app looks modest. The Apple deal effectively puts Gemini in front of every Apple user whether they’ve heard of it or not.

From Google’s position, this is worth far more than $1 billion a year in revenue. It’s market presence that no amount of direct-to-consumer marketing could buy.


Why Google, Not Anthropic or OpenAI

The question that circulated when the deal broke: why not Anthropic, whose models were leading most benchmarks in late 2025? Or why not double down on OpenAI, given Apple’s existing ChatGPT integration?

The answer appears to be infrastructure. Apple’s Private Cloud Compute requires a specific integration architecture — models can’t just be called via API from a third-party cloud. They need to run within Apple’s controlled inference environment. Google had both the model capability and the enterprise infrastructure flexibility to build a custom deployment pathway. Google Cloud’s technical reach made the integration tractable in a way that smaller lab deployments weren’t.

There’s also a prior relationship: Google has paid Apple approximately $20 billion per year to remain the default search engine on iOS. That relationship comes with deep technical integration work between the two companies. Adding AI to an existing partnership is operationally simpler than building a new one from scratch.


The Antitrust Dimension

The deal landed in a legally complicated moment.

Google was found liable in August 2024 for monopolizing the general search market. The DOJ’s remedies hearing ran through early 2025, with proposals including forcing Google to divest Chrome and end payments to device makers for default placement. The Apple default search payments — that $20 billion/year arrangement — were exhibit A in the government’s case.

Now there’s a second billion-dollar payment from Apple to Google for a second category of default placement: AI.

Bloomberg Law’s legal analysis called it “the next Microsoft antitrust case.” The argument: Google now occupies both the search bar and the AI assistant on every iPhone, making it structurally impossible for rival AI companies to reach iPhone users at the default layer. An independent AI startup can build a better product than Gemini, but it will never be the thing that answers when someone says “Hey Siri.”

The DOJ has not yet opened a formal investigation into the AI deal specifically, but regulatory observers expect it to surface in the ongoing Google remedies proceedings. EU regulators under the Digital Markets Act are also reviewing whether the dual default arrangement creates a foreclosure effect in the AI assistant market.

Neither Apple nor Google has confirmed that the deal is under regulatory scrutiny.


Rollout: What Ships and When

The integration is rolling out in two phases.

Phase 1 — iOS 26.4 (spring 2026): Siri begins using Gemini for complex multi-step queries, contextual awareness, and tasks that require world knowledge beyond what on-device models can handle. Users in this phase may notice that Siri’s responses to complicated questions feel more capable. The Gemini backend is not explicitly branded in the Siri interface.

Phase 2 — iOS 27 / iPhone 18 (September 2026): “Full Conversational Siri” launches. Multi-turn dialogue. Complex task completion. Cross-app reasoning. The rebuilt Siri — codenamed “Campos” internally, according to Bloomberg — represents the most significant redesign of the assistant since its 2011 introduction. WWDC 2026 (June 8) is expected to be the unveiling event.

Throughout both phases, Apple maintains that data processing occurs within Private Cloud Compute infrastructure, not on Google’s servers. The Gemini model runs in Apple’s environment, not Google’s. Apple has emphasized this distinction heavily in its privacy communications, though the specifics of the data sharing and processing arrangement haven’t been published in full.


What This Means for the AI Industry

The Apple-Google deal has structural implications beyond the two companies involved.

For Anthropic and OpenAI: The most coveted distribution channel in consumer AI — the default assistant on two billion devices — is now locked to a competitor for the foreseeable future. Neither company was selected. Claude won’t answer by default when an iPhone user asks a question. Neither will ChatGPT (except when explicitly invoked via “Ask ChatGPT”). The channel is closed.

For independent AI assistants: Pi, Perplexity, You.com, and a dozen other AI assistant startups now compete for users who have to actively choose to install a third-party app rather than just using the built-in option. Default placement is, historically, decisive — Google’s dominance in search was built almost entirely on it.

For device makers: Samsung, whose Bixby partnership ecosystem includes both Google and Anthropic integrations, now faces a competitor whose AI layer is far more unified. The question of whether Android device makers will structure similar agreements with frontier labs — rather than relying on Google’s default Android AI — becomes more urgent.

For regulators: The deal crystallizes the argument that AI distribution is becoming as concentrated as search distribution was in 2010. If that comparison holds, the regulatory response will take a decade to arrive and may or may not change the competitive landscape.


The Privacy Question

Apple’s privacy framing deserves scrutiny.

The company’s position is that Gemini runs on Apple’s Private Cloud Compute servers, not Google’s. Apple controls the hardware. Apple controls the network path. Gemini is, in this framing, a set of model weights that Apple is licensed to run — not a cloud service Apple is calling into.

That distinction matters legally and technically. It means Google doesn’t receive raw user queries in the traditional sense. But it also doesn’t fully resolve the question of what Google learns from training updates, model telemetry, or any data that flows back as part of the ongoing technical relationship between the two companies.

Apple hasn’t published the full technical architecture of the arrangement. The privacy story is plausible, but it’s also told entirely by the party selling privacy as a product feature.


Verdict

The Apple-Google Gemini deal is one of the most consequential AI business agreements of 2026, not because of its size — $1 billion a year is a rounding error for both companies — but because of what it signals.

Apple, the company that redefined consumer computing four times across three product categories, decided that building a frontier AI model was not a problem it could solve in the timeframe that mattered. The competitive advantage of being on-device, private, and efficient turned out not to be enough when users wanted intelligence at a scale that on-device compute couldn’t deliver.

Google gets two billion distribution touchpoints. Apple gets a Siri that might finally be worth talking to. The DOJ gets new material for its remedies brief. And the AI industry learns that distribution still beats capability, if you’re patient enough to wait for someone with the right distribution to need what you’ve built.


This article covers the January 12, 2026 Apple-Google Gemini announcement and the subsequent rollout. For WWDC 2026 details, see our WWDC 2026 preview.