Skip to content
Tech News
← Back to articles

Report details Apple’s plan to use Nvidia chips for the Gemini-powered Siri

read original get Nvidia GeForce RTX 4090 → more articles
Why This Matters

Apple's integration of Nvidia's Blackwell B200 chips for Siri's AI processing highlights a strategic move towards enhanced privacy and high-performance AI capabilities. By leveraging Nvidia's advanced hardware and confidential compute technology via Google Cloud, Apple aims to deliver more secure and efficient AI services to consumers, signaling a focus on privacy-preserving AI infrastructure in the industry.

Key Takeaways

A new report from The Information shares fresh details on how the new Gemini-powered Siri will work under the hood. Here are the details.

Apple doubles down on privacy assurances amid AI partnerships

A few days ago, The Information reported on some of the technical aspects behind Apple’s AI plans for WWDC, including how the company is expected to use Nvidia chips through Google Cloud for some Siri queries.

From the original report:

[…] as part of an Apple agreement with Google, some user queries to a new version of Siri will run in Google Cloud on a licensed version of the search giant’s Gemini model. Apple recently approved the use of a privacy technology from Nvidia in that setting, suggesting it will use Nvidia AI chips for at least some of its computing needs in Google Cloud, according to people familiar with the matter.

Now, The Information has published a new report, with fresh details on how Apple’s use of Nvidia chips is going to work:

Specifically, Apple will tap into Google’s fleet of Nvidia’s Blackwell B200 data center chips, said the people. Apple will enable Nvidia’s confidential compute feature that encrypts data as it’s being processed on the chips.

The Nvidia Blackwell B200 is one of Nvidia’s flagship data center GPUs for large-scale AI training and inference. It is based on Nvidia’s Blackwell architecture, which is the successor to Hopper.

Nvidia positions Blackwell as a platform for running and training very large AI models (including trillion-parameter models), with major improvements in inference, memory bandwidth, and multi-GPU scaling compared with the Hopper architecture.

As for the confidential compute feature, it is a hardware-based security system that protects data while it is actively being processed by Nvidia GPUs.

... continue reading