Skip to content
Tech News
← Back to articles

Tech titans team up to form optical interconnect alliance to solve the AI buildout's big data bottleneck — Nvidia, AMD, Broadcom & more set sights on building PHY to break through the limitations of copper

read original get Optical Interconnect Kit → more articles
Why This Matters

The formation of the optical interconnect alliance marks a significant step toward overcoming the limitations of copper-based data transfer in AI data centers. By standardizing optical interconnects, industry leaders aim to boost data transfer speeds, reduce power consumption, and address supply chain constraints, enabling more scalable and efficient AI infrastructure. This shift could accelerate AI development and deployment, benefiting both industry and consumers through faster, more reliable AI services.

Key Takeaways

This week, AMD, Broadcom, Nvidia, OpenAI, Meta, and Microsoft announced plans to standardize a protocol-agnostic, scale-up interconnect for AI data centers. The Optical Compute Interconnect Multi-Source Agreement (OCI MSA) group is tasked with defining an open connectivity specification for optical interconnections in AI data centers. This would allow for higher domain scale-up sizes, and enable a multi-vendor supply chain for optical interconnects, which the ongoing AI infrastructure buildout demands.

The group's primary goal is to enable data centers to scale by using optical interconnections rather than relying solely on copper, which is currently hitting its physical limits for optimal data transfer speeds and power consumption. Copper is also facing significant supply chain constraints, and an industry-wide shift to optical interconnections would alleviate some of this demand. An optical interconnection would also bolster data transfer speeds, crucial for large-scale AI workloads.

For copper, pushing electrical signals to high speeds results in signal degradation and unsustainable levels of power consumption; the solution is optical interconnects. Copper is inherently a lossy, resistant medium, necessitating huge amounts of power to send data over distances at high speeds. An optical physical layer (PHY) can overcome this electrical resistance challenge, allowing for higher-speed data transmission. The goal of the newly-founded group is to develop a PHY capable of delivering up to 3.2Tb/s and beyond.

Article continues below

Given that power is an ongoing challenge in AI data center buildouts, a solution that not only stabilizes power usage but also increases interconnection speeds seems like an obvious choice.

Optical cables would let more systems be concurrently connected over greater distances, without many of the penalties that come with copper, improving scale-up domains. However, optics also comes with its own downsides: Failure rates, increased heat output, higher overall costs, and overall failure rates. With the technology nascent in its application, new standards must be created in order for it to mature.

Going platform-agnostic

(Image credit: Intel)

"Optical cables and the silicon photonics technology already exist when it comes to connecting different switches as part of a pluggable transfer ecosystem.” said Vivek Raghunathan, CEO and co-founder at Xscape Photonics, in a recent interview with Tom's Hardware Premium. Indeed, TSMC's COUPE technology is a foundational bedrock for enabling optical and photonics in chips; where the new OCI MSA standard comes in is to enable these physical chips to effectively travel across the same lanes.

The open standard developed by the OCI MSA would allow multiple vendors within the optical supply chain to offer components to a singlular, unified spec. In theory, it should drive down the cost of optics and optical interconnects at scale. Moreover, it divests the sole reliance on TSMC, as the standard would ensure interoperability between products and chips made with COUPE, and those using alternative CPO packaging platforms, effectively de-risking optical supply chains in the process.

... continue reading