Highly anticipated: A new front is emerging in the race to power the next generation of artificial intelligence, and at the center of it is a startup called Positron whose bold ambitions are gaining traction in the semiconductor industry. As companies scramble to rein in the soaring energy demands of AI systems, Positron and a handful of challengers are betting that radically different chip architectures could loosen the grip of industry giants like Nvidia and reshape the AI hardware landscape.
Founded in 2023, Positron has rapidly attracted investment and attention from major cloud providers. The startup recently raised $51.6 million in new funding, bringing its total to $75 million. Its value proposition is straightforward: deliver AI inference – the process of generating responses from trained models – far more efficiently and at significantly lower cost than current hardware.
Positron's chips are already being tested by customers such as Cloudflare. Andrew Wee, Cloudflare's head of hardware, has expressed concern over the unsustainable energy demands of AI data centers. "We need to find technical solutions, policy solutions, and other solutions that solve this collectively," he told The Wall Street Journal.
According to CEO Mitesh Agrawal, Positron's next-generation hardware will directly compete with Nvidia's upcoming Vera Rubin platform. Agrawal claims that Positron's chips can deliver two to three times better performance per dollar – and up to six times greater energy efficiency – compared to Nvidia's projected offerings.
Rather than trying to match Nvidia across every workload, Positron is focusing on a narrower but critical slice of AI inference, optimizing for speed and power savings. Its latest chip design embraces that focus, stripping away general-purpose functions so it can handle the most demanding inference tasks with maximum efficiency.
Positron isn't alone in the race for energy-efficient AI chips. Competitors like Groq and Mythic are also pursuing alternative architectures.
Groq, for example, integrates memory directly into its processors, an approach it says enables faster inference at one-third to one-sixth of the energy cost of Nvidia's GPUs. Meanwhile, tech giants such as Google, Amazon, and Microsoft are investing billions in developing their own proprietary inference hardware for internal deployment and cloud customers.
Nvidia, for its part, has acknowledged both the market's appetite for alternatives and the growing concerns around energy use. According to senior director Dion Harris, the company's new Blackwell chips deliver up to 30 times the inference efficiency of previous models.
Those claims are now being tested, as cloud providers seek practical solutions that can scale. Cloudflare has launched long-term trials of Positron's chips, with Wee noting that only one other startup has ever warranted such in-depth evaluation. Pressed on the stakes, he said that if the chips "deliver the advertised metrics, we will open the spigot and allow them to deploy in much larger numbers globally."
As competition heats up, analysts caution that improved chip efficiency alone won't be enough to counter the explosive growth in AI workloads. Historically, gains in hardware performance have quickly been consumed by new use cases and increasingly powerful models.
Still, with fresh funding, interest from major customers, and a tightly focused design, Positron has positioned itself at the center of a critical debate about the future of AI infrastructure. Whether it – or any of its rivals – can deliver on those promises could shape how the world builds, powers, and pays for AI in the years ahead.