Skip to content
Tech News
← Back to articles

ChatGPT leak reveals new Bidi 1 voice model that can listen and speak simultaneously

read original more articles
Why This Matters

OpenAI's leaked Bidi 1 voice model introduces a groundbreaking bidirectional AI capable of listening, speaking, and responding simultaneously, significantly enhancing conversational fluidity. Early tests suggest it will soon be integrated into ChatGPT, transforming user interactions with more natural and dynamic voice capabilities. This development signals a major leap forward in AI communication, with potential impacts across consumer applications and the broader tech industry.

Key Takeaways

Edgar Cervantes / Android Authority

TL;DR OpenAI is reportedly testing an unannounced bidirectional voice model called “GPT-Bidi-1.”

Code references and early user tests show that the model can speak, hear, and listen simultaneously, handling mid-sentence interruptions naturally.

The unannounced model has already started rolling out to a select group of app users, hinting at an official release window this week.

OpenAI is reportedly planning to turn ChatGPT into a superapp, with a major overhaul in the pipeline. The overhaul is said to focus on OpenAI’s Codex coding tool and agentic AI tools that can perform tasks for users. But there seems to be more in store, as a new bidirectional audio model named “GPT Bidi 1” has also been spotted, which would be a massive upgrade to ChatGPT’s conversational abilities.

Bidi is said to be shorthand for bidirectional design, which allows the assistant to speak, hear, and listen simultaneously. TestingCatalog spotted references to Bidi 1 last week, with internal code presenting it as a “major leap in intelligence,” and “the next generation of Voice.”

Bidi 1 is said to sit in the model selector under settings, besides the standard and advanced options. The voice bubble turns yellow once Bidi 1 is picked.

According to a recent report from TestingCatalog, the new model has already begun rolling out to a subset of ChatGPT app users, suggesting a possible release this week.

The model is said to offer small, natural acknowledgments, like an “okay,” when you pause or slow down, without cutting you off. It is also said to switch tasks on the fly: for example, ask it to count to ten, interrupt to reverse the count, and it adjusts immediately.

BREAKING 🔥: First tests of “Bidi 1”, an upcoming bidirectional voice model from OpenAI. This upgrade will arrive in ChatGPT and, potentially, in Codex soon as well.

... continue reading