Is Opus 4.5 really 'the best model in the world for coding'? It just failed half my tests

Screenshot by David Gewirtz/ZDNET

Follow ZDNET: Add us as a preferred source on Google.

ZDNET's key takeaways

Opus 4.5 failed half my coding tests, despite bold claims

File handling glitches made basic plugin testing nearly impossible

Two tests passed, but reliability issues still dominate the story

I've got to tell you: I've had fairly okay coding results with Claude's lower-end Sonnet AI model. But for whatever reason, its high-end Opus model has never done well on my tests.

Usually, you expect the super-duper coding model to code better than the cheap seats, but with Opus, not so much.

Also: Google's Antigravity puts coding productivity before AI hype - and the result is astonishing

Now, we're back with Opus 4.5. Anthropic, the company behind Claude claims, and I quote, "Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient, and the best model in the world for coding, agents, and computer use."

... continue reading