Screenshot by David Gewirtz/ZDNET
Follow ZDNET: Add us as a preferred source on Google.
ZDNET's key takeaways
Opus 4.5 failed half my coding tests, despite bold claims
File handling glitches made basic plugin testing nearly impossible
Two tests passed, but reliability issues still dominate the story
I've got to tell you: I've had fairly okay coding results with Claude's lower-end Sonnet AI model. But for whatever reason, its high-end Opus model has never done well on my tests.
Usually, you expect the super-duper coding model to code better than the cheap seats, but with Opus, not so much.
Also: Google's Antigravity puts coding productivity before AI hype - and the result is astonishing
Now, we're back with Opus 4.5. Anthropic, the company behind Claude claims, and I quote, "Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient, and the best model in the world for coding, agents, and computer use."
... continue reading