I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it

David Gewirtz/ZDNET

Follow ZDNET: Add us as a preferred source on Google.

ZDNET's key takeaways

Claude Opus 4.8 handled uncertainty better than 4.7.

Multiple AIs helped cross-check the test results.

Even honest AIs can still rationalize bad assumptions.

Last week, Anthropic released its latest frontier large language model, Claude Opus 4.8. One of the signature features of this new release is that it is more honest and "has noticeably better judgment" than previous releases.

Also: Anthropic launches Opus 4.8, with honesty as its killer feature

But is that true? In this article, we put this claim to the test.

Before I take you through the whole testing process and some detailed results, let me bottom-line it for you. In some ways, Opus 4.8 is better than the previous Opus 4.7 model. Opus 4.7 itself is quite capable.

... continue reading