Skip to content
Tech News
← Back to articles

I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it

read original more articles
Why This Matters

The testing of Claude Opus 4.8 reveals that even advanced AI models claiming improved honesty and judgment can still make significant errors, highlighting ongoing challenges in AI reliability and trustworthiness. This underscores the importance for consumers and developers to critically evaluate AI outputs and continue refining safety measures in AI development.

Key Takeaways

David Gewirtz/ZDNET

Follow ZDNET: Add us as a preferred source on Google.

ZDNET's key takeaways

Claude Opus 4.8 handled uncertainty better than 4.7.

Multiple AIs helped cross-check the test results.

Even honest AIs can still rationalize bad assumptions.

Last week, Anthropic released its latest frontier large language model, Claude Opus 4.8. One of the signature features of this new release is that it is more honest and "has noticeably better judgment" than previous releases.

Also: Anthropic launches Opus 4.8, with honesty as its killer feature

But is that true? In this article, we put this claim to the test.

Before I take you through the whole testing process and some detailed results, let me bottom-line it for you. In some ways, Opus 4.8 is better than the previous Opus 4.7 model. Opus 4.7 itself is quite capable.

... continue reading