OpenAI tested GPT-5, Claude, and Gemini on real-world tasks - the results were surprising
NurPhoto / Getty Images Follow ZDNET: Add us as a preferred source on Google. ZDNET's key takeaways AI's efficacy at work is still proving lukewarm at best. OpenAI's new evaluation measures its GDP impact in certain tasks. Companies are under pressure to justify their tools' existence. Despite so many AI tools flooding the market, promising increased productivity and even fully automated work, their impact so far has been inconsistent at best. As a recent MIT report noted, 95% of enterpr