Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: gdpval Clear Filter

OpenAI tested GPT-5, Claude, and Gemini on real-world tasks - the results were surprising

NurPhoto / Getty Images Follow ZDNET: Add us as a preferred source on Google. ZDNET's key takeaways AI's efficacy at work is still proving lukewarm at best. OpenAI's new evaluation measures its GDP impact in certain tasks. Companies are under pressure to justify their tools' existence. Despite so many AI tools flooding the market, promising increased productivity and even fully automated work, their impact so far has been inconsistent at best. As a recent MIT report noted, 95% of enterpr

OpenAI says GPT-5 stacks up to humans in a wide range of jobs

OpenAI released a new benchmark on Thursday that tests how its AI models perform compared to human professionals across a wide range of industries and jobs. The test, GDPval, is an early attempt at understanding how close OpenAI’s systems are to outperforming humans at economically valuable work — a key part of the company’s founding mission to develop artificial general intelligence or AGI. OpenAI says its found that its GPT-5 model and Anthropic’s Claude Opus 4.1 “are already approaching the