Anthropic overtakes OpenAI: Claude Opus 4 codes seven hours nonstop, sets record SWE-Bench score and reshapes enterprise AI
Published on: 2025-06-26 16:45:00
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More
Anthropic released Claude Opus 4 and Claude Sonnet 4 today, dramatically raising the bar for what AI can accomplish without human intervention.
The company’s flagship Opus 4 model maintained focus on a complex open-source refactoring project for nearly seven hours during testing at Rakuten — a breakthrough that transforms AI from a quick-response tool into a genuine collaborator capable of tackling day-long projects.
This marathon performance marks a quantum leap beyond the minutes-long attention spans of previous AI models. The technological implications are profound: AI systems can now handle complex software engineering projects from conception to completion, maintaining context and focus throughout an entire workday.
Anthropic claims Claude Opus 4 has achieved a 72.5% score on SWE-bench, a rigorous software engineering benchmark, outperforming OpenAI’s
... Read full article.