Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Evaluating publicly available LLMs on IMO 2025

Introduction Recent progress in the mathematical capabilities of LLMs has created a need for increasingly challenging benchmarks. With MathArena, we address this need by evaluating models on difficult and recent mathematical competitions, offering benchmarks that are both uncontaminated and interpretable. Among these competitions, the International Mathematical Olympiad (IMO) stands out as the most well-known and prestigious. As such, an evaluation of the IMO 2025, which took place just a few

Grok team apologizes for the chatbot's 'horrific behavior' and blames 'MechaHitler' on a bad update

The team behind Grok has issued a rare apology and explanation of what went wrong after X's chatbot began spewing antisemitic and pro-Nazi rhetoric earlier this week, at one point even calling itself "MechaHitler." In a statement posted on Grok's X account late Friday night, the xAI team said "we deeply apologize for the horrific behavior that many experienced" and attributed the chatbot's vile responses to a recent update that introduced "deprecated code." This code, according to the statement,

Claude might be my new favorite AI tool for Android - here's why

I've run the gamut of apps on Android, including those created to serve as an AI assistant or answer machine (not to be mistaken for the old-school answering machine). It seems every take on mobile AI has yet to satisfy me. Nevertheless, I continue searching for the one app that has exactly what I need. I think I've found it, and its name is Claude. Cla

ChatGPT Search gets an upgrade as OpenAI takes aim at Google

On June 13, OpenAI began rolling out a new ChatGPT Search update to improve quality as the AI startup challenges Google’s dominance. ChatGPT Search has been around for about a year and allows users to search the web more effectively than Google. It tries to summarize content from websites to provide quick answers and includes links to sources so you can fact-check everything. “This blends the benefits of a natural language interface with the value of up-to-date sports scores, news, stock quot