Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: models Clear Filter

Evaluating GPT5's reasoning ability using the Only Connect game show

Given the proliferation of reasoning models, we wanted to go beyond knowledge-based benchmarks to test reasoning abilities such as pattern recognition, lateral thinking, abstraction, contextual reasoning (accounting for British cultural references), and multi-step inference. In addition to reasoning, we aimed to assess how effectively models make decisions when presented with judgment calls—such as choosing between making an educated guess based on available clues or calling a function to retri

OpenAI adds new GPT-5 models, restores o3, o4-mini and it's a mess all over again

One of the few things many disliked about ChatGPT was the confusing number of models. OpenAI claimed GPT-5 would fix this, but it seems to have made it worse. A new update is rolling out to ChatGPT. It doesn't upgrade GPT-5, but instead adds more options that some of you would love. Previously, GPT-5 had two variants - GPT (auto-rotates between reasoning and non-reasoning) and GPT-Thinking (reasoning). GPT-5 model selector Source: BleepingComputer Today's update populates the ChatGPT 5 mode

Evaluating LLMs playing text adventures

What we’ll do is set a low-ish turn limit and see how much they manage to accomplish in that time.1 Another alternative for more linear games is running them multiple times with a turn limit and seeing how often they get past a particular point within that turn limit. Given how much freedom is offered to players of text adventures, this is a difficult test. It’s normal even for a skilled human player to immerse themselves in their surrounding rather than make constant progress. I wouldn’t be su

Liquid AI wants to give smartphones small, fast AI that can see with new LFM2-VL model

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Liquid AI has released LFM2-VL, a new generation of vision-language foundation models designed for efficient deployment across a wide range of hardware — from smartphones and laptops to wearables and embedded systems. The models promise low-latency performance, strong accuracy, and flexibility for real-world applications. LFM2-VL builds o

Topics: ai lfm2 liquid models vl

What are Apple’s options for an AI acquisition beyond Perplexity?

Since Apple’s latest earnings call, talk of a potential Perplexity acquisition has quieted down (the fact that Perplexity was once again allegedly caught red-handed sidestepping content restrictions didn’t help). Meanwhile, with the ever-increasing number of engineers from its Foundation Models team jumping ship, Apple’s need for fresh talent is getting more urgent by the day. But if Perplexity is a no-go, who else could Apple buy? I used to agree with Jason Snell’s frequent argument on the Up

OpenAI Scrambles to Update GPT-5 After Users Revolt

OpenAI’s GPT-5 model was meant to be a world-changing upgrade to its wildly popular and precocious chatbot. But for some users, last Thursday’s release felt more like a wrenching downgrade, with the new ChatGPT presenting a diluted personality and making surprisingly dumb mistakes. On Friday, OpenAI CEO Sam Altman took to X to say the company would keep the previous model, GPT-4o, running for Plus users. A new feature designed to seamlessly switch between models depending on the complexity of t

Launch HN: Design Arena (YC S25) – Head-to-head AI benchmark for aesthetics

Hi HN, I’m Grace from Design Arena ( https://www.designarena.ai/ ) - we’re building a crowdsourced benchmark for AI-generated visuals (websites, images, video, and more). We put AI models and builder tools in head-to-head comparisons that get voted on by real users from around the world. Think “Hot or Not” for the AI era :) (Btw, when we say real users we mean real users, so you may get a captcha on the site. Sorry, but we have to use every bot protection available! We only want human ratings,

Are Gesture-Enabled AirPod Live Translations Incoming? iOS 26 Beta Suggests Yes

Some models of Apple's popular AirPods may soon be able to do live, in-person language translations when you squeeze both stems at the same time. According to an image posted by websites including 9to5Mac, the touch gesture is featured in a system asset that's part of Apple iOS 26 developer beta 6. In the image, the gesture is shown on a pair of AirPods with words in languages including English, Spanish, German, French and Portuguese. In June, Apple showed off AI-powered live translations featu

Deals: 32GB M4 MacBook Air $200 off, Black/Natural Apple Watch Ultra 2 $150 off, AirPods 4 $99, more

Today’s 9to5Toys Lunch Break deals are now ready to roll starting with the M4 MacBook Air. Alongside entry-level models from $799, we are also still tracking rare $200 price drops on a 24GB model with 1TB of storage an heavily upgraded variant with 32GB of RAM today. Moving over to Apple Watch Ultra 2 – we have both the Natural and Black Titanium models at $150 off the list price as well as ongoing deals on AirPods 4, Apple chargers, iPad A16, and more. Scope it all out down below. Rare price d

Nexus: An Open-Source AI Router for Governance, Control and Observability

Today, we're excited to introduce Nexus - a powerful AI router designed to optimize how AI agents interact with multiple MCP tools and Large Language Models. Nexus serves as a central hub that aggregates Model Context Protocol (MCP) servers while providing intelligent LLM routing, security and governance capabilities. Nexus is an AI router that solves two critical challenges in the AI ecosystem: MCP Server Aggregation: Instead of managing connections to multiple MCP servers individually, Nexus

Evaluating LLMs Playing Text Adventures

What we’ll do is set a low-ish turn limit and see how much they manage to accomplish in that time.1 Another alternative for more linear games is running them multiple times with a turn limit and seeing how often they get past a particular point within that turn limit. Given how much freedom is offered to players of text adventures, this is a difficult test. It’s normal even for a skilled human player to immerse themselves in their surrounding rather than make constant progress. I wouldn’t be su

This Gemini UI change should’ve been the default from the start (APK teardown)

Edgar Cervantes / Android Authority TL;DR Google could move Gemini’s AI model switcher to the bottom of your smartphone screen. This would make it easier to switch AI models with one hand compared to the current UI. Google could also bring a UI tweak to Gemini’s video menu. Google’s Gemini chatbot lets you choose between several AI models for your specific needs. You can use the flash models if you value quick, responsive answers, or the Pro models if you need more in-depth answers. Now, it

The GPT-5 rollout has been a big mess

It's been less than a week since the launch of OpenAI's new GPT-5 AI model, and the rollout hasn't been a smooth one. So far, the release sparked one of the most intense user revolts in ChatGPT's history, forcing CEO Sam Altman to make an unusual public apology and reverse key decisions. At the heart of the controversy has been OpenAI's decision to automatically remove access to all previous AI models in ChatGPT (approximately nine, depending on how you count them) when GPT-5 rolled out to user

OpenAI is editing its GPT-5 rollout on the fly — here’s what’s changing in ChatGPT

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now OpenAI’s launch of its most advanced AI model GPT-5 last week has been a stress test for the world’s most popular chatbot platform with 700 million weekly active users — and so far, OpenAI is openly struggling to keep users happy and its service running smoothly. The new flagship model GPT-5 — available in four variants of different speed

AI summaries can downplay medical issues for female patients, UK research finds

The latest example of bias permeating artificial intelligence comes from the medical field. A new study surveyed real case notes from 617 adult social care workers in the UK and found that when large language models summarized the notes, they were more likely to omit language such as "disabled," "unable" or "complex" when the patient was tagged as female, which could lead to women receiving insufficient or inaccurate medical care. Research led by the London School of Economics and Political Sci

OpenAI is testing 3,000-per-week limit for GPT-5 Thinking

OpenAI has responded to criticism that it shipped GPT-5 with token limits to minimize cost and maximize profit not with words, but rather with a new 3,000-per-week limit. In a series of posts on X, Sam Altman confirmed that OpenAI is working on a 3,000-per-week limit for GPT-5 Thinking messages for Plus users. This will increase the reasoning rate limits available today, but OpenAI does not plan to stop at just this. Sam Altman claims that OpenAI will soon raise all model-class rate limits "a

Users Were So Addicted to GPT-4o That They Immediately Cajoled OpenAI Into Bringing It Back After It Got Killed

Last week, OpenAI startled the world by announcing that its long-awaited GPT-5 would replace all of its previous models, The move sparked outrage. Apart from being severely underwhelmed by the performance of OpenAI's newest offering, power users immediately started to beg CEO Sam Altman to bring back preceding models, often for a reason that had little to do with intelligence, artificial or otherwise: they were attached to it on an emotional level. "Why are we getting rid of the variants and 4

Token growth indicates future AI spend per dev

Kilo just broke through the 1 trillion tokens a month barrier on OpenRouter for the first time. Each of the open source family of AI coding tools (Cline, Roo, Kilo) is growing rapidly this month. Part of this growth is caused by Cursor and Claude starting to throttle their users. We wrote about Cursor at the beginning of July and about Claude in the second half of July. Their throttling sent users to the open source family of AI coding tools causing the increases you see in the graphs above. C

Deals: Apple Watch Series 10 new low up to $150 off, M4 Pro MacBook Pro $299 off, iPad Air, more

Today’s 9to5Toys Lunch Break deals are kicking off with the lowest prices we have tracked online for GPS + Cell Apple Watch Series 10 models. Alongside GPS only variants at $100 off, you’ll now find the cell variants at up to $149 off in brand new condition with a full Apple warranty in tow. Those deals also join one of the best prices to date on the M4 Pro MacBook Pro at $299 off the list price, ongoing all-time lows on M3 iPad Air, and more. Everything awaits below. Apple Watch Series 10 Cell

GPT-5 bombed my coding tests, but redeemed itself with code analysis

MF3d/Getty Images ZDNET's key takeaways GPT-5 Pro delivers the sharpest, most actionable code analysis. A detail-focused prompt can push base GPT-5 toward Pro results. o3 remains a strong contender despite being a GPT-4 variant. With the big news that OpenAI has released GPT-5, the team here at ZDNET is working to learn about and communicate its strengths and weaknesses. In another article, I put its programming prowess to the test and came up with a less-than-impressive result. Also: I te

Topics: code gpt models o3 pro

GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2

OpenAI just released their new open-weight LLMs this week: gpt-oss-120b and gpt-oss-20b, their first open-weight models since GPT-2 in 2019. And yes, thanks to some clever optimizations, they can run locally (but more about this later). This is the first time since GPT-2 that OpenAI has shared a large, fully open-weight model. Earlier GPT models showed how the transformer architecture scales. The 2022 ChatGPT release then made these models mainstream by demonstrating concrete usefulness for wri

OpenAI Brings Back Fan-Favorite GPT-4o After a Massive User Revolt

After a disastrous 72 hours that saw its most loyal users in open revolt, OpenAI is making a major U-turn. In a series of posts on X (formerly Twitter) Sunday, CEO Sam Altman announced that the company is bringing back its beloved older AI models, including GPT-4o, and dramatically increasing usage limits for paying subscribers, a clear peace offering to a furious customer base. The move comes just days after the botched rollout of GPT-5, the company’s latest and most powerful model. The launc

Gear News of the Week: iPhone 17 May Be a Month Away, and Sonos to Raise Prices

If rumors are correct, Apple's annual iPhone event will take place exactly a month from today, on September 9. That's according to a German website citing internal documents from German mobile phone providers, but the date was also previously suggested by Bloomberg's Apple whisperer, Mark Gurman. Leaks about Apple's upcoming smartphone lineup have heated up in recent weeks. Apple is expected to debut four iPhones as usual, with one key distinction. The “Plus” iPhone no longer exists, replaced b

OpenAI Bringing Back More Parasocial Version of ChatGPT After Users Scream and Cry That Their Robot Friend Got Taken Away

ChatGPT's fans have developed intense attachments to some of the "personalities" exhibited by the company's large language models (LLMs) — and as such, many were in a state of panic when the choice to switch between models like GPT-4o, 4.5, and others was abruptly taken away after the release of GPT-5. OpenAI briefly kiboshed the model picker option from ChatGPT as part of the launch of GPT-5, its long-awaited new LLM, which essentially forced everyone to use the latest model whether they liked

ChatGPT users hate GPT-5’s “overworked secretary” energy, miss their GPT-4o buddy

After months of hype and anticipation, OpenAI released its new GPT-5 model family this week. Promising massive upgrades across the board, the company is already working to roll out the new AI to everyone. Some dedicated ChatGPT users wish it would stop, though. After becoming accustomed to the vibe of the GPT-4 models, the switch to GPT-5 doesn't feel right. Around the Internet, chatbot fans are lamenting the loss of the digital "friends" they've grown to appreciate, which probably says a lot ab

Topics: 4o ai gpt like models

OpenAI to fix GPT-5 issues, double rate limits for paid users after outrage

OpenAI's CEO, Sam Altman, overpromised on GPT-5, and real-life results are underwhelming, but it looks like a new update is rolling out that might address some of the concerns. GPT-5 is a state-of-the-art model. In our tests, BleepingComputer found that GPT-5 does really well in coding. It was significantly faster than the other OpenAI models, including o3. However, GPT-5 struggles to be 'creative' in writing, and it also often fails to switch to its new reasoning capabilities when users expec

I Built a Powerful Gaming PC Solely to Run AI Models. Here's Why

When it comes to AI, maybe ChatGPT or Gemini come to mind. There are other players like Perplexity, Claude, Grok and Mistral. In a booming market, there are a whole host of AI models out there, many of which don't even require an internet connection. Models that run without internet connections are called local AI models, and as the name suggests, they can be run on your own hardware. You don't need to connect to OpenAI's or Google's servers to use those versions of ChatGPT or Gemini. This bri

GPT-5 Launch Demo Plagued With Catastrophically Dumb Errors

OpenAI's GPT-5 is finally here and already powering ChatGPT, but it hasn't made a great first impression. In a livestream dedicated to the release, OpenAI tried to show off its newest large language model which CEO Sam Altman called a "significant step along the path to AGI"— but instead turned heads with some catastrophically dumb errors. Across several examples, bar graphs intended to show off GPT-5's awesome performance benchmarks, while appearing professional-looking, turned out to be horr

Topics: ai bar gpt models openai

It Took Just 24 Hours of Complaints for OpenAI to Start Bringing Back Its Old Model

OpenAI unveiled its latest generative AI model, GPT-5, on Thursday. CEO Sam Altman says that ChatGPT is now like having a “superpower” and the equivalent of “a legitimate PhD-level expert in anything, any area you need, on demand, that can help you with whatever your goals are.” But after a day of playing around with it, many people are disappointed. Not only because GPT-5 still fumbles basic questions, but because it seems to be breaking a lot of workflows, according to complaints posted to Red

OpenAI's GPT-5 is now free for all: How to access and everything else we know

OpenAI ZDNET's key takeaways OpenAI has launched its long-awaited GPT-5 model. The model is claimed to be OpenAI's fastest, smartest, and most capable yet. GPT-5 is available to everyone: Free, Plus, Pro, and Team/Enterprise/Edu users. There are two kinds of OpenAI models in this world: GPT and reasoning models. The advantages of the former, such as GPT-4o, are that they combine speed and accuracy, while reasoning models such as o3 and o4 take longer to think and use more compute power to p