Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

M5 MacBook Pro No Longer Coming in 2025

Apple does not plan to refresh any Macs with updated M5 chips in 2025, according to Bloomberg's Mark Gurman. Updated MacBook Air and MacBook Pro models are now planned for the first half of 2026. Gurman previously said that Apple would debut the M5 MacBook Pro models in late 2025, but his newest report suggests that Apple is "considering" pushing them back to 2026. Apple is now said to be "internally targeting" a launch early next year. The current M4, M4 Pro, and M4 Max MacBook Pro mode

How attention sinks keep language models stable

We discovered why language models catastrophically fail on long conversations: when old tokens are removed to save memory, models produce complete gibberish. We found models dump massive attention onto the first few tokens as "attention sinks"—places to park unused attention since softmax requires weights to sum to 1. Our solution, StreamingLLM, simply keeps these first 4 tokens permanently while sliding the window for everything else, enabling stable processing of 4 million+ tokens instead of j
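The eviction policy described in this snippet (pin the first few tokens, slide a window over the rest) can be sketched in a few lines. This is only an illustrative toy, not the StreamingLLM implementation: the class name `SinkCache` and its parameters are hypothetical, and it tracks token ids where a real system would manage key/value tensors.

```python
from collections import deque


class SinkCache:
    """Toy cache: pins the first `num_sinks` tokens, slides over the rest.

    Hypothetical illustration only; a real KV cache stores key/value
    tensors per layer, not bare token ids.
    """

    def __init__(self, num_sinks=4, window=8):
        self.num_sinks = num_sinks
        self.sinks = []                     # never evicted
        self.recent = deque(maxlen=window)  # oldest entry drops automatically

    def append(self, token):
        # The first `num_sinks` tokens become permanent attention sinks.
        if len(self.sinks) < self.num_sinks:
            self.sinks.append(token)
        else:
            self.recent.append(token)

    def contents(self):
        # What the model would attend over: sinks plus the recent window.
        return self.sinks + list(self.recent)


cache = SinkCache(num_sinks=4, window=3)
for token_id in range(10):
    cache.append(token_id)
print(cache.contents())  # [0, 1, 2, 3, 7, 8, 9]
```

The cache size stays bounded at `num_sinks + window` no matter how long the stream runs, which is the property the snippet credits for stable long-sequence processing.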

I want everything local – Building my offline AI workspace

I want everything local — no cloud, no remote code execution. That’s what a friend said. That one-line requirement, albeit simple, would need multiple things to work in tandem to make it happen. What does a mainstream LLM (Large Language Model) chat app like ChatGPT or Claude provide at a high level? Ability to use chat with a cloud hosted LLM, Ability to run code generated by them mostly on their cloud infra, sometimes locally via shell, Ability to access the internet for new content or se

How Google's Genie 3 could change AI video - and let you build your own interactive worlds

ZDNET's key takeaways: World models could help to advance AI research, entertainment, etc. Genie 3, Google DeepMind's world model, debuted on Tuesday. Google DeepMind says Genie 3 has an "understanding" of the world. Imagine exploring a virtual environment without boundaries, where everything you see looks and behaves just as it would in reality. This is precisely what many tech developers today are working to create through AI "world models," or algorithms that can build and act up

South Korea charts one-of-a-kind course in AI race with U.S. and China

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

South Korea launches national AI model in tech race with U.S. and China

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

FLUX.1-Krea and the Rise of Opinionated Models

AI-generated images have a general look to them. Shiny, bright, waxy-skin, and over-use of bokeh. From Midjourney to Gemini to OpenAI, the AI Look is consistent. Enthusiasts and professionals wrestle with prompts and even fine-tune these models to tamp down the AI smell, with varying degrees of success. Examples of the "AI Look", provided by Krea in their technical paper. Last week, Krea launched an open model, FLUX.1-Krea, that’s built to avoid the “AI Look”. Their writeup is tremendous: it d

ChatGPT users dismayed as OpenAI pulls popular models GPT-4o, o3 and more — enterprise API remains (for now)

After announcing the release of its newest flagship model family, GPT-5, OpenAI said the model will power all of ChatGPT, and that it will sunset the existing models in the chat platform. OpenAI, through a spokesperson, told VentureBeat that GPT-5 “will replace all other models in ChatGPT, so users don’t have to pick depending on each task

Tesla exec leading development of chip tech and Dojo supercomputer is leaving company

Tesla's vice president of hardware design engineering, Pete Bannon, is leaving the company after first joining in 2016 from Apple, CNBC has confirmed. Bannon was leading the development of Tesla's Dojo supercomputer and reported directly to Musk. Bloomberg first reported on Bannon's departure, and added that Musk ordered his team to shut down, with engineers in the group getting reassigned to other initiatives. Tesla didn't immediately respond to a request for comment. Since early last year,

GPT-5: Key characteristics, pricing and system card

I’ve had preview access to the new GPT-5 model family for the past two weeks (see related video) and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM—it’s not a dramatic departure from what we’ve had before—but it rarely screws up and generally feels competent or occasionally impressive at the kinds of things I like to use models for. I’ve collected a lot of notes over the past two weeks, so I’ve decided

Topics: gpt mini model models

OpenAI’s GPT-5 is here

OpenAI has launched GPT-5, a new flagship AI model that will power the company’s next generation of ChatGPT. GPT-5, which was released Thursday, is OpenAI’s first “unified” AI model and combines the reasoning abilities of its o-series of models with the fast responses of its GPT series. The next-generation model signals a new era for ChatGPT — and its creator, OpenAI — pointing to OpenAI’s broader ambitions to develop AI systems that are more like agents than chatbots. While GPT-4 enabled AI c

OpenAI's new open-source model is basically Phi-5

OpenAI just released its first ever open-source large language models, called gpt-oss-120b and gpt-oss-20b. You can talk to them here. Are they good models? Well, that depends on what you’re looking for. They’re great at some benchmarks, of course (OpenAI would never have released them otherwise) but weirdly bad at others, like SimpleQA. Some people really like them. Others on Twitter really don’t. From what I can tell, they’re technically competent but lack a lot of out-of-domain knowledge: fo

Show HN: Octofriend, a cute coding agent that can swap between GPT-5 and Claude

Get started: run npm install --global octofriend, and then: octofriend. About: Octo is a small, helpful, cephalopod-flavored coding assistant that works with any OpenAI-compatible or Anthropic-compatible LLM API, and allows you to switch models at will mid-conversation when a particular model gets stuck. Octo can optionally use (and we recommend using) ML models we custom-trained and open-sourced (1, 2) to automatically handle tool call and code edit failures from the main coding models you're work

GPT-5 is here. Now what?

Whereas o1 was a major technological advancement, GPT-5 is, above all else, a refined product. During a press briefing, Sam Altman compared GPT-5 to Apple’s Retina displays, and it’s an apt analogy, though perhaps not in the way that he intended. Much like an unprecedentedly crisp screen, GPT-5 will furnish a more pleasant and seamless user experience. That’s not nothing, but it falls far short of the transformative AI future that Altman has spent much of the past year hyping. In the briefing, A

OpenAI nearly confirms GPT-5 launch today - how to tune in

ZDNET's key takeaways: OpenAI's livestream at 10 AM PT/1 PM ET will likely launch GPT-5. GPT-5 will automatically select the best model for prompts, improving efficiency. That approach should help produce higher-quality answers more quickly. OpenAI just launched its highly anticipated open-source models on Tuesday, but the company is already moving on to what will likely be its biggest product launch of the year: GPT-5.

Senior AI researchers desert Apple amid ‘a crisis of confidence’

There have been a number of reports of senior AI researchers leaving Apple, and the latest of these indicates the problem may be bigger than previously known. One AI recruitment company has suggested there is a crisis of confidence within Apple, with tech rivals now considering it open season on poaching the company’s engineers … We learned a month ago that Apple’s top AI exec, Ruoming Pang, had left the company to join Meta. Pang joined Apple from Google in 2021, and had been managing the ro

Splatshop: Efficiently Editing Large Gaussian Splat Models

Markus Schütz, Christoph Peters, Florian Hahlbohm, Elmar Eisemann, Marcus Magnor, Michael Wimmer. 2025-06, in Computer Graphics Forum (Proc. HPG) 44, 8. Abstract: We present Splatshop, a highly optimized toolbox for interactive editing (selection, deletion, painting, transformation, ...) of 3D Gaussian Splatting models. Utilizing a comprehensive collection of heuristic approaches, we carefully balance between exact an

The initial reactions to OpenAI’s landmark open source gpt-oss models are highly varied and mixed

OpenAI’s long-awaited return to the “open” of its namesake occurred yesterday with the release of two new large language models (LLMs): gpt-oss-120B and gpt-oss-20B. But despite achieving technical benchmarks on par with OpenAI’s other powerful proprietary AI model offerings, the broader AI developer and user community’s initial response h

Topics: ai model models open oss

The Download: OpenAI’s open-weight models, and the future of internet search

The news: OpenAI has finally released its first open-weight large language models since 2019’s GPT-2. Unlike the models available through OpenAI’s web interface, these new open models can be freely downloaded, run, and even modified on laptops and other local devices. Why it matters: These releases re-establish OpenAI as a presence for users of open models. That’s particularly notable at a time when Meta, which had previously dominated the American open-model landscape with its Llama models, ma

OpenAI returns to its open-source roots with new open-weight AI models, and it's a big deal

We all know AI relies on open-source software, but most of the big AI companies avoid opening their code or their large language model (LLM) weights. Today, things have changed. OpenAI, the artificial intelligence titan behind ChatGPT, announced a landmark return to its open-source origins. The company unveiled two new open-weight language models, gpt-oss-120b and gpt-oss-20b, marking its first public release of freely available AI model weights since GP

Topics: ai gpt models open openai

OpenAI's New Models Aren't Really Open: What to Know About Open-Weights AI

Despite the company's name, OpenAI hasn't dropped an open version of its AI models since GPT-2 in 2019. That changed on Tuesday, as CEO Sam Altman shared two new open-weights, reasoning AI models, named gpt-oss-120b (120 billion parameters) and gpt-oss-20b (20 billion parameters). If open-weights is a new piece of AI jargon to you, don't worry. In the simplest possible terms, open-weights is a category of AI models that power products like chatbots, image and video generators. But they are phil

OpenAI could launch GPT-5 any minute now - what to expect

ZDNET's key takeaways: OpenAI GPT-5 will be released "soon," which could be this week. It will automatically select the best model for prompts. It should help produce higher-quality and faster answers. Despite OpenAI just launching its highly anticipated open-source models on Tuesday, people are already on the lookout for OpenAI's next big move, with rumblings of an even bigger release on the near horizon: GPT-5.

For the first time, OpenAI models are available on AWS

Sam Altman’s blowtorch to his competitors is so hot, it even includes a new partnership with Amazon Web Services. As OpenAI announced two open-weight reasoning models with capabilities on par with its o-series, Amazon announced that the new models would become available on AWS on Tuesday. This is the first time that OpenAI models will be offered by AWS, the company confirmed to TechCrunch. They will be available as a model choice with Amazon AI services Bedrock and SageMaker AI. While anyone c

OpenAI Finally Lives Up to Its Name, Drops Two New Open Source AI Models

For the first time in five years, OpenAI has released two new free and open-source AI models that are lightweight and designed to be easily integrated into other software programs. In a blog post on Tuesday, the company characterized gpt-oss-120b and gpt-oss-20b as flexible but powerful AI algorithms that can perform a variety of tasks and be used in numerous settings. The company also included a feedback portal and a more extensive blog that further explains the models and how they work. OpenA

OpenAI releases a free GPT model that can run on your laptop

OpenAI is releasing a new open-weight model dubbed GPT-OSS that can be downloaded for free, be customized, and even run on a laptop. The model comes in two variants: 120-billion-parameter and 20-billion-parameter versions. The bigger version can run on a single Nvidia GPU and performs similarly to OpenAI’s existing o4-mini model, while the smaller version performs similarly to o3-mini and runs on just 16GB

OpenAI returns to open source roots with new models gpt-oss-120b and gpt-oss-20b

OpenAI is getting back to its roots as an open source AI company with today’s announcement and release of two new, open source, frontier large language models (LLMs): gpt-oss-120b and gpt-oss-20b. The former is a 120-billion parameter model as the name would suggest, capable of running on a single Nvidia H100 graphics processing unit (GPU)

OpenAI has finally released open-weight language models

“The vast majority of our [enterprise and startup] customers are already using a lot of open models,” said Casey Dvorak, a research program manager at OpenAI, in a media briefing about the model release. “Because there is no [competitive] open model from OpenAI, we wanted to plug that gap and actually allow them to use our technology across the board.” The new models come in two different sizes, the smaller of which can theoretically run on 16 GB of RAM—the minimum amount that Apple currently o

OpenAI releases two open-weight AI models, including one that runs well on Apple Silicon Macs

Living up to its name, OpenAI has released not one but two new open-weight AI models after promising to deliver a new open-weight model earlier this year. The two models, gpt-oss-20b and gpt-oss-120b, are available to download for free now. What makes open-weight models special? Specifically, these are AI models that can be downloaded and run on computers with adequate resources for powering local AI models. No internet connection is required because access to the model provider’s server isn’t

OpenAI's first new open-weight LLMs in six years are here

For the first time since GPT-2 in 2019, OpenAI is releasing new open-weight large language models. It's a major milestone for a company that has increasingly been accused of forgoing its original stated mission of "ensuring artificial general intelligence benefits all of humanity." Now, following multiple delays for additional safety testing and refinement, gpt-oss-120b and gpt-oss-20b are available to download from Hugging Face. Before going any further, it's worth taking a moment to clarify w