Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more


How attention sinks keep language models stable

We discovered why language models catastrophically fail on long conversations: when old tokens are removed to save memory, models produce complete gibberish. We found models dump massive attention onto the first few tokens as "attention sinks"—places to park unused attention since softmax requires weights to sum to 1. Our solution, StreamingLLM, simply keeps these first 4 tokens permanently while sliding the window for everything else, enabling stable processing of 4 million+ tokens instead of j
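
The cache policy described in this snippet (pin the first few tokens, slide a window over the rest) can be sketched roughly as follows. This is a toy illustration only, not the StreamingLLM implementation: the names `softmax` and `SinkWindowCache` and the `window` parameter are hypothetical, and the cache stores token positions rather than real key/value tensors.

```python
import math
from collections import deque


def softmax(scores):
    """Numerically stable softmax; the returned weights always sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class SinkWindowCache:
    """Hypothetical eviction policy: keep the first `n_sink` entries forever,
    plus a sliding window holding the `window` most recent entries."""

    def __init__(self, n_sink=4, window=8):
        self.n_sink = n_sink
        self.sinks = []                     # never evicted
        self.recent = deque(maxlen=window)  # oldest entry drops automatically

    def append(self, item):
        if len(self.sinks) < self.n_sink:
            self.sinks.append(item)
        else:
            self.recent.append(item)

    def kept(self):
        """Entries that attention would still be able to see."""
        return self.sinks + list(self.recent)


cache = SinkWindowCache(n_sink=4, window=3)
for position in range(10):
    cache.append(position)
print(cache.kept())  # [0, 1, 2, 3, 7, 8, 9]
```

Because softmax weights must sum to 1, attention always has to land somewhere; in this sketch the four pinned positions stay available as that landing spot while memory use stays bounded at `n_sink + window` entries.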

The surprise deprecation of GPT-4o for ChatGPT consumers

I’ve been dipping into the r/ChatGPT subreddit recently to see how people are reacting to the GPT-5 launch, and so far the vibes there are not good. This AMA thread with the OpenAI team is a great illustration of the single biggest complaint: a lot of people are very unhappy to lose access to the much older GPT-4o, previously ChatGPT’s default model for most users. A big surprise for me yesterday was that OpenAI simultaneously retired ac

Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?

Sam said yesterday that ChatGPT handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or painfully slow speeds. Sure, they have huge GPU clusters, but there must be more going on - model optimizations, sharding, custom hardware, clever load balancing, etc. What engineering tricks make this possible at such massive scale while keeping latency low? Curious to hear insights from people who've built large-scale ML systems.

OpenAI’s GPT-5 rollout is not going smoothly

The launch of OpenAI’s long-anticipated new model, GPT-5, is off to a rocky start, to say the least. Even forgiving errors in charts and voice demos during yesterday’s livestreamed presentation of the new model (actually four separate models, and a ‘Thinking’ mode that can be engaged for three of them), a number of user reports have emerge

Topics: 2025 ai gpt model users

I want everything local – Building my offline AI workspace

I want everything local — no cloud, no remote code execution. That’s what a friend said. That one-line requirement, albeit simple, would need multiple things to work in tandem to make it happen. What does a mainstream LLM (Large Language Model) chat app like ChatGPT or Claude provide at a high level? Ability to use chat with a cloud hosted LLM, Ability to run code generated by them mostly on their cloud infra, sometimes locally via shell, Ability to access the internet for new content or se

OpenAI priced GPT-5 so low, it may spark a price war

OpenAI astounded the tech industry for the second time this week by launching its newest flagship model, GPT-5, just days after releasing two new freely available models under an open source license. OpenAI CEO Sam Altman went so far as to call GPT-5 “the best model in the world.” That may be pride or hyperbole, as TechCrunch’s Maxwell Zeff reports that GPT-5 only slightly outperforms other leading AI models from Anthropic, Google DeepMind, and xAI on some key benchmarks, and slightly lags on o

Google Working on Fix for Glum Gemini, Stuck in 'Infinite Loop' of Self-Esteem Issues

Maybe Google Gemini needs to take some PTO. The company's large language AI model, which is increasingly spreading across Google's many services and products, has been saying some things lately that are leading users to worry: Does Gemini have low self esteem? A series of posts on social media showing some of the self-critical responses Gemini has given users show a disturbing pattern. For instance, in one screenshot, Gemini admits it can't solve a coding problem and concludes, "I have failed.

Bank of England chief says no rift with UK government as Revolut licence delay draws scrutiny

LONDON - Bank of England Governor Andrew Bailey told CNBC there hasn't been a "falling out" with the U.K. government over delays to fintech giant Revolut's long-awaited bank license. Last week, the Financial Times reported that a meeting arranged by British Finance Minister Rachel Reeves with Revolut and the Prudential Regulation Authority (PRA), an arm of the BOE that oversees banks, was cancelled af

Apple will bring GPT-5 to Apple Intelligence in iOS, iPad OS and macOS 26

OpenAI finally released its long-awaited GPT-5 model this week, unsurprisingly proclaiming it its best yet with regard to coding, accuracy, safety and more. In a press briefing ahead of the announcement, CEO Sam Altman even compared the jump in quality to when the iPhone first adopted a Retina display. Big talk indeed. Given ChatGPT’s integration with Apple Intelligence, you might be wondering when the latest model will arrive on the devices that support it. The answer is sooner rather than

How Google's Genie 3 could change AI video - and let you build your own interactive worlds

ZDNET's key takeaways: World models could help to advance AI research, entertainment, etc. Genie 3, Google DeepMind's world model, debuted on Tuesday. Google DeepMind says Genie 3 has an "understanding" of the world. Imagine exploring a virtual environment without boundaries, where everything you see looks and behaves just as it would in reality. This is precisely what many tech developers today are working to create through AI "world models," or algorithms that can build and act up

South Korea charts one-of-a-kind course in AI race with U.S. and China

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

Here’s everything OpenAI announced at its GPT-5 event

During an uncharacteristically long video stream yesterday, OpenAI announced GPT-5, alongside a series of interface and usability improvements to its chatbot. Here’s everything new with ChatGPT. One model to rule them all: After many years of confusion with similarly-named models imbued with overlapping abilities, OpenAI finally streamlined the user experience and trimmed down its model offerings to GPT-5, GPT-5 Thinking, and GPT-5 Pro (limited to the US$200/mo plan). OpenAI says that ChatGPT wil

South Korea launches national AI model in tech race with U.S. and China

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

Microsoft rolls out GPT-5 across its Copilot suite - here's where you'll find it

ZDNET's key takeaways: Microsoft is rolling out GPT-5 to all its AI offerings on Thursday. The Copilot chatbot will provide GPT-5, even to free users. GPT-5 will also be available in coding and enterprise tools. OpenAI released its much-anticipated upgrade to the engine that powers ChatGPT and many other AI implementations, including Microsoft's AI offerings, on Thursday. Concurrent with the GPT-5 release, Microsoft announced that it is upgrading its consu

FLUX.1-Krea and the Rise of Opinionated Models

AI-generated images have a general look to them. Shiny, bright, waxy-skin, and over-use of bokeh. From Midjourney to Gemini to OpenAI, the AI Look is consistent. Enthusiasts and professionals wrestle with prompts and even fine-tune these models to tamp down the AI smell, with varying degrees of success. Last week, Krea launched an open model, FLUX.1-Krea, that’s built to avoid the “AI Look”. Their writeup is tremendous: it d

Rocket Report: Firefly lights the markets up; SpaceX starts selling trips to Mars

Welcome to Edition 8.06 of the Rocket Report! After years of disappointing results from SPACs and space companies, it is a good sign to see Firefly's more traditional initial public offering doing so well. The company has had such a long and challenging road over more than a decade; the prospect of their success should be heartening to the commercial space industry. As always, we welcome reader submissions. If you don't want to miss an issue, please subscribe using the box below (the form will

Microsoft’s new Copilot 3D feature is great for Ikea, bad for my dog

While Microsoft was busy updating Copilot yesterday with OpenAI’s new GPT-5 model, it also quietly launched Copilot 3D. It’s a free-to-use feature that can transform a regular 2D image into a 3D model that can then be used in game creation, animation, 3D printing, VR / AR, and much more. C

Benchmarking GPT-5 on 400 real-world code reviews

GPT-5 is now available in Qodo’s platform for all free and paid users. At Qodo, we believe benchmarks should reflect how developers actually work. That’s why we built the PR Benchmark, a benchmark designed to assess how well language models handle tasks like code review, suggesting improvements, and understanding developer intent. Unlike many public benchmarks, the PR Benchmark is private, and its data is not publicly released. This ensures models haven’t seen it during train

ChatGPT users dismayed as OpenAI pulls popular models GPT-4o, o3 and more — enterprise API remains (for now)

After announcing the release of its newest flagship model family, GPT-5, OpenAI said the model will power all of ChatGPT, and that it will sunset the existing models in the chat platform. OpenAI, through a spokesperson, told VentureBeat that GPT-5 “will replace all other models in ChatGPT, so users don’t have to pick depending on each task

Microsoft rolls out GPT-5 across its Copilot suite - here's what we know

ZDNET's key takeaways: Microsoft is rolling out GPT-5 to all its AI offerings on Thursday. The Copilot chatbot will provide GPT-5, even to free users. GPT-5 will also be available in coding and enterprise tools. OpenAI released its much-anticipated upgrade to the engine that powers ChatGPT and many other AI implementations, including Microsoft's AI offerings, on Thursday. Concurrent with the GPT-5 release, Microsoft announced that it is upgrading i

Tesla exec leading development of chip tech and Dojo supercomputer is leaving company

Tesla's vice president of hardware design engineering, Pete Bannon, is leaving the company after first joining in 2016 from Apple, CNBC has confirmed. Bannon was leading the development of Tesla's Dojo supercomputer and reported directly to Musk. Bloomberg first reported on Bannon's departure, and added that Musk ordered his team to shut down, with engineers in the group getting reassigned to other initiatives. Tesla didn't immediately respond to a request for comment. Since early last year,

Achieving 10,000x training data reduction with high-fidelity labels

Classifying unsafe ad content has proven an enticing problem space for leveraging large language models (LLMs). The inherent complexity involved in identifying policy-violating content demands solutions capable of deep contextual and cultural understanding, areas of relative strength for LLMs over traditional machine learning systems. But fine-tuning LLMs for such complex tasks requires high-fidelity training data that is difficult and expensive to curate at the necessary quality and scale. Stan

GPT-5: Key characteristics, pricing and system card

I’ve had preview access to the new GPT-5 model family for the past two weeks (see related video) and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM (it’s not a dramatic departure from what we’ve had before), but it rarely screws up and generally feels competent or occasionally impressive at the kinds of things I like to use models for. I’ve collected a lot of notes over the past two weeks, so I’ve decided

Topics: 00 gpt mini model models

GPT-5: Here's What's New in ChatGPT's Big Update

Expect your ChatGPT experience to get faster and smarter today. OpenAI updated its flagship line of large language models Thursday, unveiling the GPT-5 generative AI model after months of anticipation. While the developer has released a lot of model updates in recent months, including new open-weights models just this week, it's been more than two years since the debut of GPT-4. With a new generation worthy of a new number, how big of a change should you expect? "I tried going back to GPT-4 an

OpenAI’s most powerful AI model is here and free for everyone

TL;DR: OpenAI has announced the launch of GPT-5. The new AI model offers improvements across the board, delivering better accuracy, reduced hallucinations, faster performance, and more. GPT-5 is rolling out today to Plus, Pro, Team, and free users, with Enterprise and Edu subscribers gaining access in one week. In July, OpenAI CEO Sam Altman confirmed that the company’s highly anticipated new AI model, GPT-5, was nearing release. A report later claime

OpenAI’s GPT-5 is here

OpenAI has launched GPT-5, a new flagship AI model that will power the company’s next generation of ChatGPT. GPT-5, which was released Thursday, is OpenAI’s first “unified” AI model and combines the reasoning abilities of its o-series of models with the fast responses of its GPT series. The next-generation model signals a new era for ChatGPT — and its creator, OpenAI — pointing to OpenAI’s broader ambitions to develop AI systems that are more like agents than chatbots. While GPT-4 enabled AI c

High costs and thin margins threatening AI coding startups

In February, AI coding startup Windsurf was in talks to raise a big new round at a $2.85 billion valuation led by Kleiner Perkins, at double the valuation it hit six months earlier, sources told TechCrunch at the time. That deal didn’t happen, according to a source familiar with the matter. Instead, news broke in April that the startup planned to sell itself to OpenAI for roughly the same valuation: $3 billion. While that deal famously fell apart, one bigger question remains: If the startup was

OpenAI's new open-source model is basically Phi-5

OpenAI just released its first ever open-source large language models, called gpt-oss-120b and gpt-oss-20b. You can talk to them here. Are they good models? Well, that depends on what you’re looking for. They’re great at some benchmarks, of course (OpenAI would never have released them otherwise) but weirdly bad at others, like SimpleQA. Some people really like them. Others on Twitter really don’t. From what I can tell, they’re technically competent but lack a lot of out-of-domain knowledge: fo

ChatGPT's GPT-5 models released: everything you need to know

After a long wait, GPT-5 is finally rolling out. It's available for free, Plus, Pro and Team users today. This means everyone gets to try GPT-5 today, but paid users get higher limits. In a blog post, OpenAI says GPT-5 is a big leap compared to previous models. OpenAI added that GPT-5 is the best coding model, and early benchmarks suggest it beats Opus 4.1 from Claude by a small margin, but real-life benchmarks are awaited. Unlike previous models, GPT-5 has built-in reasoning. It is a unifie