GoKawiil - Latest Tech News & Aggregated Headlines

How attention sinks keep language models stable

news.ycombinator.com Guangxuan Xiao 2025-12-02 16:53:10

We discovered why language models catastrophically fail on long conversations: when old tokens are removed to save memory, models produce complete gibberish. We found models dump massive attention onto the first few tokens as "attention sinks"—places to park unused attention since softmax requires weights to sum to 1. Our solution, StreamingLLM, simply keeps these first 4 tokens permanently while sliding the window for everything else, enabling stable processing of 4 million+ tokens instead of j

Topics: attention models sink sinks tokens
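
The cache policy summarized in the StreamingLLM entry above (pin the first 4 tokens, slide a window over the rest) can be illustrated with a toy sketch. This is not the authors' implementation: the class name `ToySinkWindow`, the window size of 8, and the use of integer token ids in place of per-layer key/value tensors are all assumptions made for illustration.

```python
from collections import deque
from typing import Deque, List


class ToySinkWindow:
    """Hypothetical toy cache: the first `n_sink` tokens are kept forever,
    everything after them lives in a fixed-size sliding window."""

    def __init__(self, n_sink: int = 4, window: int = 8) -> None:
        self.n_sink = n_sink
        self.sinks: List[int] = []                      # never evicted
        self.recent: Deque[int] = deque(maxlen=window)  # oldest entry drops out when full

    def append(self, token_id: int) -> None:
        if len(self.sinks) < self.n_sink:
            self.sinks.append(token_id)
        else:
            self.recent.append(token_id)

    def visible(self) -> List[int]:
        """Tokens the model could still attend to under this policy."""
        return self.sinks + list(self.recent)


# Stream 100 token ids through a cache with 4 sinks and a window of 8 (assumed sizes).
cache = ToySinkWindow(n_sink=4, window=8)
for token_id in range(100):
    cache.append(token_id)

kept = cache.visible()
print(kept)
```

However many tokens are streamed, the cache holds at most `n_sink + window` entries, and the first four never leave it. A plain sliding window, by contrast, would have evicted them as soon as the window filled.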

The surprise deprecation of GPT-4o for ChatGPT consumers

news.ycombinator.com Simon Willison 2025-12-03 04:04:26

I’ve been dipping into the r/ChatGPT subreddit recently to see how people are reacting to the GPT-5 launch, and so far the vibes there are not good. This AMA thread with the OpenAI team is a great illustration of the single biggest complaint: a lot of people are very unhappy to lose access to the much older GPT-4o, previously ChatGPT’s default model for most users. A big surprise for me yesterday was that OpenAI simultaneously retired ac

Topics: 4o chatgpt gpt model users

Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?

news.ycombinator.com Unknown 2025-12-03 05:27:28

Sam said yesterday that ChatGPT handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or painfully slow speeds. Sure, they have huge GPU clusters, but there must be more going on: model optimizations, sharding, custom hardware, clever load balancing, etc. What engineering tricks make this possible at such massive scale while keeping latency low? Curious to hear insights from people who've built large-scale ML systems.

Topics: 700m built class model scale

OpenAI’s GPT-5 rollout is not going smoothly

venturebeat.com Carl Franzen 2025-12-03 09:40:57

The launch of OpenAI’s long-anticipated new model, GPT-5, is off to a rocky start to say the least. Even forgiving errors in charts and voice demos during yesterday’s livestreamed presentation of the new model (actually four separate models, and a ‘Thinking’ mode that can be engaged for three of them), a number of user reports have emerge

Topics: 2025 ai gpt model users

I want everything local – Building my offline AI workspace

news.ycombinator.com Unknown 2025-12-03 10:19:05

I want everything local — no cloud, no remote code execution. That’s what a friend said. That one-line requirement, albeit simple, would need multiple things to work in tandem to make it happen. What does a mainstream LLM (Large Language Model) chat app like ChatGPT or Claude provide at a high level? Ability to use chat with a cloud hosted LLM, Ability to run code generated by them mostly on their cloud infra, sometimes locally via shell, Ability to access the internet for new content or se

Topics: code container local models tool

OpenAI priced GPT-5 so low, it may spark a price war

techcrunch.com Julie Bort 2025-12-03 20:10:04

OpenAI astounded the tech industry for the second time this week by launching its newest flagship model, GPT-5, just days after releasing two new freely available models under an open source license. OpenAI CEO Sam Altman went so far as to call GPT-5 “the best model in the world.” That may be pride or hyperbole, as TechCrunch’s Maxwell Zeff reports that GPT-5 only slightly outperforms other leading AI models from Anthropic, Google DeepMind, and xAI on some key benchmarks, and slightly lags on o

Topics: gpt model openai pricing tech

Google Working on Fix for Glum Gemini, Stuck in 'Infinite Loop' of Self-Esteem Issues

cnet.com Unknown 2025-12-03 23:14:00

Maybe Google Gemini needs to take some PTO. The company's large language AI model, which is increasingly spreading across Google's many services and products, has been saying some things lately that are leading users to worry: Does Gemini have low self esteem? A series of posts on social media showing some of the self-critical responses Gemini has given users show a disturbing pattern. For instance, in one screenshot, Gemini admits it can't solve a coding problem and concludes, "I have failed.

Topics: ai disgrace gemini google model

Bank of England chief says no rift with UK government as Revolut licence delay draws scrutiny

cnbc.com Ryan Browne 2025-12-04 05:01:23

LONDON — Bank of England Governor Andrew Bailey told CNBC there hasn't been a "falling out" with the U.K. government over delays to fintech giant Revolut's long-awaited bank license. Last week, the Financial Times reported that a meeting arranged by British Finance Minister Rachel Reeves with Revolut and the Prudential Regulation Authority (PRA) — an arm of the BOE that oversees banks — was cancelled af

Topics: authority bailey bank delays revolut

Apple will bring GPT-5 to Apple Intelligence in iOS, iPad OS and macOS 26

engadget.com Unknown 2025-12-04 05:32:11

OpenAI finally released its long-awaited GPT-5 model this week, unsurprisingly proclaiming it its best yet with regards to coding, accuracy, safety and more. CEO Sam Altman even compared the jump up in quality to when the iPhone first adopted a Retina display in a press briefing ahead of the announcement. Big talk indeed. Given ChatGPT’s integration with Apple Intelligence, you might be wondering when the latest model will arrive on the devices that support it. The answer is sooner rather than

Topics: apple chatgpt gpt intelligence model

How Google's Genie 3 could change AI video - and let you build your own interactive worlds

zdnet.com Webb Wright 2025-12-04 10:41:43

ZDNET's key takeaways: World models could help to advance AI research, entertainment, etc. Genie 3, Google DeepMind's world model, debuted on Tuesday. Google DeepMind says Genie 3 has an "understanding" of the world. Imagine exploring a virtual environment without boundaries, where everything you see looks and behaves just as it would in reality. This is precisely what many tech developers today are working to create through AI "world models," or algorithms that can build and act up

Topics: ai genie models video world

South Korea charts one-of-a-kind course in AI race with U.S. and China

cnbc.com Arjun Kharpal 2025-12-04 15:19:16

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

Here’s everything OpenAI announced at its GPT-5 event

9to5mac.com Marcus Mendes 2025-12-04 15:40:18

During an uncharacteristically long video stream yesterday, OpenAI announced GPT-5, alongside a series of interface and usability improvements to its chatbot. Here’s everything new with ChatGPT. One model to rule them all: After many years of confusion with similarly named models imbued with overlapping abilities, OpenAI finally streamlined the user experience and trimmed down its model offerings to GPT-5, GPT-5 Thinking, and GPT-5 Pro (limited to the US$200/mo plan). OpenAI says that ChatGPT wil

Topics: chatgpt gpt model openai users

South Korea launches national AI model in tech race with U.S. and China

cnbc.com Arjun Kharpal 2025-12-04 22:51:17

Ryu Young-sang, CEO of South Korean telecoms giant SK Telecom, told CNBC that AI is helping telecoms firms improve efficiency in their networks. South Korea has tasked some of its biggest companies and promising startups to build a national foundational AI model using mainly domestic technology, in a rare move to keep the country apace with the U.S. and China. The project will feature

Topics: ai korea model models sk

Microsoft rolls out GPT-5 across its Copilot suite - here's where you'll find it

zdnet.com David Gewirtz 2025-12-05 02:51:00

ZDNET's key takeaways: Microsoft is rolling out GPT-5 to all its AI offerings, Thursday. The Copilot chatbot will provide GPT-5, even to free users. GPT-5 will also be available to coding and enterprise tools. OpenAI released its much-anticipated upgrade to the engine that powers ChatGPT and many other AI implementations, including Microsoft's AI offerings, on Thursday. Concurrent with the GPT-5 release, Microsoft announced that it is upgrading its consu

Topics: ai copilot gpt microsoft model

FLUX.1-Krea and the Rise of Opinionated Models

news.ycombinator.com Drew Breunig 2025-11-30 20:14:02

AI-generated images have a general look to them. Shiny, bright, waxy-skin, and over-use of bokeh. From Midjourney to Gemini to OpenAI, the AI Look is consistent. Enthusiasts and professionals wrestle with prompts and even fine-tune these models to tamp down the AI smell, with varying degrees of success. Examples of the "AI Look", provided by Krea in their technical paper. Last week, Krea launched an open model, FLUX.1-Krea, that’s built to avoid the “AI Look”. Their writeup is tremendous: it d

Topics: ai image krea model models

How Attention Sinks Keep Language Models Stable

news.ycombinator.com Guangxuan Xiao 2025-12-04 22:53:10

We discovered why language models catastrophically fail on long conversations: when old tokens are removed to save memory, models produce complete gibberish. We found models dump massive attention onto the first few tokens as "attention sinks"—places to park unused attention since softmax requires weights to sum to 1. Our solution, StreamingLLM, simply keeps these first 4 tokens permanently while sliding the window for everything else, enabling stable processing of 4 million+ tokens instead of j

Topics: attention models sink sinks tokens

Rocket Report: Firefly lights the markets up; SpaceX starts selling trips to Mars

arstechnica.com Unknown 2025-12-05 02:00:14

Welcome to Edition 8.06 of the Rocket Report! After years of disappointing results from SPACs and space companies, it is a good sign to see Firefly's more traditional initial public offering doing so well. The company has had such a long and challenging road over more than a decade; the prospect of their success should be heartening to the commercial space industry. As always, we welcome reader submissions. If you don't want to miss an issue, please subscribe using the box below (the form will

Topics: 2026 company delta firefly space

Microsoft’s new Copilot 3D feature is great for Ikea, bad for my dog

theverge.com Tom Warren 2025-12-05 02:00:26

While Microsoft was busy updating Copilot yesterday with OpenAI’s new GPT-5 model, it also quietly launched Copilot 3D. It’s a free-to-use feature that can transform a regular 2D image into a 3D model that can then be used in game creation, animation, 3D printing, VR / AR, and much more. C

Topics: 3d copilot image images model

Benchmarking GPT-5 on 400 real-world code reviews

news.ycombinator.com Dedy Kredo 2025-12-05 02:00:22

GPT-5 is now available in Qodo’s platform for all free and paid users. Get started today. At Qodo, we believe benchmarks should reflect how developers actually work. That’s why we built the PR Benchmark—a benchmark designed to assess how well language models handle tasks like code review, suggesting improvements, and understanding developer intent. Unlike many public benchmarks, the PR Benchmark is private, and its data is not publicly released. This ensures models haven’t seen it during train

Topics: benchmark code gpt model review

ChatGPT users dismayed as OpenAI pulls popular models GPT-4o, o3 and more — enterprise API remains (for now)

venturebeat.com Emilia David 2025-12-05 00:45:03

After announcing the release of its newest flagship model family, GPT-5, OpenAI said the model will power all of ChatGPT, and that it will sunset the existing models in the chat platform. OpenAI, through a spokesperson, told VentureBeat that GPT-5 “will replace all other models in ChatGPT, so users don’t have to pick depending on each task

Topics: 4o gpt model models openai

Microsoft rolls out GPT-5 across its Copilot suite - here's what we know

zdnet.com David Gewirtz 2025-12-05 03:01:02

ZDNET's key takeaways: Microsoft is rolling out GPT-5 to all its AI offerings, Thursday. The Copilot chatbot will provide GPT-5, even to free users. GPT-5 will also be available to coding and enterprise tools. OpenAI released its much-anticipated upgrade to the engine that powers ChatGPT and many other AI implementations, including Microsoft's AI offerings, Thursday. Concurrent with the GPT-5 release, Microsoft announced that it is upgrading i

Topics: ai copilot gpt microsoft model

Tesla exec leading development of chip tech and Dojo supercomputer is leaving company

cnbc.com Lora Kolodny 2025-12-05 05:42:06

Tesla's vice president of hardware design engineering, Pete Bannon, is leaving the company after first joining in 2016 from Apple, CNBC has confirmed. Bannon was leading the development of Tesla's Dojo supercomputer and reported directly to Musk. Bloomberg first reported on Bannon's departure, and added that Musk ordered his team to shut down, with engineers in the group getting reassigned to other initiatives. Tesla didn't immediately respond to a request for comment. Since early last year,

Topics: company models musk said tesla

Achieving 10,000x training data reduction with high-fidelity labels

news.ycombinator.com Unknown 2025-12-05 03:11:20

Classifying unsafe ad content has proven an enticing problem space for leveraging large language models (LLMs). The inherent complexity involved in identifying policy-violating content demands solutions capable of deep contextual and cultural understanding, areas of relative strength for LLMs over traditional machine learning systems. But fine-tuning LLMs for such complex tasks requires high-fidelity training data that is difficult and expensive to curate at the necessary quality and scale. Stan

Topics: content data llms model training

GPT-5: Key characteristics, pricing and system card

news.ycombinator.com Simon Willison 2025-12-04 22:46:18

I’ve had preview access to the new GPT-5 model family for the past two weeks (see related video) and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM—it’s not a dramatic departure from what we’ve had before—but it rarely screws up and generally feels competent or occasionally impressive at the kinds of things I like to use models for. I’ve collected a lot of notes over the past two weeks, so I’ve decided

Topics: 00 gpt mini model models

GPT-5: Here's What's New in ChatGPT's Big Update

cnet.com Unknown 2025-12-05 08:50:25

Expect your ChatGPT experience to get faster and smarter today. OpenAI updated its flagship line of large language models Thursday, unveiling the GPT-5 generative AI model after months of anticipation. While the developer has released a lot of model updates in recent months, including new open-weights models just this week, it's been more than two years since the debut of GPT-4. With a new generation worthy of a new number, how big of a change should you expect? "I tried going back to GPT-4 an

Topics: gpt model openai said users

OpenAI’s most powerful AI model is here and free for everyone

androidauthority.com Unknown 2025-12-05 15:54:57

TL;DR: OpenAI has announced the launch of GPT-5. The new AI model offers improvements across the board, delivering better accuracy, reduced hallucinations, faster performance, and more. GPT-5 is rolling out today to Plus, Pro, Team, and free users, with Enterprise and Edu subscribers gaining access in one week. In July, OpenAI CEO Sam Altman confirmed that the company’s highly anticipated new AI model, GPT-5, was nearing release. A report later claime

Topics: gpt model new openai users

OpenAI’s GPT-5 is here

techcrunch.com Maxwell Zeff 2025-12-05 10:00:00

OpenAI has launched GPT-5, a new flagship AI model that will power the company’s next generation of ChatGPT. GPT-5, which was released Thursday, is OpenAI’s first “unified” AI model and combines the reasoning abilities of its o-series of models with the fast responses of its GPT series. The next-generation model signals a new era for ChatGPT — and its creator, OpenAI — pointing to OpenAI’s broader ambitions to develop AI systems that are more like agents than chatbots. While GPT-4 enabled AI c

Topics: ai gpt model models openai

High costs and thin margins threatening AI coding startups

techcrunch.com Marina Temkin 2025-12-05 15:05:01

In February, AI coding startup Windsurf was in talks to raise a big new round at a $2.85 billion valuation led by Kleiner Perkins, at double the valuation it hit six months earlier, sources told TechCrunch at the time. That deal didn’t happen, according to a source familiar with the matter. Instead, news broke in April that the startup planned to sell itself to OpenAI for roughly the same valuation: $3 billion. While that deal famously fell apart, one bigger question remains: If the startup was

Topics: ai anysphere coding model startup

OpenAI's new open-source model is basically Phi-5

news.ycombinator.com Unknown 2025-12-05 17:59:46

OpenAI just released its first ever open-source large language models, called gpt-oss-120b and gpt-oss-20b. You can talk to them here. Are they good models? Well, that depends on what you’re looking for. They’re great at some benchmarks, of course (OpenAI would never have released them otherwise) but weirdly bad at others, like SimpleQA. Some people really like them. Others on Twitter really don’t. From what I can tell, they’re technically competent but lack a lot of out-of-domain knowledge: fo

Topics: data model models source synthetic

ChatGPT's GPT-5 models released: everything you need to know

bleepingcomputer.com Unknown 2025-12-06 00:03:32

After a long wait, GPT-5 is finally rolling out. It's available for free, Plus, Pro and Team users today. This means everyone gets to try GPT-5 today, but paid users get higher limits. In a blog post, OpenAI says GPT-5 is a big leap compared to previous models. OpenAI added that GPT-5 is the best coding model, and early benchmarks suggest it beats Opus 4.1 from Claude by a small margin, but real-life benchmarks are awaited. Unlike previous models, GPT-5 has built-in reasoning. It is a unifie

Topics: gpt model openai reasoning unified

About GoKawiil

Privacy

Advertising
