Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Generative AI's crippling failure to induce robust models of the world

Synthesized video from Dawid van Straaten, prompt (“Generate me a video of two men playing chess”) in which the player for black reaches across the table and, in the midst of a rather unusual position, moves his opponent’s pawn horizontally, and quite illegally, several squares across the board. A few weeks ago, I had the singular honor of recording a podcast (to be released soon) with one of my heroes, Garry Kasparov, not only one of the greatest chess players of all time, but also one of the b

Tesla says it made its first driverless delivery of a new car to a customer

Tesla CEO Elon Musk said the automaker completed its first driverless delivery of a new car to a customer, routing a Model Y SUV from the company's Austin, Texas, Gigafactory to an apartment building in the area on June 27. The Tesla account on social network X, which is also owned by Musk, shared a video overnight showing the Model Y traversing public roads in Austin, including highways, with no human in the driver's seat or front passenger seat of the car. Tesla did not say which version of

CTGT wins Best Presentation Style award at VB Transform 2025

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more San Francisco-based CTGT, a startup focused on making AI more trustworthy through feature-level model customization, won the Best Presentation Style award at VB Transform 2025 in San Francisco. Founded by 23-year-old Cyril Gorlla, the company showcased how its technology helps enterprises overcome AI trust barriers by directly modifying mo

From hallucinations to hardware: Lessons from a real-world computer vision project gone sideways

Computer vision projects rarely go exactly as planned, and this one was no exception. The idea was simple: Build a model that could look at a photo of a laptop and identify any physical damage (things like cracked screens, missing keys or broken hinges). It seemed like a straightforward use case for image models and large language models (

Evaluating Long-Context Question and Answer Systems

While evaluating Q&A systems is straightforward with short paragraphs, complexity increases as documents grow larger, for example with technical documentation, novels and movies, as well as multi-document scenarios. Although some of these evaluation challenges also appear in shorter contexts, long-context evaluation amplifies issues such as: Information overload: Irrelevant details in large documents obscure relevant facts, making it harder for retrievers and models to locate the right evidence for

Life of an inference request (vLLM V1): How LLMs are served efficiently at scale

Junhao Li, Senior Software Engineer. Ubicloud is an open source alternative to AWS. We offer managed cloud services that build on top of PostgreSQL, Kubernetes, vLLM, and others. vLLM is an open-source inference engine that serves large language models. We deploy multiple vLLM instances across GPUs and load open-weight models like Llama 4 into them. We then load balance traffic across vLLM instances, run health

OpenAI Loses 4 Key Researchers to Meta

Four OpenAI researchers are leaving the company to go to Meta, two sources confirm to WIRED. Shengjia Zhao, Shuchao Bi, Jiahui Yu, and Hongyu Ren have joined Meta’s superintelligence team. Their OpenAI Slack profiles have been deactivated. The Information first reported on the departures. It’s the latest in a series of aggressive moves by Mark Zuckerberg, who is racing to catch up to OpenAI, Anthropic and Google in building artificial general intelligence. Earlier this month, OpenAI CEO Sam Al

Nvidia RTX 5060 Ti 8GB vs. 16GB Tested Across PCIe 3.0, 4.0 and 5.0

Recently we examined how PCI Express bandwidth influences the performance of the 8 GB Radeon RX 9060 XT when local video memory (VRAM) is exceeded. The entire purpose of that testing was to push past the VRAM limit, which, unfortunately for 8 GB graphics cards, is a relatively easy task in 2025. This can happen even when using settings that would otherwise be highly playable, as demonstrated by the 16 GB model. This is an interesting test for several reasons, the most notable being that PCIe ba

Anker issues another recall for multiple power banks that pose fire safety risk

Anker has issued its second recall this month for several power bank models sold around the world, as MacRumors has reported. If you'll recall, its previous recall that launched earlier this month focused on the Anker PowerCore 10000 power bank model A1263, which was sold between June 1, 2016 and December 31, 2022 in the United States. The company found that the lithium-ion battery it used for the model has a risk of overheating that could then lead to the power bank melting, producing smoke

Your BNPL Plans Could Soon Impact Your Credit Score. Here's When

Have you ever opted for Buy Now, Pay Later at the checkout? You'd hardly be the only one. About 86.5 million people used BNPL in 2024, according to Capital One's research. Starting later this year, your BNPL plans could start appearing on your credit report. You can use BNPL for just about everything now, from Costco purchases to DoorDash (although that doesn't mean you should). But the one thing BNPL couldn't do was improve your credit. Although some of Affirm's plans do rep

AlphaGenome: AI for Better Understanding the Genome

Introducing a new, unifying DNA sequence model that advances regulatory variant-effect prediction and promises to shed new light on genome function, now available via API. The genome is our cellular instruction manual. It’s the complete set of DNA which guides nearly every part of a living organism, from appearance and function to growth and reproduction. Small variations in a genome’s DNA sequence can alter a

Anker issues new global recall for five power bank models over fire hazard

Just days after issuing a voluntary recall for the Anker PowerCore 10000, the company has now issued a second, broader recall, this time with global reach and five additional power bank models affected. Anker says the move is a proactive step following recent improvements to its internal quality assurance systems. While the company claims the risk of malfunction is low, it’s recalling the affected models “out of an abundance of caution.” All five power banks flagged in the recall use lithium-io

Reinforcement learning, explained with a minimum of math and jargon

It’s Agent Week at Understanding AI! This week I’m going to publish a series of articles explaining the most important AI trend of 2025: agents! Today is a deep dive into reinforcement learning, the training technique that made agentic models like Claude 3.5 Sonnet and o3 possible. Today’s article is available for free, but some articles in the series—including tomorrow’s article on MCP and tool use—will be for paying subscribers only. I’m offering a 20 percent discount on annual subscriptions

Why your enterprise AI strategy needs both open and closed models: The TCO reality check

This article is part of VentureBeat’s special issue, “The Real Cost of AI: Performance, Efficiency and ROI at Scale.” For the last two decades, enterprises have had a choice between open-source and closed proprietary technologies. The original choice for enterprises was primarily centered on operating systems, with Linux offering an open-source alternative to Microsoft Windows. In the developer realm, open-source languages like Python and JavaScript dominate,

The rise of prompt ops: Tackling hidden AI costs from bad inputs and context bloat

This article is part of VentureBeat’s special issue, “The Real Cost of AI: Performance, Efficiency and ROI at Scale.” Model providers continue to roll out increasingly sophisticated large language models (LLMs) with longer context windows and enhanced reasoning capabilities. This allows models to process and “think” more, but it also increases compute: The more a model takes in and puts out, the more energy it expends and the higher the costs. Couple this wi

Model minimalism: The new AI strategy saving companies millions

This article is part of VentureBeat’s special issue, “The Real Cost of AI: Performance, Efficiency and ROI at Scale.” The advent of large language models (LLMs) has made it easier for enterprises to envision the kinds of projects they can undertake, leading to a surge in pilot programs now transitioning to deployment. However, as these projects gained momentum, enterprises realized that the earlier LLMs they had used were unwieldy and, worse, expensive. Ente

How runtime attacks turn profitable AI into budget black holes

This article is part of VentureBeat’s special issue, “The Real Cost of AI: Performance, Efficiency and ROI at Scale.” AI’s promise is undeniable, but so are its blindsiding security costs at the inference layer. New attacks targeting AI’s operational side are quietly inflating budgets, jeopardizing regulatory compliance and eroding customer trust, all of which threaten the return on investment (ROI) and total cost of ownership of enterprise AI deployments. AI

Normalizing Flows Are Capable Generative Models

Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In this work, we demonstrate that NFs are more powerful than previously believed. We present TarFlow: a simple and scalable architecture that enables highly performant NF models. TarFlow can be thought of as a Transformer-based variant of Masked Autoregressive Fl

The best deals on 4K TVs

Things are looking bright for those who want to nab a great TV in 2025 at a substantial discount. There’s usually a great deal happening on a mid- or high-end TV from LG, TCL, Hisense, or Amazon’s own Fire TV brand — even if the biggest discounts remain reserved for Black Friday, Cyber Monday, Amazon Prime Day, and during the lead-up to the Super Bowl. Right now, there are a number of discounted 4K TVs to choose from, spanning a wide variety of prices, sizes, and feature sets. Whether you want

How Highmark Health and Google Cloud are using Gen AI to streamline medical claims and improve care: 6 key lessons

Among the numerous educational and startlingly insightful panel discussions on AI enterprise integrations featuring industry leaders at VentureBeat’s Transform 2025 conference this week was one led by Google Cloud Platform Vice President and Chief Technology Officer (CTO) Will Grannis and Richard Clarke, Highmark Health’s Senior Vice Presi

Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models

Technical Notes: PILF (Predictive Integrity Learning Framework). Document Version: 3.0. Core Concept: A cognitive learning framework designed to transform fixed hyperparameters (like learning rate, model capacity) into dynamic policies driven in real time by the intrinsic "surprise" (Surprise) of data. It is essentially an adaptive hyperparameter scheduling algorithm that allows a model to autonomously decide "how much to learn" and "with what capacity to learn" based on the value of the learn

Apple Research unearthed a forgotten AI technique and is using it to generate images

Today, most generative image models basically fall into two main categories: diffusion models, like Stable Diffusion, or autoregressive models, like OpenAI’s GPT-4o. But Apple just released two papers that show how there might be room for a third, forgotten technique: Normalizing Flows. And with a dash of Transformers on top, they might be more capable than previously thought. First things first: What are Normalizing Flows? Normalizing Flows (NFs) are a type of AI model that works by learning

Introducing Gemma 3n

The first Gemma model launched early last year and has since grown into a thriving Gemmaverse of over 160 million collective downloads. This ecosystem includes our family of over a dozen specialized models for everything from safeguarding to medical applications and, most inspiringly, the countless innovations from the community. From innovators like Roboflow building enterprise computer vision to the Institute of Science Tokyo creating highly-capable Japanese Gemma variants, your work has shown

Google DeepMind Releases AlphaGenome

Introducing a new, unifying DNA sequence model that advances regulatory variant-effect prediction and promises to shed new light on genome function, now available via API. The genome is our cellular instruction manual. It’s the complete set of DNA which guides nearly every part of a living organism, from appearance and function to growth and reproduction. Small variations in a genome’s DNA sequence can alter a

Deals: Rare offer on 1TB M4 MacBook Air at $200 off, Apple Solo Loops $15, all-black Ocean Band 20% off, more

Alongside the ongoing $100 price drop on the 512GB M4 Mac mini, today we spotted a rare deal on the 24GB M4 MacBook Air with the upgraded 1TB SSD at $200 off the list price. From there we move over to some official Apple Watch band deals – the Solo/Braided Loop bands start from just $15 today and we have a rare savings opportunity on the official all-black Apple Ocean Band at 20% off. Those offers join a host of charging and accessory offers, some discounts on M4 iPad Pro, and more. Rare deal h

Insta360 has a cheaper Flow 2 gimbal for the masses

It duplicates many of the features from the Flow 2 Pro, which launched in January. Insta360's new gimbal isn't quite "Pro," but its pricing isn't, either. The Flow 2 includes many of the features from the Flow 2 Pro while costing $50 less. The Insta360 Flow 2 ticks most of the boxes that its Pro sibling does. (The more expensive gimbal launched earlier this year.) Like that model, the Flow 2 features a built-in selfie stick and a tripod. It supports advanced subject tracking, golden ratio subj

Nvidia DLSS 4 transformer model exits beta, set to bring improved graphics to more games

Why it matters: Most people think of multi-frame generation when they hear about Nvidia DLSS 4, but the transformer model upgrade in DLSS Super Resolution might be the update's most consequential upgrade. Many games can already benefit from the feature, and it's likely to become the standard across upcoming releases. The latest version of Nvidia's DLSS Super Resolution and Ray Reconstruction SDK, r

Experience making a 1-minute AI movie with my 7-year-old daughter

My daughter Kate (7 years old) really loves Minecraft! Together, we used several generative AI tools to create a 1-minute animation based on only 1 input photo of her. The whole project took around 20 hours of work and I learned several lessons that I want to share here. Context: I am still trying to get used to the enormous speed with which generative AI is progressing. 6 months ago, I was blogging about my experiments with Tencent’s Hunyuan Video, which was an absolute breakthrough at that ti