Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: del Clear Filter

Apple study shows LLMs also benefit from the oldest productivity trick in the book

In a new study co-authored by Apple researchers, an open-source large language model (LLM) saw big performance improvements after being told to check its own work by using one simple productivity trick. Here are the details. A bit of context After an LLM is trained, its quality is usually refined further through a post-training step known as reinforcement learning from human feedback (RLHF). With RLHF, every time a model gives an answer, human labelers can either give it a thumbs up, which re

New AI attack hides data-theft prompts in downscaled images

Researchers have developed a novel attack that steals user data by injecting malicious prompts in images processed by AI systems before delivering them to a large language model. The method relies on full-resolution images that carry instructions invisible to the human eye but become apparent when the image quality is lowered through resampling algorithms. Developed by Trail of Bits researchers Kikimora Morozova and Suha Sabi Hussain, the attack builds upon a theory presented in a 2020 USENIX

Something Extremely Scary Happens When Advanced AI Tries to Give Medical Advice to Real World Patients

Image by Getty / Futurism Developments Last week, Google AI pioneer Jad Tarifi sparked controversy when he told Business Insider that it no longer makes sense to get a medical degree — since, in his telling, artificial intelligence will render such an education obsolete by the time you're a practicing doctor. Companies have long touted the tech as a way to free up the time of overworked doctors and even aid them in specialized skills, including scanning medical imagery for tumors. Hospitals ha

How to make things slower so they go faster

Synchronized demand is the moment a large cohort of clients acts almost together. In a service with capacity $\mu$ requests per second and background load $\lambda_0$, the usable headroom is $H = \mu - \lambda_0 > 0$. When $M$ clients align—after a cache expiry, at a cron boundary, or as a service returns from an outage—the bucketed arrival rate can exceed $H$ by large factors. Queues form, timeouts propagate, retries synchronize, and a minor disturbance becomes a major incident. The task is to

We Tested All the Cheapest Meal Kits and Prepared Meal Services. These Are the Best

Our meal delivery experts have logged more than five years of cooking more than 75 meal kits and tasting upwards of 100 prepared meals, from popular meal delivery services to help land on our list of the best cheap meal subscriptions. CNET staff -- not advertisers, partners or business interests -- determine how we review products and services. If you buy through our links, we may earn a commission. Life’s busier than ever -- at least it seems that way -- and carving out time to plan a week’s

Pixel-art cyberpunk platformer Replaced delayed again, this time until spring 2026

The long-awaited cyberpunk platformer Replaced has been delayed once again. Now it's not coming out until next spring. Developer Sad Cat Studio apologized for the delay and said that this is simply "what's right for the project" and that the team remains dedicated to creating a "truly special experience." This isn't the first time the game has suffered a delay. It was originally announced back in 2021, with a pending release date of 2022. That didn't happen. In 2024, the company announced a rel

This new delivery robot will bring the entire grocery store to you

is transportation editor with 10+ years of experience who covers EVs, public transportation, and aviation. His work has appeared in The New York Daily News and City & State. Posts from this author will be added to your daily email digest and your homepage feed. A new company aims to take the idea of sidewalk delivery robots and supersize it. Los Angeles-based Robomart unveiled its new delivery robot Monday, with the goal of making “on‑demand delivery work economically.” The level-four autonom

How to Make Things Slower So They Go Faster

Synchronized demand is the moment a large cohort of clients acts almost together. In a service with capacity $\mu$ requests per second and background load $\lambda_0$, the usable headroom is $H = \mu - \lambda_0 > 0$. When $M$ clients align—after a cache expiry, at a cron boundary, or as a service returns from an outage—the bucketed arrival rate can exceed $H$ by large factors. Queues form, timeouts propagate, retries synchronize, and a minor disturbance becomes a major incident. The task is to

Robomart unveils new delivery robot with $3 flat fee to challenge DoorDash, Uber Eats

Robomart, a startup that builds self-driving delivery robots, is unveiling its latest robot with an ambitious goal of using it to make on-demand delivery profitable. The Los Angeles-based company announced its patented Robomart RM5 on Monday. The level-four autonomous vehicle can carry up to 500 pounds and is made up of 10 individual lockers that hold customer orders. This structure is designed to allow for batch ordering so a robot can work on multiple deliveries at the same time. Robomart pl

Why I'm Now Running Enterprise AI on My Laptop (Without Internet)

Beyond the Cloud: Why I’m Now Running Enterprise AI on My Laptop (Without Internet) Klaudi 6 min read · 1 hour ago 1 hour ago -- Listen Share For years, I’ve been skeptical of the AI revolution. Not because I don’t believe in its potential, but because the how always felt broken. The promise of “AI for everyone” came with a hidden cost: my data, my privacy, and my wallet. Every time I used a cloud based AI service, I’d ask myself: Where is my data going? Who owns it? How much will this cost me

IBM and NASA Develop a Digital Twin of the Sun to Predict Future Solar Storms

The Sun’s most complex mysteries could soon be solved thanks to artificial intelligence. On August 20, IBM and NASA announced the launch of Surya, a foundation model for the sun. Having been trained on large datasets of solar activity, this AI tool aims to deepen humanity’s understanding of solar weather and accurately predict solar flares—bursts of electromagnetic radiation emitted by our star that threaten both astronauts in orbit and communications infrastructure on Earth. Surya was trained

Elon Musk says xAI has open sourced Grok 2.5

In Brief Elon Musk’s xAI has made an older version of its AI model Grok — specifically, the model weights used to shape Grok 2.5 — available on the open source platform Hugging Face. “The @xAI Grok 2.5 model, which was our best model last year, is now open source,” Musk wrote on X. He added that Grok 3 “will be made open source in about 6 months.” AI engineer Tim Kellogg described the Grok license as “custom with some anti-competitive terms.” Grok, which is prominently featured on X (which i

You can now download and tweak Grok 2.5 for yourself as it goes open source

Unhinged as Grok may be, it's now open source. xAI's CEO, Elon Musk, posted on X that the company made the older Grok 2.5 model available to the public and will do the same with the upcoming Grok 3. For now, anyone can download, run and even tweak Grok, whose source code was uploaded to the Hugging Face platform. However, there are restrictions to xAI's open-source license, which doesn't let people use Grok to train, create or improve other AI models. It's not the first time xAI has made its mo

The Hidden Ingredients Behind AI’s Creativity

The original version of this story appeared in Quanta Magazine. We were once promised self-driving cars and robot maids. Instead, we’ve seen the rise of artificial intelligence systems that can beat us in chess, analyze huge reams of text, and compose sonnets. This has been one of the great surprises of the modern era: physical tasks that are easy for humans turn out to be very difficult for robots, while algorithms are increasingly able to mimic our intellect. Another surprise that has long p

ThinkMesh: A Python lib for parallel thinking in LLMs

ThinkMesh ThinkMesh is a python library for running diverse reasoning paths in parallel, scoring them with internal confidence signals, reallocates compute to promising branches, and fuses outcomes with verifiers and reducers. It works with offline Hugging Face Transformers and vLLM/TGI, and with hosted APIs. Note: This is still in it's early development phase and breaking changes can sometimes occur Highlights Parallel reasoning with DeepConf‑style confidence gating and budget reallocation

AGI is an engineering problem, not a model training problem

Published: Aug 13, 2025 | at 11:00 AM We’ve reached an inflection point in AI development. The scaling laws that once promised ever-more-capable models are showing diminishing returns. GPT-5, Claude, and Gemini represent remarkable achievements, but they’re hitting asymptotes that brute-force scaling can’t solve. The path to artificial general intelligence isn’t through training ever-larger language models—it’s through building engineered systems that combine models, memory, context, and determ

Evaluating LLMs for my personal use case

Most models are excellent, so cost and latency dominate. It’s great that AI can win maths Olympiads, but that’s not what I’m doing. I mostly ask basic Rust, Python, Linux and life questions. So I did my own evaluation. I gathered 130 real prompts from my bash history (I use command line tool llm). I had Qwen3 235B Thinking and Gemini 2.5 Pro group them into categories. They both chose very similar ones, broadly (with examples): Programming - “Write a bash script to ..” Sysadmin - “With curl

ChatGPT-5 Impressions: Fast, but a Bit Impersonal

ChatGPT-5 likes getting straight to the point. For some, that's a reprieve from its chattier predecessor, GPT-4o. For others, something will certainly seem off. Despite the hype leading up to OpenAI's launch of GPT-5, it ultimately doesn't feel too different from 4o. The quality of responses in my ongoing testing seems to be at the level of past models, including the o3 "reasoning" model. The major difference is that some responses generate very quickly in relatively few words, while others ca

‘Zipotle’: Chipotle Wants to Drop a Burrito on Your Head with New Drone Delivery

Chipotle says it’s jumping on the drone delivery bandwagon. This week, the company announced a new pilot program in Texas that will test the viability of airdropping customers’ orders into their backyards for them. A press release published Thursday makes the whole process sound quite easy as far as drone-robot hybrid delivery burritos go. Certain customers in the Rowlette suburb of Dallas will be able to download the app for Zipline, Chipotle’s new drone partner, and place their orders. Then,

OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The framework, called OpenCUA, includes the tools, data, and recipes for scaling the development of computer-use agents (CUAs). Models trained usin

Apple trained a large language model to efficiently understand long-form video

Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means. The nerdy bits Very basically, when an LLM is trained to also understand video, it learns to split videos into frames, apply computer vision to extract visual features, analyze how those features change over time, and align all of that with language so it can describe or reason about the video in the form of text. One v

College student’s “time travel” AI experiment accidentally outputs real 1834 history

A hobbyist developer building AI language models that speak Victorian-era English "just for fun" got an unexpected history lesson this week when his latest creation mentioned real protests from 1834 London—events the developer didn't know had actually happened until he Googled them. "I was interested to see if a protest had actually occurred in 1834 London and it really did happen," wrote Reddit user Hayk Grigorian, who is a computer science student at Muhlenberg College in Pennsylvania. For t

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now The adoption of interoperability standards, such as the Model Context Protocol (MCP), can provide enterprises with insights into how agents and models function outside their walled confines. However, many benchmarks fail to capture real-life interactions with MCP. Salesforce AI Research developed a new open-source benchmark it calls MCP-Un

Apple reportedly in talks with Google to make Gemini the foundation of Siri

Ryan Haines / Android Authority TL;DR Apple reportedly approached Google to explore developing a custom AI model to serve as Siri’s foundation. This AI model may already be in training to run on Apple’s servers. The Cupertino-based firm is said to be developing a version of Siri powered by its models and a second version that runs on third-party technology. Apple has long been trailing behind its competitors in the AI space. In fact, the company was expected to launch a revamped version of S

Apple considers Google Gemini to power next-gen Siri, internal AI ‘bake-off’ underway

Apple seems open to anything and everything when it comes to delivering the next generation of Siri. After reports that it could be powered by OpenAI or Anthropic, Google has entered the conversation. Mark Gurman reports for Bloomberg that Google, which offers a ChatGPT competitor called Gemini, is actually training a model that could run on Apple’s servers to power the new Siri experience: The iPhone maker recently approached Alphabet Inc.’s Google to explore building a custom AI model that w

Deals: Amazon Labor Day sale, most affordable 24GB M4 MacBook Pro $300 off, Apple Watch Ultra 2 $150 off, more

Today’s 9to5Toys Lunch Break deals are kicking off with the now live Amazon Labor Day sale where you’ll find plenty of big-time Apple gear deals. All of the details you need on that are right here, but we are also once again tracking the 24GB/512GB M3 MacBook Air for $400 less than the M4 model (it is now back in stock) alongside the most affordable M4 MacBook Pro with 24GB of RAM at $300 off, and Apple Watch Ultra 2 models at $150 off the list prices. All of that and more awaits below. Looking

The case against humans in space

“Going to space will not end war because war isn’t caused by anything that space travel is apt to change, even in the most optimistic scenarios,” they write. “Humanity going to space en masse probably won’t reduce the likelihood of war, but we should consider that it might increase the chance of war being horrific.” The pair imagine rival space nations exchanging asteroid fire or poisoning whole biospheres. Proponents of space settlements often point to the fate of the dinosaurs as motivational

GPT-5 usage limitations: what are they, how does this compare to GPT-4 family?

Edgar Cervantes / Android Authority GPT-5 arrived a few weeks ago, though its rollout hasn’t been entirely smooth. While the model shows plenty of promise, its debut also meant the abrupt removal of every other GPT model from ChatGPT’s user-facing UI. Since then, some old models have returned, and there have been a few other changes to the way the system works. Furthermore, many of the initial GPT-5 usage limits have been temporarily enhanced since launch. Let’s dive in and take a closer look

Tesla Will Use a Powerful New Weapon in AI Race

Tesla will partner up with AI darlings DeepSeek and Bytedance’s Doubao on new tools in its cars in China, according to a document uploaded to Tesla’s official website. Bloomberg reports that Doubao will work on voice command-related tools like the temperature in a Tesla, navigation, and in-car entertainment, while DeepSeek will handle the AI side of things. The move may be a way for Tesla to boost its Chinese deliveries, which dropped 8.4% from December to June compared to the same time period

24,000-Watt Scooter Is Going for a 100 MPH Speed Record at Bonneville

Get The Drive’s daily newsletter The latest car news, reviews, and features. Email address Sign Up Thank you! Terms of Service & Privacy Policy. A British outfit called Bo makes a sleek little electric scooter called the Model-M. The best variant claims a 22 mph top speed and a 40-mile range for $2,500. Sounds pricey but practical. The prototype Turbo model is another story—it’s gunning for triple-digit speeds at the Bonneville salt flats next month. Bo has been around since 2022, but seems to