Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: claudius Clear Filter

Anthropic Let an AI Agent Run a Small Shop and the Result Was Unintentionally Hilarious

Anthropic ran an experiment where its Claude chatbot was put in charge of a tiny, automated "shop" inside its San Francisco headquarters — and the results were nothing short of hilarious. Despite claims in an Anthropic post that "Claudius," the name given to the AI agent in charge of stocking the shop's shelves, was "close to success," everything about the gambit seems to demonstrate just how bad AI is at managing things in the real world. Dubbed "Project Vend," the month-long experiment was u

What happened when Anthropic's Claude AI ran a small shop for a month (spoiler: it got weird)

Daniel Grizelj/Getty Images Large language models (LLMs) handle many tasks well -- but at least for the time being, running a small business doesn't seem to be one of them. On Friday, AI startup Anthropic published the results of "Project Vend," an internal experiment in which the company's Claude chatbot was asked to manage an automated vending machine service for about a month. Launched in partnership with AI safety evaluation company Andon Labs, the project aimed to get a clearer sense of h

Anthropic's Claude stocked a fridge with metal cubes when it was put in charge of a snacks business

If you're worried your local bodega or convivence store may soon be replaced by an AI storefront, you can rest easy — at least for the time being. Anthropic recently concluded an experiment, dubbed Project Vend, that saw the company task an offshoot of its Claude chatbot with running a refreshments business out of its San Francisco office at a profit, and things went about as well as you would expect. The agent, named Claudius to differentiate it from Anthropic's regular chatbot, not only made s

Anthropic’s Claude AI became a terrible business owner in experiment that got ‘weird’

For those of you wondering if AI agents can truly replace human workers, do yourself a favor and read the blog post that documents Anthropic’s “Project Vend.” Researchers at Anthropic and AI safety company Andon Labs put an instance of Claude Sonnet 3.7 in charge of an office vending machine, with a mission to make a profit. And, like an episode of “The Office,” hilarity ensued. They named the AI agent Claudius, equipped it with a web browser capable of placing product orders and an email addr

Project Vend: Can Claude run a small shop? (And why does that matter?)

We let Claude manage an automated store in our office as a small business for about a month. We learned a lot from how close it was to success—and the curious ways that it failed—about the plausible, strange, not-too-distant future in which AI models are autonomously running things in the real economy. Anthropic partnered with Andon Labs, an AI safety evaluation company, to have Claude Sonnet 3.7 operate a small, automated store in the Anthropic office in San Francisco. Here is an excerpt of