Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: agent Clear Filter

We Can Just Measure Things

We Can Just Measure Things This week I spent time with friends to letting agents go wild and see what we could build in 24 hours. I took some notes for myself to reflect on that experience. I won't bore you with another vibecoding post, but you can read Peter's post about how that went. As fun as it was, it also was frustrating in other ways and in entire predictable ways. It became a meme about how much I hated working with Xcode for this project. This got me thinking quite a bit more that th

Companies That Replaced Humans With AI Are Realizing Their Mistake

According to tech billionaire and OpenAI CEO Sam Altman, 2025 was supposed to be the year "when AI agents will work." Despite widespread hype, so-called "AI agents" — a software product that's supposed to complete human-level tasks autonomously — have yet to live up to their name. As of April, even the best AI agent could only finish 24 percent of the jobs assigned to it. Still, that didn't stop business executives from swarming to the software like flies to roadside carrion, gutting entire dep

OpenAI open sourced a new Customer Service Agent framework — learn more about its growing enterprise strategy

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Editor’s note: Carl will lead an editorial roundtable on this topic at VB Transform next week. Register today. OpenAI has released a new open-source demo that gives developers a hands-on look at how to build intelligent, workflow-aware AI agents using the Agents SDK. As first noticed by AI influencer and engineer Tibor Blaho (of the thir

Revisiting Minsky's Society of Mind in 2025

A Teenager’s Frustration, a Researcher’s Revelation (Note: this is a deeply nerdy / technical post, with most applicability to folks building AI systems, and of little relevance to most users of AI.) In the late 90s, as a tech-obsessed teenager, I picked up Marvin Minsky’s 1986 book The Society of Mind expecting profound answers about intelligence. It was exciting: Minsky made AI seem so tractable, with beautiful essays arguing that the mind is composed of countless simple “agents” – little pr

Companies That Replaced With Humans With AI Are Realizing Their Mistake

According to tech billionaire and OpenAI CEO Sam Altman, 2025 was supposed to be the year "when AI agents will work." Despite widespread hype, so-called "AI agents" — a software product that's supposed to complete human-level tasks autonomously — have yet to live up to their name. As of April, even the best AI agent could only finish 24 percent of the jobs assigned to it. Still, that didn't stop business executives from swarming to the software like flies to roadside carrion, gutting entire dep

Building Effective AI Agents

Over the past year, we've worked with dozens of teams building large language model (LLM) agents across industries. Consistently, the most successful implementations weren't using complex frameworks or specialized libraries. Instead, they were building with simple, composable patterns. In this post, we share what we’ve learned from working with our customers and building agents ourselves, and give practical advice for developers on building effective agents. What are agents? "Agent" can be de

The Download: power in Puerto Rico, and the pitfalls of AI agents

On the southeastern coast of Puerto Rico lies the country’s only coal-fired power station, flanked by a mountain of toxic ash. The plant, owned by the utility giant AES, has long plagued this part of Puerto Rico with air and water pollution. Before the coal plant opened Guayama had on average just over 103 cancer cases per year. In 2003, the year after the plant opened, the number of cancer cases in the municipality surged by 50%, to 167. In 2022, the most recent year with available data, cas

When AIs bargain, a less advanced agent could cost you

This study is part of a growing body of research warning about the risks of deploying AI agents in real-world financial decision-making. Earlier this month, a group of researchers from multiple universities argued that LLM agents should be evaluated primarily on the basis of their risk profiles, not just their peak performance. Current benchmarks, they say, emphasize accuracy and return-based metrics, which measure how well an agent can perform at its best but overlook how safely it can fail. Th

Use Copilot Agent Mode in Visual Studio (Preview)

Access to this page requires authorization. You can try changing directories . Access to this page requires authorization. You can try signing in or changing directories . Use Copilot agent mode in Visual Studio (Preview) With GitHub Copilot's agent mode in Visual Studio, you can use natural language to specify a high-level task. AI will then autonomously reason through the request, plan the work needed, and apply the changes to your codebase. Agent mode combines code editing and tool invocat

Salesforce study finds LLM agents flunk CRM and confidentiality tests

A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality. A team led by Kung-Hsiang Huang, a Salesforce AI researcher, showed that using a new benchmark relying on synthetic data, LLM agents achieve around a 58 percent success rate on tasks that can be completed in a single step without needing follow-up actions or more information. Using the benchmark tool CRMArena-Pro, the team a

7 trends shaping digital transformation in 2025 - and AI looms large

shomos uddin/Getty Images Welcome to the age of hybrid work, where businesses will augment the human workforce with AI agents -- the birth of the autonomous enterprise, according to research from technology specialist MuleSoft. Here are the top seven trends that Mulesoft suggests are shaping digital transformation in 2025: House of AI agents: The autonomous enterprise built on a "house of agents" will take hold, as organizations augment the human workforce with AI, freeing them to focus on mor

Here's why network infrastructure is vital to maximizing your company's AI adoption

Weiquan Lin/Getty Images When companies begin taking the first steps toward AI adoption, one of the first pieces of advice they receive is to address the quality of their data. However, another foundational element that is often overlooked, but is just as critical to the success of AI deployment, is network infrastructure. At Cisco Live, ZDNET spoke with Anurag Dhingra, SVP and GM of the Enterprise Connectivity and Collaboration Group, to learn more about the role network infrastructure plays

The case for embedding audit trails in AI systems before scaling

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Editor’s note: Emilia will lead an editorial roundtable on this topic at VB Transform this month. Register today. Orchestration frameworks for AI services serve multiple functions for enterprises. They not only set out how applications or agents flow together, but they should also let administrators manage workflows and agents and audit t

AI agents will be ambient, but not autonomous - what that means for us

Harrison Chase, LangChain CEO and co-founder, takes the stage at Cisco Live! to discuss ambient agents. Sabrina Ortiz/ZDNET Until recently, AI solutions that can execute tasks on your behalf seemed futuristic. Now the era of AI agents is here, with nearly every company offering its own solution. On the horizon, though, is a more advanced and even more promising milestone -- ambient agents. On day three of the Cisco Live! conference, LangChain CEO and co-founder Harrison Chase took the stage to

Design Patterns for Securing LLM Agents Against Prompt Injections

Design Patterns for Securing LLM Agents against Prompt Injections This new paper by 11 authors from organizations including IBM, Invariant Labs, ETH Zurich, Google and Microsoft is an excellent addition to the literature on prompt injection and LLM security. In this work, we describe a number of design patterns for LLM agents that significantly mitigate the risk of prompt injections. These design patterns constrain the actions of agents to explicitly prevent them from solving arbitrary tasks.

Unpacking AI Agents

In the past six months, OpenAI, Anthropic, Google, and others have released web-browsing agents that are designed to complete tasks independently, with only minimal input from humans. OpenAI CEO Sam Altman has even called AI agents “the next giant breakthrough.” On today’s episode, we'll dive into what makes these agents different from other forms of machine intelligence, and whether their capabilities can live up to the hype. You can follow Michael Calore on Bluesky at @snackfight, Lauren Good

The Download: AI agents’ autonomy, and sodium-based batteries

In recent months, a new class of agents has arrived on the scene: ones built using large language models. Any action that can be captured by text—from playing a video game using written commands to running a social media account—is potentially within the purview of this type of system. LLM agents don’t have much of a track record yet, but to hear CEOs tell it, they will transform the economy—and soon. Despite that, like chatbot LLMs, agents can be chaotic and unpredictable. Here’s what could

Are we ready to hand AI agents the keys?

The flash crash is probably the most well-known example of the dangers raised by agents—automated systems that have the power to take actions in the real world, without human oversight. That power is the source of their value; the agents that supercharged the flash crash, for example, could trade far faster than any human. But it’s also why they can cause so much mischief. “The great paradox of agents is that the very thing that makes them useful—that they’re able to accomplish a range of tasks—

Agentic Coding Recommendations

Agentic Coding Recommendations There is currently an explosion of people sharing their experiences with agentic coding. After my last two posts on the topic, I received quite a few questions about my own practices. So, here goes nothing. Preface For all intents and purposes, here's what I do: I predominently use Claude Code with the cheaper Max subscription for $100 a month . That works well for several reasons: I exclusively use the cheaper Sonnet model. It's perfectly adequate for my needs,

Why most enterprise AI agents never reach production and how Databricks plans to fix it

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Many enterprise AI agent development efforts never make it to production and it’s not because the technology isn’t ready. The problem, according to Databricks, is that companies are still relying on manual evaluations with a process that’s slow, inconsistent and difficult to scale. Today at the Data + AI Summit, Databricks launched Mosaic

How I Program with Agents

How I program with Agents 2025-06-08 This is the second part of my ongoing self-education in how to adapt my programming experience to a world with computers that talk. The first part, How I program with LLMs, covered ways LLMs can be adapted into our existing tools (basically, autocomplete) and how careful prompting can replace traditional web search. Now I want to talk about the harder, and more rewarding act of using agents to program. Define Agent It is worthwhile starting with a definit

Invisible, autonomous and hackable: The AI agent dilemma no one saw coming

This article is part of VentureBeat’s special issue, “The cyber resilience playbook: Navigating the new era of threats.” Read more from this special issue here. Generative AI poses interesting security questions, and as enterprises move into the agentic world, those safety issues increase. When AI agents enter workflows, they must be able to access sensitive data and documents to do their job — making them a significant risk for many security-minded enterprises. “The rising use of multi-agent

Rabbit shows off the AI agent it should have launched with

is a weekend editor who covers the latest in tech and entertainment. He has written news, reviews, and more as a tech journalist since 2020. The Humane AI Pin has collapsed, but Rabbit is still kicking. The company published a blog post and video today showing off a “generalist Android agent,” slowly controlling apps on a tablet in much the same way that Rabbit claimed its R1 device would over a year ago. (It couldn’t, and can’t.) The work builds on LAM Playground, a “generalist web agent” Rabb