Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more


Context Engineering Guide

What is Context Engineering? A few years ago, many, even top AI researchers, claimed that prompt engineering would be dead by now. Obviously, they were very wrong, and in fact, prompt engineering is now even more important than ever. It is so important that it is now being rebranded as context engineering. Yes, another fancy term to describe the important process of tuning the instructions and relevant context that an LLM needs to perform its tasks effectively. Much has been written already

Solo.io wins ‘most likely to succeed’ award at VB Transform 2025 innovation showcase

Cambridge, Mass.-based Solo.io was awarded “Most Likely to Succeed” at the Innovation Showcase at VB Transform in San Francisco on June 25. Founded in 2017, the cloud-native application networking company, which raised $135 million in a Series C round in 2021 and is valued at $1 billion, provides tools for connecting, securing and observ

The great AI agent acceleration: Why enterprise adoption is happening faster than anyone predicted

The chatter around artificial general intelligence (AGI) may dominate headlines coming from Silicon Valley companies like OpenAI, Meta and xAI, but for enterprise leaders on the ground, the focus is squarely on practical applications and measurable results. At VentureBeat’s recent Transform 2025 event in San Francisco, a clear picture emerg

AI agent benchmarks are broken

Benchmarks are foundational to evaluating the strengths and limitations of AI systems, guiding both research and industry development. As AI agents move from research demos to mission-critical applications, researchers and practitioners are building benchmarks to evaluate their capabilities and limitations. These AI agent benchmarks are significantly more complex than traditional AI benchmarks in task formulation (e.g., often requiring a simulator of realistic scenarios) and evaluation (e.g., no

Show HN: Vibe Kanban – Kanban board to manage your AI coding agents

Get 10X more out of Claude Code, Gemini CLI, Codex, Amp and other coding agents... Overview: AI coding agents are increasingly writing the world's code, and human engineers now spend the majority of their time planning, reviewing, and orchestrating tasks. Vibe Kanban streamlines this process, enabling you to: easily switch between different coding agents; orchestrate the execution of multiple coding agents in parallel or in sequence; quickly review work and start dev servers; track the status o

AWS is launching an AI agent marketplace next week with Anthropic as a partner

Amazon Web Services (AWS) is launching an AI agent marketplace next week and Anthropic is one of its partners, TechCrunch has exclusively learned. The AWS agent marketplace launch will take place at the AWS Summit in New York City on July 15, two people familiar with the development told TechCrunch. AWS and Anthropic did not respond to requests for comments. AI agents are ubiquitous nowadays. And every single investor in Silicon Valley is bullish on startups building them — even if there is so

These are the T-Mobile plans eligible for the free DashPass perk (Updated: T-Mobile confirmation)

Joe Maring / Android Authority TL;DR T-Mobile is offering subscribers on select postpaid plans a free one-year DoorDash DashPass subscription via the T-Life app. There’s conflicting information about eligible plans between public FAQs and customer service. We’ve got a confirmation from T-Mobile about which plans are eligible for the DashPass perk. Update, July 10, 2025 (03:22 AM ET): A T-Mobile spokesperson has confirmed to us that the list of plans mentioned on the FAQ page is accurate, and

Skip the AI ‘bake-off’ and build autonomous agents: Lessons from Intuit and Amex

As generative AI matures, enterprises are shifting from experimentation to implementation, moving beyond chatbots and copilots into the realm of intelligent, autonomous agents. In a conversation with VentureBeat’s Matt Marshall, Ashok Srivastava, SVP and Chief Data Officer at Intuit, and Hillary Packer, EVP and CTO at American Express at VB

Biomni: A General-Purpose Biomedical AI Agent

Biomni: A General-Purpose Biomedical AI Agent Overview Biomni is a general-purpose biomedical AI agent designed to autonomously execute a wide range of research tasks across diverse biomedical subfields. By integrating cutting-edge large language model (LLM) reasoning with retrieval-augmented planning and code-based execution, Biomni helps scientists dramatically enhance research productivity and generate testable hypotheses. Quick Start Installation Our software environment is massive and

Scaling agentic AI: Inside Atlassian’s culture of experimentation

Scaling agentic AI isn’t just about having the latest tools — it requires clear guidance, the right context, and a culture that champions experimentation to unlock real value. At VentureBeat’s Transform 2025, Anu Bharadwaj, president of Atlassian, shared actionable insights into how the company has empowered its employees to build thousands of custom agents that solve real, everyday challenges. To build these agents, Atlassian has fostered a culture rooted in curiosity, enthusiasm and continuous

Zoom's AI gets a huge productivity upgrade - here's what it can do now

Zoom's AI assistant just got another agentic upgrade. The company announced Wednesday that its AI Companion, unveiled close to two years ago, can now connect with 16 third-party apps -- all without forcing the user to leave Zoom. The AI Companion can now more seamlessly assist sales and customer service professionals, for example, by connecting directly with apps like Salesforce and Zendesk. It can also

MCP isn’t KYC-ready: Why regulated sectors are wary of open agent exchanges

For something launched in November, the Model Context Protocol (MCP) has begun amassing a large number of users, all but guaranteeing the mass adoption needed to make it an industry standard. But there is a subset of enterprises that are not joining the hype for now: regulated industries, especially financial institutions. Banks and other

Supabase MCP can leak your entire SQL database

Model Context Protocol (MCP) has emerged as a standard way for LLMs to interact with external tools. While this unlocks new capabilities, it also introduces new risk surfaces. In this post, we show how an attacker can exploit Supabase’s MCP integration to leak a developer’s private SQL tables. The Problem: LLMs are often used to process data according to pre-defined instructions. The system prompt, user instructions, and the data context are all provided to the LLM as text. [ SYSTEM PROMPT ] You ar

Linux Foundation adopts A2A protocol to help solve one of AI's most pressing challenges

The Linux Foundation announced at the Open Source Summit in Denver that it will now host the Agent2Agent (A2A) protocol. Initially developed by Google and now supported by more than 100 leading technology companies, A2A is a crucial new open standard for secure and interoperable communication between AI agents. In his keynote presentation, Mike Smith, a Google staff software engineer, told the con

A million customer conversations with AI agents yielded this surprising lesson

Salesforce has had over one million AI agent-customer conversations. The company launched AI agents on its Salesforce Help site in October 2024, a full-screen experience that makes getting support simpler and more intuitive. With more than 60 million visits each year, Salesforce Help offers a wide range of product content through organized directories, search, and direct support. Having handled a million support requests since the launch of AI agents, Salesforce

What is going on in Unix with errno's limited nature


How Capital One built production multi-agent AI workflows to power enterprise use cases

How do you balance risk management and safety with innovation in agentic systems — and how do you grapple with core considerations around data and model selection? In this VB Transform session, Milind Naphade, SVP, technology, of AI Foundations at Capital One, offered best practices and lessons learned from real-world experiments and applications for deploying and scaling an agentic workflow. Capital One, committed to staying at the forefront of emerging technologies, recently launched a produc

I loved Arc browser and was skeptical of its agentic Dia replacement - until I tried it

Jack Wallen / Elyse Betters Picaro / ZDNET When The Browser Company announced they were ending Arc and developing an agentic browser that could leverage AI in ways other browsers were not, I was skeptical. I was starting to see the value in AI, but using it in such a way seemed like just another crutch for users to lean on -- so they didn't have to take the time to do those things themselves. I also saw it as a possible security and privacy issue. Also: Love Arc browser? You can get early acc

5 ways to be a great AI agent manager, according to business leaders

Microsoft recently said that maximizing AI and agents' performance involves learning about management concerns, including delegating, iterating, prompting, and refining the technology. The tech giant's research suggested that the requirement for someone to manage these teams of AIs has led to the evolution of a new role, the agent boss, who is responsible for the best performance of these fast-emerging technologies.

Forget the hype — real AI agents solve bounded problems, not open-world fantasies

Everywhere you look, people are talking about AI agents like they’re just a prompt away from replacing entire departments. The dream is seductive: Autonomous systems that can handle anything you throw at them, no guardrails, no constraints, just give them your AWS credentials and they’ll solve all your problems. But the reality is that’s ju

Why the simplest desktop agent abstraction wins

This is the first post in a series about the design and implementation of Bytebot. Give us a star on our open source repo. We’re still in the early innings of AI agents. There are hundreds of companies building wrappers around LLMs, trying to make them more useful: more tool-aware, more stateful, more capable of completing tasks across applications. But most of them are barking up the same tree: they’re building agents that work by connecting APIs and tools in structured ways. Bytebot was born o

Problems the AI industry is not addressing adequately

I think the AI industry is facing a handful of urgent problems it’s not addressing adequately. I believe everything I write here is at least directionally true, but I could be wrong. My aim isn’t to be definitive, just to spark a conversation. What follows is a set of expanded thoughts on those problems, in no particular order. Disclaimer: Not everyone in AI is as bad as I’m making them sound. I’m flattening a wildly diverse field into a single tone, which is obviously reductive. People are dif

I'm Losing All Trust in the AI Industry

I think the AI industry is facing a handful of urgent problems it’s not addressing adequately. I believe everything I write here is at least directionally true, but I could be wrong. My aim isn’t to be definitive, just to spark a conversation. What follows is a set of expanded thoughts on those problems, in no particular order. Disclaimer: Not everyone in AI is as bad as I’m making them sound. I’m flattening a wildly diverse field into a single tone, which is obviously reductive. People are dif

Context Engineering for Agents

Lance Martin TL;DR Agents need context to perform tasks. Context engineering is the art and science of filling the context window with just the right information at each step of an agent’s trajectory. In this post, I group context engineering into a few common strategies seen across many popular agents today. Context Engineering As Andrej Karpathy puts it, LLMs are like a new kind of operating system. The LLM is like the CPU and its context window is like the RAM, serving as the model’s work

WASM Agents: AI agents running in the browser

One of the main barriers to a wider adoption of open-source agents is the dependency on extra tools and frameworks that need to be installed before the agents can be run. In this post, we show how to write agents as HTML files, which can just be opened and run in a browser. One of the main barriers to a wider adoption and experimentation with open-source agents is the dependency on extra tools and frameworks that need to be installed before the agents can be run. In this post, we introduce the

The Percentage of Tasks AI Agents Are Currently Failing At May Spell Trouble for the Industry

It's safe to say there's a lot riding on "artificial intelligence," a buzzy and nebulous swath of the tech industry peddling all kinds of large language model (LLM) and similar software products. Since ChatGPT emerged in November 2022, venture capitalist investments in AI have skyrocketed, rising to $131.5 billion in 2024, an increase of 52 percent compared to 2023. In the last three months of 2024, over half of all venture capital in the world went to AI companies. One of the flashier bits of

Dust hits $6M ARR helping enterprises build AI agents that actually do stuff instead of just talking

Dust, a two-year-old artificial intelligence platform that helps enterprises build AI agents capable of completing entire business workflows, has reached $6 million in annual revenue, a six-fold increase from $1 million just one year ago. The company’s rapid growth signals a shift in enterprise AI adoption from simple chatbots toward sophi

Don’t let hype about AI agents get ahead of reality

Let’s start with the term “agent” itself. Right now, it’s being slapped on everything from simple scripts to sophisticated AI workflows. There’s no shared definition, which leaves plenty of room for companies to market basic automation as something much more advanced. That kind of “agentwashing” doesn’t just confuse customers; it invites disappointment. We don’t necessarily need a rigid standard, but we do need clearer expectations about what these systems are supposed to do, how autonomously th

What to build instead of AI agents

Paul: Today, the scene is owned by Hugo, a brilliant mind who advises and teaches teams building LLM-powered systems, including engineers from Netflix, Meta, and the U.S. Air Force. He runs a course on the LLM software development lifecycle, focusing on everything from retrieval and evaluation to agent design, and all the intermediate steps in between. Enough talking, I’ll let him dig into today’s controversial topic: “Stop building AI agents”. ↓🎙️ P.S. I agree with him. 🤫 Hugo: I've taught