ZDNET's key takeaways
Google's new AI model can interact directly with website UIs.
It joins similar tools from OpenAI and Anthropic.
Google also acknowledged the model's weaknesses, including hallucinations.
Google DeepMind has debuted a new AI model in public preview that's designed to navigate a web browser just as a human would.
Built atop Gemini 2.5 Pro, the company's new Computer Use model can execute tasks like clicking, typing, and scrolling directly within a web page.
Users simply feed it a prompt in natural language, such as, "Open Wikipedia, search for 'Atlantis,' and summarize the history of the myth in Western thought." The model autonomously fetches the URL and screenshots of the requested site to analyze the user interface it needs to act within, then performs the requested task step by step, outlining its reasoning and actions in a text box that is easily visible to users. If it's instructed to perform a sensitive task, like making a purchase, it may also respond by asking for confirmation.
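For readers who want a concrete picture of that loop, here is a minimal Python sketch of the observe, act, and confirm cycle described above. It does not call Google's API: `Action`, `FakeBrowser`, `scripted_model`, and `run_agent` are hypothetical names invented for illustration, the "model" is a scripted stand-in, and the "browser" only records what it was asked to do.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """One UI step proposed by the (stand-in) model. Hypothetical structure."""
    kind: str                 # e.g. "click", "type", "scroll", or "done"
    detail: str = ""          # what to click or type
    reasoning: str = ""       # explanation shown to the user
    sensitive: bool = False   # True for steps such as making a purchase


class FakeBrowser:
    """Stand-in for a real browser: it only records the actions it receives."""

    def __init__(self) -> None:
        self.history: List[str] = []

    def screenshot(self) -> str:
        # A real agent would capture pixels; here a string describes the state.
        return f"screen after {len(self.history)} action(s)"

    def perform(self, action: Action) -> None:
        self.history.append(f"{action.kind}:{action.detail}")


def scripted_model(script: List[Action]) -> Callable[[str, str], Action]:
    """Return a fake 'model' that replays a fixed script, then reports done."""
    steps = iter(script)

    def propose(prompt: str, screenshot: str) -> Action:
        return next(steps, Action("done", reasoning="Task complete."))

    return propose


def run_agent(prompt: str, browser: FakeBrowser,
              propose: Callable[[str, str], Action],
              confirm: Callable[[Action], bool],
              max_steps: int = 10) -> List[str]:
    """Loop: look at the screen, get the next action, confirm if sensitive, act."""
    log: List[str] = []
    for _ in range(max_steps):
        action = propose(prompt, browser.screenshot())
        log.append(action.reasoning)
        if action.kind == "done":
            break
        if action.sensitive and not confirm(action):
            log.append(f"Skipped {action.kind}: user declined.")
            break
        browser.perform(action)
    return log


if __name__ == "__main__":
    script = [
        Action("type", "Atlantis", "Typing the query into the search box."),
        Action("click", "Search", "Submitting the search."),
        Action("click", "Buy now", "Completing the purchase.", sensitive=True),
    ]
    browser = FakeBrowser()
    # The user declines every sensitive step in this run.
    for line in run_agent("demo prompt", browser, scripted_model(script),
                          confirm=lambda action: False):
        print(line)
    print(browser.history)
```

In this run the two routine steps are carried out, while the purchase step is logged but never executed because the confirmation callback declines it. The real model's action format and confirmation flow may differ from this sketch.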