Elyse Betters Picaro
Follow ZDNET: Add us as a preferred source on Google.
Hello, fellow humans! AI chatbots will soon replace us. They have access to more knowledge than our puny brains can hold, and they can easily be turned into powerful agents that can handle routine tasks with ease.
Or so we are told. I keep trying Microsoft Copilot, which uses OpenAI's GPT-5 as its default LLM, and I keep being disappointed. Occasionally, it gets things right, but just as often -- or so it seems -- it face-plants in spectacular fashion.
Also: Google's Gemini 3 is finally here and it's smarter, faster, and free to access
Does that mean it's time to choose a new LLM? Google's Gemini 3 has been winning rave reviews recently, so I decided to put it to the test, with a head-to-head challenge against Copilot.
My goal was to identify a selection of common tasks that an ordinary computer user (not a developer or scientist) would use in a desktop browser on a PC or Mac. For each scenario, I executed the same prompt on each assistant and made note of the results.
Let the games begin.
Challenge No. 1: Put together a trip itinerary
Winner: Gemini
... continue reading