Today, we are releasing updated versions of Gemini 2.5 Flash and 2.5 Flash-Lite, available on Google AI Studio and Vertex AI , aimed at continuing to deliver better quality while also improving the efficiency.
Improvements in quality and speed for Gemini 2.5 Flash and 2.5 Flash Lite preview models compared to the current stable models
The latest version of Gemini 2.5 Flash-Lite was trained and built based on three key themes:
Better instruction following: The model is significantly better at following complex instructions and system prompts.
Reduced verbosity: It now produces more concise answers, a key factor in reducing token costs and latency for high-throughput applications (see charts above).
Stronger multimodal & translation capabilities: This update features more accurate audio transcription, better image understanding, and improved translation quality.
You can start testing this version today using the following model string: gemini-2.5-flash-lite-preview-09-2025 .
This latest 2.5 Flash model comes with improvements in two key areas we heard consistent feedback on:
Better agentic tool use: We've improved how the model uses tools, leading to better performance in more complex, agentic and multi-step applications. This model shows noticeable improvements on key agentic benchmarks, including a 5% gain on SWE-Bench Verified, compared to our last release (48.9% → 54%).
... continue reading