Gemini 3.1 Flash‑Lite is rolling out in preview to developers via the Gemini API in Google AI Studio and to enterprises via Vertex AI, the company said in a company blog post. The model is positioned for high‑volume developer workloads that require low latency and cost efficiency.
Priced at $0.25/1 million input tokens and $1.50/1 million output tokens, Gemini 3.1 Flash‑Lite is presented as faster and cheaper than prior small‑tier models. The blog post cites the Artificial Analysis benchmark, reporting a 2.5X faster Time to First Answer Token and a 45% increase in output speed versus 2.5 Flash while maintaining similar or better quality.
Performance claims include an Elo score of 1432 on the Arena.ai Leaderboard and benchmark results of 86.9% on GPQA Diamond and 76.8% on MMMU Pro, the company said in the blog post. The model is described as capable across reasoning and multimodal understanding tasks, and the release highlights use cases such as high‑volume translation, content moderation and dynamic interface generation.
Related reading
- Google brings Find Hub luggage tracking to airlines
- Google launches Nano Banana 2 image model with faster generation and expanded object fidelity
- Google expands AI Max text guidelines globally
"Today, we're introducing Gemini 3.1 Flash‑Lite, our fastest and most cost‑efficient Gemini 3 series model," The Gemini Team said. The blog post also notes built‑in "thinking levels" in AI Studio and Vertex AI to let developers adjust how much the model reasons on a task, and it lists example applications including e‑commerce wireframe population, real‑time weather dashboards and multi‑step SaaS agents.
Early‑access developers on AI Studio and Vertex AI, and companies including Latitude, Cartwheel and Whering, are reported to be testing the model. Developers can access Gemini 3.1 Flash‑Lite via the Gemini API in Google AI Studio and enterprises can use it through Vertex AI, the company said in the blog post.
The recap
- Gemini 3.1 Flash‑Lite launches in preview for developers and enterprises
- Priced at $0.25/1 million input and $1.50/1 million output
- Available in preview via Google AI Studio and Vertex AI