Subscribe to Our Newsletter

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn’t arrive within 3 minutes, check your spam folder.

Ok, Thanks

Gemini 3.1 Flash-Lite enters preview

The latest version is available in preview to developers and enterprises via Google AI Studio and Vertex AI.

Defused News Writer profile image
by Defused News Writer
Gemini 3.1 Flash-Lite enters preview
Photo by Solen Feyissa / Unsplash

Gemini 3.1 Flash‑Lite is rolling out in preview to developers via the Gemini API in Google AI Studio and to enterprises via Vertex AI, the company said in a company blog post. The model is positioned for high‑volume developer workloads that require low latency and cost efficiency.

Priced at $0.25/1 million input tokens and $1.50/1 million output tokens, Gemini 3.1 Flash‑Lite is presented as faster and cheaper than prior small‑tier models. The blog post cites the Artificial Analysis benchmark, reporting a 2.5X faster Time to First Answer Token and a 45% increase in output speed versus 2.5 Flash while maintaining similar or better quality.

Performance claims include an Elo score of 1432 on the Arena.ai Leaderboard and benchmark results of 86.9% on GPQA Diamond and 76.8% on MMMU Pro, the company said in the blog post. The model is described as capable across reasoning and multimodal understanding tasks, and the release highlights use cases such as high‑volume translation, content moderation and dynamic interface generation.

"Today, we're introducing Gemini 3.1 Flash‑Lite, our fastest and most cost‑efficient Gemini 3 series model," The Gemini Team said. The blog post also notes built‑in "thinking levels" in AI Studio and Vertex AI to let developers adjust how much the model reasons on a task, and it lists example applications including e‑commerce wireframe population, real‑time weather dashboards and multi‑step SaaS agents.

Early‑access developers on AI Studio and Vertex AI, and companies including Latitude, Cartwheel and Whering, are reported to be testing the model. Developers can access Gemini 3.1 Flash‑Lite via the Gemini API in Google AI Studio and enterprises can use it through Vertex AI, the company said in the blog post.

The recap

  • Gemini 3.1 Flash‑Lite launches in preview for developers and enterprises
  • Priced at $0.25/1 million input and $1.50/1 million output
  • Available in preview via Google AI Studio and Vertex AI
Defused News Writer profile image
by Defused News Writer