Gemini 3.1 Flash-Lite enters preview

Gemini 3.1 Flash‑Lite is rolling out in preview to developers via the Gemini API in Google AI Studio and to enterprises via Vertex AI, the company said in a company blog post. The model is positioned for high‑volume developer workloads that require low latency and cost efficiency.

Priced at $0.25/1 million input tokens and $1.50/1 million output tokens, Gemini 3.1 Flash‑Lite is presented as faster and cheaper than prior small‑tier models. The blog post cites the Artificial Analysis benchmark, reporting a 2.5X faster Time to First Answer Token and a 45% increase in output speed versus 2.5 Flash while maintaining similar or better quality.

Performance claims include an Elo score of 1432 on the Arena.ai Leaderboard and benchmark results of 86.9% on GPQA Diamond and 76.8% on MMMU Pro, the company said in the blog post. The model is described as capable across reasoning and multimodal understanding tasks, and the release highlights use cases such as high‑volume translation, content moderation and dynamic interface generation.

"Today, we're introducing Gemini 3.1 Flash‑Lite, our fastest and most cost‑efficient Gemini 3 series model," The Gemini Team said. The blog post also notes built‑in "thinking levels" in AI Studio and Vertex AI to let developers adjust how much the model reasons on a task, and it lists example applications including e‑commerce wireframe population, real‑time weather dashboards and multi‑step SaaS agents.

Early‑access developers on AI Studio and Vertex AI, and companies including Latitude, Cartwheel and Whering, are reported to be testing the model. Developers can access Gemini 3.1 Flash‑Lite via the Gemini API in Google AI Studio and enterprises can use it through Vertex AI, the company said in the blog post.

The recap

Gemini 3.1 Flash‑Lite launches in preview for developers and enterprises
Priced at $0.25/1 million input and $1.50/1 million output
Available in preview via Google AI Studio and Vertex AI

Subscribe to Our Newsletter

Gemini 3.1 Flash-Lite enters preview

The recap

Wix launches ChatGPT app for website creation

Flux adds live configuration to speech recognition API

Crypto.com launches Crypto.com IRAs

Copilot introduces Tasks for automated actions

TCS Blockchain, PayPal USD partner on freight settlement

Explore topics

Tech

Artificial Intelligence

Business

Entertainment & Sport

Top tags

Gemini 3.1 Flash-Lite enters preview

Related reading

The recap

Wix launches ChatGPT app for website creation

Flux adds live configuration to speech recognition API

Crypto.com launches Crypto.com IRAs

Copilot introduces Tasks for automated actions

TCS Blockchain, PayPal USD partner on freight settlement