Gemini 3.1 Flash Lite is a lightweight, cost-efficient model optimized for high-throughput tasks with fast response times, multimodal understanding across text, images, and video, and 1M token context at the lowest price point in the Gemini family.
Context tokens
1,000,000
Output tokens
65,536
Training cutoff
Jan 1, 2025
Released
Mar 7, 2026
You can specify content config parameters as a regular JSON object with the JSON prompt format. We handle the model parameter and support text, image, and video modalities.
Schema documentationSupported languages
Supported input media
Google Maps
Grounding with Google Maps connects the generative capabilities of Gemini with the rich, factual, and up-to-date data of Google Maps. This feature enables developers to easily incorporate location-aware functionality into their applications. When a user query has a context related to Maps data, the Gemini model leverages Google Maps to provide factually accurate and fresh answers that are relevant to the user's specified location or general area.
Google Search
Grounding with Google Search connects the Gemini model to real-time web content and works with all available languages. This allows Gemini to provide more accurate answers and cite verifiable sources beyond its knowledge cutoff.
Seclai Content Tools
Inspect source documents connected to your account. Includes tools for loading full content, reading character ranges, searching within documents, viewing stats, and listing available content sources. When a source_connection_content_version_id is provided in agent run metadata it is used as the default. Otherwise the model can discover content via list_content_sources.
Seclai Knowledge Base
Search your knowledge bases using semantic similarity. Includes search_knowledge_base and list_knowledge_bases. When a knowledge_base_id is provided in the prompt or agent run metadata it is used as the default. Otherwise the model can discover available knowledge bases at runtime.
Seclai Memory Banks
Manage persistent memory across agent runs. Includes tools for listing memory banks, writing entries, searching memory via semantic similarity, and loading entries in chronological order. Supports two memory types: 'conversation' (speaker-attributed turns) and 'general' (freeform text). Use key to organize entries by topic, session, or user.
| Type | Credits | Units |
|---|---|---|
| Input | 3.33 | Credits per 1k tokens |
| Output | 19.95 | Credits per 1k tokens |
| Cache hit | 0.33 | Credits per 1k tokens |
No variants available for this model.