Nemotron Nano 12B v2 VL

LLM

NVIDIA

Nemotron Nano 12B v2 VL is a multimodal reasoning model excelling at document intelligence, visual Q&A, and video understanding with support for multi-image analysis, OCR, and 128K context.

Context tokens

128,000

Output tokens

4,096

Docs

Model documentation

Training cutoff

Dec 1, 2023

Released

May 19, 2025

Schema

OpenAI-compatible chat format.

Schema documentation

Capabilities

Tool use

Structured output

OpenAI API

Multimodal

Supported input media

image

text

Supported tools

Seclai Content Tools
Inspect source documents connected to your account. Includes tools for loading full content, reading character ranges, searching within documents, viewing stats, and listing available content sources. When a source_connection_content_version_id is provided in agent run metadata it is used as the default. Otherwise the model can discover content via list_content_sources.
Seclai Knowledge Base
Search your knowledge bases using semantic similarity. Includes search_knowledge_base and list_knowledge_bases. When a knowledge_base_id is provided in the prompt or agent run metadata it is used as the default. Otherwise the model can discover available knowledge bases at runtime.
Seclai Memory Banks
Manage persistent memory across agent runs. Includes tools for listing memory banks, writing entries, searching memory via semantic similarity, and loading entries in chronological order. Supports two memory types: 'conversation' (speaker-attributed turns) and 'general' (freeform text). Use key to organize entries by topic, session, or user.
Seclai Web Tools
Fetch web pages and search the web from within agent prompt calls. Includes seclai_web_fetch for retrieving page content in markdown, HTML, or plain text, and seclai_web_search for finding relevant pages with content snippets.

Variants

Tier

Priority and flex tiers trade speed for cost.