Free Nemotron API. With vision.
No key. No subscription.
Nemotron 3 Nano Omni — 31B / 3.2B active MoE, 256K context. The only free model that takes images, video, and audio. ChartQA 90.3, DocVQA 95.6, MMMU 70.8.
Try it now.
No API key. No wallet. No signup. Paste this into any terminal — the response streams back from Nemotron hosted free on NVIDIA, routed through BlockRun.
curl https://blockrun.ai/api/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning",
"messages": [{"role": "user", "content": "What is the largest planet in our solar system?"}]
}'- Context
- 256K
- Price
- free
- Modalities
- text · image · video · audio
- Best for
- reasoning · vision
- Context
- 131K
- Price
- free
- Best for
- reasoning · coding
6 ways to use Nemotron free.
BlockRun is the access layer. Pick the surface that matches how you build — terminal, notebook, IDE, agent runtime — and the same free models work everywhere.
- shell
curl https://blockrun.ai/api/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", "messages": [{"role": "user", "content": "What is the largest planet in our solar system?"}] }' - python
# Works with the OpenAI SDK — no key required for free models from openai import OpenAI client = OpenAI( base_url="https://blockrun.ai/api/v1", api_key="not-needed-for-free-models", ) response = client.chat.completions.create( model="nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", messages=[{"role": "user", "content": "What is the largest planet in our solar system?"}], ) print(response.choices[0].message.content) - 03
Franklin
the AI agent with a wallet — free OSS models for routine tasks, paid models on demand
Learn more →shell# Install Franklin curl -fsSL https://franklin.run/install | sh # Run with this model franklin chat --model nvidia/nemotron-3-nano-omni-30b-a3b-reasoning "Summarize the README" - 04
ClawRouter
smart router for OpenClaw / Claude Code — auto-picks free models when possible
Learn more →shell# Install once npm install -g @blockrun/clawrouter # Then point any OpenAI-compatible client at the local proxy. # ClawRouter routes to nvidia/nemotron-3-nano-omni-30b-a3b-reasoning (or the cheapest capable model) # without changing your code. - 05
Claude Code MCP
8 tools for Claude Code, Cursor & ChatGPT — call any free model from inside your editor
Learn more →shell# Add the BlockRun MCP server (Claude Code, Cursor, or ChatGPT desktop) claude mcp add blockrun --transport http https://mcp.blockrun.ai/mcp # Then call from inside the editor: # blockrun_chat(model="nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", messages=[{role:"user", content:"…"}]) - typescript
// Works with the OpenAI SDK — no key required for free models import OpenAI from "openai"; const client = new OpenAI({ baseURL: "https://blockrun.ai/api/v1", apiKey: "not-needed-for-free-models", }); const r = await client.chat.completions.create({ model: "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning", messages: [{ role: "user", content: "What is the largest planet in our solar system?" }], }); console.log(r.choices[0].message.content);
We don't share
your data.
Your prompt goes to the AI provider you picked. Nothing else, nowhere else. No training, no retention beyond the request, no profile linking.
- No training, no retention beyond the request. Your prompt is forwarded only to the AI provider you select.
- Wallet in, prompt out. Pseudonymous by default — no email, no phone number, no identity documents.
- Read the code, audit the wire format, run it yourself. @blockrun/llm and blockrun-llm on npm and PyPI.
Want Claude, GPT-5,
or Gemini too?
No subscription. No monthly minimum. Pay per call in USDC via x402 — works the same endpoint, same SDK, same model IDs. Connect a wallet, top up $5, call any frontier model. No credit card.
Everything you might
be wondering.
- Is this really the only free vision model?
- On BlockRun, yes — Nemotron 3 Nano Omni is the only free model that accepts images, video (up to 2 minutes), and audio (up to 1 hour). Everything else is paid.
- How do I send an image?
- Same as the OpenAI vision API — set message.content to an array with type: 'image_url' and either a URL or a data: base64 URI. Works with any OpenAI-compatible client.
- Vision quality?
- Strong: ChartQA 90.3, DocVQA 95.6, MMMU 70.8. Competitive with paid mid-tier vision models on charts, documents, screenshots. For frontier vision, switch to GPT-5.5 or Claude Opus on the paid tier.
- 256K context — how is that priced?
- Free. The 128 KB request body cap still applies, so very long context plus an image won't fit — for multi-megabyte multimodal payloads, use paid models.
- Are there other free Nemotron models?
- Yes — Nemotron Super 49B and Super 120B for text reasoning. They're hidden from the public catalog over privacy concerns but callable by direct ID.