Free · 9 OSS Models

Free LLM API.
No key. No wallet.

9 open-source models, hosted free on NVIDIA, routed through BlockRun. No key. No wallet. Six ways to call any of them.

curl

curl https://blockrun.ai/api/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/gpt-oss-120b",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

↑ Paste in any terminal. Returns 200 with a chat completion.

Pick a model family

5 dedicated pages.
One free endpoint.

Free · Llama 3.3 Nemotron Super 49B
Free Llama API.
Llama 3.3 Nemotron Super 49B — NVIDIA's Llama-3.3-derived reasoner, 131K context. No key, no wallet, no subscription. Just call it.
Try Llama free →
Free · Reasoning
Free Reasoning API.
Step 3.7 Flash (131K context, fast) and GPT-OSS 120B — frontier-grade free reasoning. No key, no signup.
Try Reasoning free →
Free · Mistral Small 4 + Mistral Nemotron
Free Mistral API.
Mistral Small 4 119B — fast, free, no key. Plus Mistral Nemotron (Mistral × NVIDIA) as the second free option.
Try Mistral free →
Free · Nemotron Nano Omni
Free Nemotron API. With vision.
Nemotron 3 Nano Omni — 31B / 3.2B active MoE, 256K context. The only free model that takes images, video, and audio. ChartQA 90.3, DocVQA 95.6, MMMU 70.8.
Try Nemotron free →
Free · GPT-OSS 120B + 20B
Free GPT-OSS API.
OpenAI's GPT-OSS — the only open-weights models OpenAI ever released. 120B and 20B variants, 128K context. Hosted free on NVIDIA, called through BlockRun.
Try GPT-OSS free →

Trust / Defaults

We don't share
your data.

Your prompt goes to the AI provider you picked. Nothing else, nowhere else. No training, no retention beyond the request, no profile linking.

Read the privacy policy Terms →

We don't share your data: No training, no retention beyond the request. Your prompt is forwarded only to the AI provider you select.
No accounts, no KYC: Wallet in, prompt out. Pseudonymous by default — no email, no phone number, no identity documents.
Open-source SDKs, MIT: Read the code, audit the wire format, run it yourself. @blockrun/llm and blockrun-llm on npm and PyPI.

When free isn't enough

Frontier models are
pay-per-call. Same endpoint.

Claude Opus, GPT-5.6, Gemini, Grok, Kimi — 58 paid models on the same API. No subscription. No monthly minimum. Top up $5 in USDC and call anything.

See pay-per-call pricing Browse all models

Free LLM API.No key. No wallet.

5 dedicated pages.One free endpoint.

Free Llama API.

Free Reasoning API.

Free Mistral API.

Free Nemotron API. With vision.

Free GPT-OSS API.

We don't shareyour data.

Frontier models arepay-per-call. Same endpoint.

Free LLM API.
No key. No wallet.

5 dedicated pages.
One free endpoint.

We don't share
your data.

Frontier models are
pay-per-call. Same endpoint.