Free LLM API.
No key. No wallet.
9 open-source models, hosted free on NVIDIA, routed through BlockRun. No key. No wallet. Six ways to call any of them.
curl https://blockrun.ai/api/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "nvidia/llama-4-maverick",
"messages": [{"role": "user", "content": "Hello"}]
}'6 dedicated pages.
One free endpoint.
Free Llama 4 API.
Meta's Llama 4 Maverick (17B × 128 experts MoE), 131K context. No key, no wallet, no subscription. Just call it.
Free Reasoning API.
Qwen3-Next 80B (262K context, fast) and Nemotron-3 Super 120B — frontier-grade free reasoning. No key, no signup.
Free Qwen3 API.
Qwen3-Next 80B Instruct (3B active MoE, 262K context, fast) and Qwen3.5 122B (10B active). Reasoning + coding. No key.
Free Mistral API.
Mistral Large 3 675B — Mistral's flagship, free. Plus Mistral Nemotron (Mistral × NVIDIA, fast). No key.
Free Nemotron API. With vision.
Nemotron 3 Nano Omni — 31B / 3.2B active MoE, 256K context. The only free model that takes images, video, and audio. ChartQA 90.3, DocVQA 95.6, MMMU 70.8.
Free GPT-OSS API.
OpenAI's GPT-OSS — the only open-weights models OpenAI ever released. 120B and 20B variants, 128K context. Hosted free on NVIDIA, called through BlockRun.
We don't share
your data.
Your prompt goes to the AI provider you picked. Nothing else, nowhere else. No training, no retention beyond the request, no profile linking.
- No training, no retention beyond the request. Your prompt is forwarded only to the AI provider you select.
- Wallet in, prompt out. Pseudonymous by default — no email, no phone number, no identity documents.
- Read the code, audit the wire format, run it yourself. @blockrun/llm and blockrun-llm on npm and PyPI.
Frontier models are
pay-per-call. Same endpoint.
Claude Opus, GPT-5.5, Gemini 2.5 Pro, Grok, Kimi — 50+ paid models on the same API. No subscription. No monthly minimum. Top up $5 in USDC and call anything.