DeepSeek V4 Flash (Free)
nvidia/deepseek-v4-flash
nvidia
DeepSeek V4 Flash hosted free by NVIDIA. 284B / 13B active MoE, 1M context, ~5x faster than V4 Pro. Best for chat, summarization, light reasoning. Weaker factual recall — pick V4 Pro for fact-heavy agentic loops
Code Examples
from blockrun_llm import LLMClient
client = LLMClient() # Uses BLOCKRUN_WALLET_KEY (never sent to server)
response = client.chat("nvidia/deepseek-v4-flash", "Hello!")Pricing
InputFree / 1M tokens
OutputFree / 1M tokens
Context1M tokens
Max Output16K tokens
Free model — no payment or wallet required.
Payment
Network
Base
Currency
USDC
Protocol
x402
Pay per request with USDC on Base. No subscription required.
Try It
Send a message to try DeepSeek V4 Flash (Free)
Connect your wallet to enable payments