BlockRun

DeepSeek V4 Flash (Free)

nvidia/deepseek-v4-flash

nvidia

DeepSeek V4 Flash, hosted free by NVIDIA. A 284B-parameter MoE with 13B active parameters, a 1M-token context, and roughly 5x the speed of V4 Pro. Best for chat, summarization, and light reasoning. Factual recall is weaker, so pick V4 Pro for fact-heavy agentic loops.

Code Examples

from blockrun_llm import LLMClient

client = LLMClient()  # Uses BLOCKRUN_WALLET_KEY (never sent to server)
response = client.chat("nvidia/deepseek-v4-flash", "Hello!")
print(response)  # Show the model's reply

Pricing

Input: Free / 1M tokens
Output: Free / 1M tokens
Context: 1M tokens
Max Output: 16K tokens
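
The context and max-output limits above suggest a simple pre-flight check before sending a long prompt. The following is a minimal sketch, not part of the BlockRun SDK: the names `estimate_tokens` and `fits_in_context` are hypothetical, "1M" and "16K" are read as 1,000,000 and 16,000 tokens, the 4-characters-per-token ratio is a rough heuristic rather than the model's tokenizer, and it assumes the reserved output counts against the context window.

```python
# Limits taken from the pricing table. Exact values are assumptions:
# "1M" is read as 1,000,000 tokens and "16K" as 16,000 tokens.
CONTEXT_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 16_000

# Hypothetical heuristic, not the model's real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate: ceiling of len(text) / CHARS_PER_TOKEN."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Return True if the estimated prompt plus reserved output fits the window.

    Assumes output tokens share the context window with the prompt.
    """
    if reserved_output > MAX_OUTPUT_TOKENS:
        raise ValueError("reserved_output exceeds the model's max output")
    return estimate_tokens(prompt) + reserved_output <= CONTEXT_TOKENS
```

Calling `fits_in_context(document_text)` before a request gives an early, approximate signal that a prompt needs trimming or chunking; a real tokenizer count would be more reliable where one is available.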

Free model — no payment or wallet required.

Payment

Network: Base
Currency: USDC
Protocol: x402

Pay per request with USDC on Base. No subscription required.
