BlockRun

Mainnet Models

Base

Production models on Base mainnet. Pay with real USDC.

39 models

GPT-5.5

openai

Newest OpenAI flagship — first fully retrained base since GPT-4.5. 1M context, 128K output, native agent + computer use

Input: $5.00/MOutput: $30.00/M

GPT-5.4

openai

Most capable and efficient frontier model with 1M context, native computer use, and thinking mode

Input: $2.50/MOutput: $15.00/M

GPT-5.4 Pro

openai

Premium GPT-5.4 with maximum compute for the hardest problems

Input: $30.00/MOutput: $180.00/M

GPT-5.3

openai

High intelligence with medium speed. Multimodal with vision, function calling, and structured outputs

Input: $1.75/MOutput: $14.00/M

GPT-5.2

openai

Frontier model with 400K context and adaptive reasoning

Input: $1.75/MOutput: $14.00/M

GPT-5.4 Mini

openai

Strongest mini model for coding, computer use, and subagents with GPT-5.4 capabilities

Input: $0.75/MOutput: $4.50/M

GPT-5 Mini

openai

Cost-optimized reasoning and chat

Input: $0.25/MOutput: $2.00/M

GPT-5.4 Nano

openai

Fastest and most affordable GPT-5.4 model for high-throughput tasks

Input: $0.20/MOutput: $1.25/M

GPT-5.2 Pro

openai

Uses more compute for consistently better answers

Input: $21.00/MOutput: $168.00/M

GPT-5.3 Codex

openai

Industry-leading agentic coding model. 400K context, reasoning, tool use, and complex execution

Input: $1.75/MOutput: $14.00/M

o1

openai

Advanced reasoning model for complex tasks

Input: $15.00/MOutput: $60.00/M

o1-mini

openai

Fast reasoning model optimized for STEM

Input: $1.10/MOutput: $4.40/M

o3

openai

Latest reasoning model with improved performance

Input: $2.00/MOutput: $8.00/M

o3-mini

openai

Efficient reasoning model for STEM tasks

Input: $1.10/MOutput: $4.40/M

Claude Haiku 4.5

anthropic

Fastest and most efficient Claude, near-frontier intelligence

Input: $1.00/MOutput: $5.00/M

Claude Sonnet 4.6

anthropic

Best balance of intelligence, speed, and cost

Input: $3.00/MOutput: $15.00/M

Claude Opus 4.5

anthropic

Latest Anthropic flagship with enhanced reasoning and creativity

Input: $5.00/MOutput: $25.00/M

Claude Opus 4.7

anthropic

Most capable Claude for complex reasoning and agentic coding. 1M context, 128k output, adaptive thinking

Input: $5.00/MOutput: $25.00/M

Gemini 3.1 Pro

google

Latest Gemini with improved thinking, token efficiency, and agentic capabilities. Optimized for software engineering (requires new SDK)

Input: $2.00/MOutput: $12.00/M

Gemini 3 Pro Preview

google

Flagship frontier model for high-precision multimodal reasoning

Input: $2.00/MOutput: $12.00/M

Gemini 3 Flash Preview

google

Frontier-class performance with Pro-level intelligence at Flash speed and pricing. Includes thinking mode (requires new SDK)

Input: $0.50/MOutput: $3.00/M

Gemini 2.5 Pro

google

State-of-the-art for reasoning, coding, and mathematics

Input: $1.25/MOutput: $10.00/M

Gemini 2.5 Flash

google

Fast and efficient Gemini model with vision support

Input: $0.30/MOutput: $2.50/M

Gemini 3.1 Flash Lite

google

Ultra-fast and lightweight Gemini 3.1 model with thinking mode for high-throughput tasks

Input: $0.25/MOutput: $1.50/M

Gemini 2.5 Flash Lite

google

Most economical Gemini model - ultra-fast and lightweight (requires new SDK)

Input: $0.10/MOutput: $0.40/M

DeepSeek V4 Pro

deepseek75% Off until 2026-05-31

DeepSeek V4 flagship — 1.6T MoE / 49B active, 1M context. Strongest open-weight reasoner. Thinking mode default.

Input: $0.50/MOutput: $1.00/M

DeepSeek V4 Flash Chat

deepseek

Paid V4 Flash in non-thinking mode (1.6T-class quality at $0.20 in / $0.40 out). Same model as the free nvidia/deepseek-v4-flash but on a paid endpoint with higher reliability and 5MB request bodies.

Input: $0.20/MOutput: $0.40/M

DeepSeek V4 Flash Reasoner

deepseek

Paid V4 Flash in thinking mode for reasoning tasks. Same upstream as deepseek/deepseek-chat but with thinking enabled by default.

Input: $0.20/MOutput: $0.40/M

Kimi K2.6

moonshot

Moonshot's flagship multi-modal reasoning model. 256K context, vision + text, returns reasoning_content. Upstream: $0.95 in / $4.00 out per 1M.

Input: $0.95/MOutput: $4.00/M

GLM-5.1

zaiLimited Promotion

Z.AI's latest flagship — #1 open source on SWE-Bench Pro, 8-hour autonomous execution. 200K context

Input: Free/MOutput: Free/M

GLM-5

zaiLimited Promotion

Z.AI's foundation model with 200K context. Strong reasoning and agentic capabilities

Input: Free/MOutput: Free/M

GLM-5 Turbo

zaiLimited Promotion

Optimized GLM-5 variant with faster inference

Input: Free/MOutput: Free/M

MiniMax M2.7

minimax

MiniMax's flagship reasoning model with recursive self-improvement. Great value for complex tasks (~60 tps)

Input: $0.30/MOutput: $1.20/M

DeepSeek V4 Flash (Free)

nvidiaFree

DeepSeek V4 Flash hosted free by NVIDIA. 284B / 13B active MoE, 1M context, ~5x faster than V4 Pro. Best for chat, summarization, light reasoning. Weaker factual recall — pick V4 Pro for fact-heavy agentic loops

Input: Free/MOutput: Free/M

Nemotron 3 Nano Omni (Free)

nvidiaFree

NVIDIA's multimodal reasoning Nemotron Nano Omni hosted free by NVIDIA. 31B / 3.2B active MoE. Accepts text, images, video, audio. ChartQA 90.3, DocVQA 95.6, MMMU 70.8 — the only vision-capable free model in our catalog

Input: Free/MOutput: Free/M

Qwen3 Coder 480B (Free)

nvidiaFree

Qwen's 480B MoE coding model (35B active) hosted by NVIDIA. Optimized for code generation

Input: Free/MOutput: Free/M

Llama 4 Maverick (Free)

nvidiaFree

Meta's Llama 4 Maverick MoE (17B x 128 experts) hosted free by NVIDIA

Input: Free/MOutput: Free/M

Qwen3-Next 80B Thinking (Free)

nvidiaFree

Qwen3-Next 80B MoE (3B active params) with thinking mode. Fastest top-tier reasoning on the free tier — 116 tok/s on our benchmark

Input: Free/MOutput: Free/M

Mistral Small 4 119B (Free)

nvidiaFree

Mistral Small 4 (119B) hosted free by NVIDIA. 114 tok/s — fastest free chat model we ship

Input: Free/MOutput: Free/M