BlockRun

Models

BlockRun provides access to models from multiple providers through a unified API.

List Models

GET https://blockrun.ai/api/v1/models

Returns a list of available models with pricing information. The response now includes extended metadata for each model.

Response Fields

Each model object in the response includes:

FieldTypeDescription
idstringModel identifier (e.g., openai/gpt-5.5)
namestringDisplay name (e.g., "GPT-5.5")
descriptionstringModel description
providerstringProvider name
inputPricenumberInput price per 1M tokens
outputPricenumberOutput price per 1M tokens
context_windownumberContext window size in tokens
max_outputnumberMaximum output tokens
categoriesstring[]Model capabilities: "chat", "reasoning", "coding", "vision"
availablebooleanWhether the model is currently available

Example Response

{
  "models": [
    {
      "id": "openai/gpt-5.5",
      "name": "GPT-5.5",
      "description": "OpenAI's flagship — first fully retrained base since GPT-4.5; 1M context, 128K output, native agent + computer use",
      "provider": "openai",
      "inputPrice": 5.00,
      "outputPrice": 30.00,
      "context_window": 1050000,
      "max_output": 128000,
      "categories": ["chat", "coding", "vision"],
      "available": true
    }
  ]
}

Available Models (50+ across chat / image / video / voice)

60+ models across all categories — LLMs plus image, video, music, and voice. Additional deprecated / superseded LLM IDs remain routable for backwards compatibility but are hidden from the catalog. Call GET /api/v1/models for the exact live list.

All prices shown are provider rates. BlockRun adds a 5% platform fee to cover infrastructure costs.

OpenAI GPT-5.5 Family

Released 2026-04-23 — first fully retrained base since GPT-4.5.

Model IDNameInput PriceOutput PriceContext
openai/gpt-5.5GPT-5.5$5.00/M$30.00/M1M

OpenAI GPT-5.4 Family

Model IDNameInput PriceOutput PriceContext
openai/gpt-5.4GPT-5.4$2.50/M$15.00/M1M
openai/gpt-5.4-proGPT-5.4 Pro$30.00/M$180.00/M1M
openai/gpt-5.4-miniGPT-5.4 Mini$0.75/M$4.50/M400K
openai/gpt-5.4-nanoGPT-5.4 Nano$0.05/M$0.40/M128K

OpenAI GPT-5 Family

Model IDNameInput PriceOutput PriceContext
openai/gpt-5.3GPT-5.3$2.00/M$12.00/M400K
openai/gpt-5.3-codexGPT-5.3 Codex$2.00/M$12.00/M400K
openai/gpt-5.2GPT-5.2$1.75/M$14.00/M400K
openai/gpt-5.2-proGPT-5.2 Pro$21.00/M$168.00/M400K
openai/gpt-5-miniGPT-5 Mini$0.25/M$2.00/M200K

OpenAI O-Series (Reasoning)

Model IDNameInput PriceOutput PriceContext
openai/o1o1$15.00/M$60.00/M200K
openai/o1-minio1-mini$1.10/M$4.40/M128K
openai/o3o3$2.00/M$8.00/M200K
openai/o3-minio3-mini$1.10/M$4.40/M128K

Anthropic Claude

Model IDNameInput PriceOutput PriceContext
anthropic/claude-opus-4.8Claude Opus 4.8$5.00/M$25.00/M1M
anthropic/claude-opus-4.7Claude Opus 4.7$5.00/M$25.00/M1M
anthropic/claude-opus-4.6Claude Opus 4.6$5.00/M$25.00/M1M
anthropic/claude-opus-4.5Claude Opus 4.5$5.00/M$25.00/M200K
anthropic/claude-sonnet-4.6Claude Sonnet 4.6$3.00/M$15.00/M200K
anthropic/claude-haiku-4.5Claude Haiku 4.5$1.00/M$5.00/M200K
Opus 4.7 / 4.8 behavior

These flagship models reject all sampling parameters (temperature, top_p, top_k); the gateway drops them so calls succeed. They use adaptive thinking (built-in, not API-configurable). The model may decline a request with HTTP 200 and stop_reason: "refusal" (finish_reason: "content_filter" on the OpenAI-compatible endpoint) — check the stop reason before reading content.

Google Gemini

Model IDNameInput PriceOutput PriceContext
google/gemini-3.1-proGemini 3.1 Pro$2.00/M$12.00/M1M
google/gemini-3-pro-previewGemini 3 Pro Preview$1.50/M$10.00/M1M
google/gemini-3-flash-previewGemini 3 Flash Preview$0.50/M$3.00/M1M
google/gemini-2.5-proGemini 2.5 Pro$1.25/M$10.00/M1M
google/gemini-2.5-flashGemini 2.5 Flash$0.30/M$2.50/M1M
google/gemini-3.1-flash-liteGemini 3.1 Flash Lite$0.10/M$0.40/M1M
google/gemini-2.5-flash-liteGemini 2.5 Flash Lite$0.10/M$0.40/M1M

DeepSeek

Model IDNameInput PriceOutput PriceContext
deepseek/deepseek-chatDeepSeek V3.2 Chat$0.28/M$0.42/M128K
deepseek/deepseek-reasonerDeepSeek V3.2 Reasoner$0.28/M$0.42/M128K

Z.AI

Model IDNameInput PriceOutput PriceContext
zai/glm-5.2GLM-5.2 (flagship)$1.40/M$4.40/M1M
zai/glm-5.1GLM-5.1$1.40/M$4.40/M200K
zai/glm-5GLM-5$0.60/M$1.92/M200K
zai/glm-5-turboGLM-5 Turbo$1.20/M$4.00/M200K

Moonshot

Model IDNameInput PriceOutput PriceContext
moonshot/kimi-k2.7Kimi K2.7 (flagship)$0.95/M$4.00/M256K
moonshot/kimi-k2.6Kimi K2.6$0.95/M$4.00/M256K
moonshot/kimi-k2.5Kimi K2.5 (legacy)$0.60/M$3.00/M262K

K2.7 is the current flagship — multi-modal (image + video + text input), 256K context, returns reasoning_content on completions. K2.6 is the previous-gen multi-modal model; K2.5 is still routable but superseded.

MiniMax

Model IDNameInput PriceOutput PriceContext
minimax/minimax-m2.7MiniMax M2.7$0.30/M$1.20/M204K

NVIDIA (Free Tier)

Apache 2.0 licensed open-weight models, hosted free by NVIDIA.

Model IDNameInput PriceOutput PriceContext
nvidia/gpt-oss-120bGPT-OSS 120BFREEFREE128K
nvidia/gpt-oss-20bGPT-OSS 20BFREEFREE128K
nvidia/kimi-k2.5Kimi K2.5 (NVIDIA)FREEFREE1M

Image Generation

Model IDNamePrice
openai/gpt-image-1GPT Image 1$0.02-0.04/image
openai/gpt-image-2ChatGPT Images 2.0$0.06-0.12/image
google/nano-bananaNano Banana$0.05/image
google/nano-banana-proNano Banana Pro$0.10-0.15/image

Video Generation

Seedance defaults to 720p with synced audio for text-to-video; pass resolution / generate_audio to override. See Video Generation API.

Model IDNamePrice (5s 720p default)
xai/grok-imagine-videoGrok Imagine Video$0.05/sec (8s = $0.40)
bytedance/seedance-1.5-proSeedance 1.5 Pro~$0.092/sec ($0.46 / 5s clip; flat)
bytedance/seedance-2.0-fastSeedance 2.0 Fast~$0.238/sec ($1.19 / 5s clip t2v; $0.70 with image input)
bytedance/seedance-2.0Seedance 2.0 Pro~$0.298/sec ($1.49 / 5s clip t2v; $0.91 with image input)

For character consistency across multiple Seedance videos, enroll a Virtual Portrait ($0.01 one-time, no KYC) for AI characters, or a RealFace ($0.01 one-time, no KYC, requires brief on-phone liveness check) for real people. Pass the returned ta_xxx as real_face_asset_id.

Model Categories

Each model includes a categories array in the API response. Categories indicate model capabilities:

  • chat - General conversation
  • reasoning - Complex problem-solving
  • coding - Code generation and analysis
  • vision - Image understanding

Filter models by category:

models = client.list_models()
reasoning_models = [m for m in models if "reasoning" in m.get("categories", [])]

Pricing

Prices are per 1 million tokens. Your actual cost depends on:

  1. Input tokens - Length of your prompt and context
  2. Output tokens - Length of the model's response
  3. Platform fee - 5% added to provider rates

The SDK calculates the exact price before each request.

Want to save 78% automatically? ClawRouter routes each request to the cheapest model that can handle it.

Example

What's next?