API Gateway Active

UNIFIED
INTELLIGENCE.

[SYSTEM]: UnoRouter aggregates 45+ AI models through intelligent routing to deliver the fastest, cheapest inference API.

Get Started View Models

Models45

Global

Providers13+

Integrated

Uptime99.9%

SLA Guarantee

Tokens Served

3,192,845,072

Requests

98,418

Tokens / minLive

24,339

Claude Code

Works with Claude Code

Use UnoRouter as your Anthropic API backend. Set two environment variables and access all models from Claude Code.

View Guide

Codex CLI

Works with Codex CLI

Use UnoRouter as your OpenAI API backend. Fully compatible with the OpenAI SDK and Codex CLI.

View Guide

Gemini CLI

Works with Gemini CLI

Use UnoRouter as your Gemini API backend. Access all models through the Gemini CLI.

View Guide

OpenClaw

Works with OpenClaw

Use UnoRouter as a model provider for your OpenClaw agent. Configure once, access every model.

View Guide

Live Inference

kimi-k2.5

stable-diffusion-3-medium

claude-haiku-4-5-20251001

claude-opus-4-5

claude-opus-4-6

claude-opus-4-7

claude-sonnet-4-6

doubao-seedance-1-5-pro-251215

doubao-seedance-2-0-260128

doubao-seedance-2-0-fast-260128

doubao-seedream-4-5-251128

doubao-seedream-5-0-260128

flux-pro-max

flux-schnell

flux.1-kontext-pro

gemini-3-flash-preview

gemini-3-pro-image-preview

gemini-3-pro-preview

gemini-3.1-flash-image-preview

gemini-3.1-flash-lite-preview

gemini-3.1-pro-preview

glm-4.7

glm-5

glm-5.1

gpt-5.4

gpt-5.4-mini

gpt-5.4-nano

gpt-image-1.5

grok-imagine-video

grok-video-3

kling-motion-control

mimo-v2-flash

mimo-v2-omni

mimo-v2-pro

MiniMax-M2.5-highspeed

MiniMax-M2.7

MiniMax-M2.7-highspeed

sora-2

sora-2-pro

veo3.1-fast

veo3.1-pro

veo3.1-pro-4k

wan2.5-i2v-preview

wan2.6-i2v

z-image-turbo

kimi-k2.5

stable-diffusion-3-medium

claude-haiku-4-5-20251001

claude-opus-4-5

claude-opus-4-6

claude-opus-4-7

claude-sonnet-4-6

doubao-seedance-1-5-pro-251215

doubao-seedance-2-0-260128

doubao-seedance-2-0-fast-260128

doubao-seedream-4-5-251128

doubao-seedream-5-0-260128

flux-pro-max

flux-schnell

flux.1-kontext-pro

gemini-3-flash-preview

gemini-3-pro-image-preview

gemini-3-pro-preview

gemini-3.1-flash-image-preview

gemini-3.1-flash-lite-preview

gemini-3.1-pro-preview

glm-4.7

glm-5

glm-5.1

gpt-5.4

gpt-5.4-mini

gpt-5.4-nano

gpt-image-1.5

grok-imagine-video

grok-video-3

kling-motion-control

mimo-v2-flash

mimo-v2-omni

mimo-v2-pro

MiniMax-M2.5-highspeed

MiniMax-M2.7

MiniMax-M2.7-highspeed

sora-2

sora-2-pro

veo3.1-fast

veo3.1-pro

veo3.1-pro-4k

wan2.5-i2v-preview

wan2.6-i2v

z-image-turbo

kimi-k2.5

stable-diffusion-3-medium

claude-haiku-4-5-20251001

claude-opus-4-5

claude-opus-4-6

claude-opus-4-7

claude-sonnet-4-6

doubao-seedance-1-5-pro-251215

doubao-seedance-2-0-260128

doubao-seedance-2-0-fast-260128

doubao-seedream-4-5-251128

doubao-seedream-5-0-260128

flux-pro-max

flux-schnell

flux.1-kontext-pro

gemini-3-flash-preview

gemini-3-pro-image-preview

gemini-3-pro-preview

gemini-3.1-flash-image-preview

gemini-3.1-flash-lite-preview

gemini-3.1-pro-preview

glm-4.7

glm-5

glm-5.1

gpt-5.4

gpt-5.4-mini

gpt-5.4-nano

gpt-image-1.5

grok-imagine-video

grok-video-3

kling-motion-control

mimo-v2-flash

mimo-v2-omni

mimo-v2-pro

MiniMax-M2.5-highspeed

MiniMax-M2.7

MiniMax-M2.7-highspeed

sora-2

sora-2-pro

veo3.1-fast

veo3.1-pro

veo3.1-pro-4k

wan2.5-i2v-preview

wan2.6-i2v

z-image-turbo

TPS: 405.6

Pricing

Simple Pricing.
Pay only for what you use.

Subscribe for better value or pay as you go. All plans include access to every model with automatic failover across providers.

Pay As You Go$0

No minimum. Top up and use any model.

/v1/chat/completions

20$ 1 Month$20/mon

~$40 value. 2x Multiplier.

$10/week

50$ 1 Month$50/mon

~$100 value. 2x Multiplier.

$25/week

100$ 1 Month$100/mon

~$200 value. 2x Multiplier.

$50/week

Why UnoRouter

One endpoint, every AI model. Intelligent routing picks the fastest provider while automatic failover ensures your requests always succeed.

Automatic FailoverFailed requests retry on alternate channels instantly

Multi-Protocol SupportOpenAI, Anthropic, and Gemini API formats

Smart Load BalancingRoutes to fastest available provider automatically

Get Started View All Plans

Supported Providers

Active

Google9

gemini-3.1-pro-previewgemini-3.1-flash-lite-previewgemini-3-pro-preview+6

OpenAI6

gpt-5.4-nanogpt-5.4-minigpt-5.4+3

Anthropic5

claude-sonnet-4-6claude-opus-4-7claude-opus-4-6+2

ByteDance5

doubao-seedream-5-0-260128doubao-seedream-4-5-251128doubao-seedance-2-0-fast-260128+2

Flux3

flux.1-kontext-proflux-schnellflux-pro-max

Zhipu3

glm-5.1glm-5glm-4.7

Xiaomi3

mimo-v2-promimo-v2-omnimimo-v2-flash

MiniMax3

MiniMax-M2.7-highspeedMiniMax-M2.7MiniMax-M2.5-highspeed

Bailian3

z-image-turbowan2.6-i2vwan2.5-i2v-preview

xAI2

grok-video-3grok-imagine-video

Moonshot1

kimi-k2.5

Stability AI1

stable-diffusion-3-medium

Kling1

kling-motion-control

Request Flow

Live

POST /v1/chat/completions

Route to fastest provider

✓

Response delivered

Zero-Downtime Architecture

Always-On
Reliable AI

Your requests are routed through multiple providers simultaneously. If one goes down, another picks up instantly. No single point of failure.

Multi-Provider

Multiple providers with automatic failover. Your requests always find a working path, even during provider outages.

Smart Routing

Latency-based channel selection with priority weights. The fastest provider wins, every time.

Get Started View Documentation

Architecture

Active

Unified Endpoint

Single API for OpenAI, Anthropic, Gemini formats

Smart Routing

Selects fastest provider based on latency and availability

Automatic Failover

Failed requests retry on alternate channels instantly

✓

Always Delivered

99.9% uptime SLA across all models

Supported Formats

{ "openai": "/v1/chat/completions", "anthropic": "/v1/messages", "gemini": "/v1/models" }

All formats auto-detected and routed to the right provider.

Integrate in seconds.
Scale to infinity.

Fully OpenAI-compatible endpoints. Switch from any provider without changing your code. One API key, every model.

OpenAI-compatible endpoints

Automatic provider failover

Multi-protocol support

Read Full API Docs

bash

curl -X POST https://api.unorouter.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d {
    "model": "claude-haiku-4-5-20251001",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ],
    "stream": true
  }

Ready to Build?

Access every AI model through one endpoint. No vendor lock-in, no complexity.

Start Building Documentation

UNIFIED INTELLIGENCE.

Works with Claude Code

Works with Codex CLI

Works with Gemini CLI

Works with OpenClaw

Simple Pricing.Pay only for what you use.

Why UnoRouter

Always-OnReliable AI

Integrate in seconds.Scale to infinity.

Ready to Build?

UNIFIED
INTELLIGENCE.

Simple Pricing.
Pay only for what you use.

Always-On
Reliable AI

Integrate in seconds.
Scale to infinity.