Transparent model intelligence for creators. Live benchmarks, free model tracking, and production routing across 352 models from 56 providers. Updated hourly from OpenRouter.
Search, filter, and compare 352 models with real-time pricing from OpenRouter.
Showing 50 of 352 models | 30 free
ai21
Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...
aion-labs
Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...
aion-labs
Aion-1.0-Mini is a 32B-parameter model distilled from DeepSeek-R1, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...
aion-labs
Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is particularly strong at introducing tension, crises, and conflict into stories, making narratives feel more engaging....
aion-labs
Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...
alfredpros
A 7-billion-parameter Code Llama - Instruct model fine-tuned to generate Solidity smart contracts, using 4-bit QLoRA fine-tuning via the PEFT library.
allenai
OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning and instruction-following tasks across diverse benchmarks such as GSM8K,...
allenai
Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...
allenai
Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language model engineered for high-performance conversational AI, multi-turn dialogue, and practical instruction following. As part of the Olmo 3.1 family, this...
amazon
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...
amazon
Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon, focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...
amazon
Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...
amazon
Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.
amazon
Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...
anthropic
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal
anthropic
Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...
anthropic
Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...
anthropic
Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...
anthropic
Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...
anthropic
Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...
anthropic
Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...
anthropic
Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...
anthropic
Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective...
anthropic
Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6): identical capabilities with higher output speed at a 6x pricing premium. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode
anthropic
Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...
anthropic
Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...
anthropic
Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with...
arcee-ai
Coder-Large is a 32B-parameter offspring of Qwen 2.5-Instruct that has been further trained on permissively licensed GitHub, CodeSearchNet, and synthetic bug-fix corpora. It supports a 32k context window, enabling multi-file...
arcee-ai
Maestro Reasoning is Arcee's flagship analysis model: a 32B-parameter derivative of Qwen 2.5-32B tuned with DPO and chain-of-thought RL for step-by-step logic. Compared to the earlier 7B...
arcee-ai
Spotlight is a 7-billion-parameter vision-language model derived from Qwen 2.5-VL and fine-tuned by Arcee AI for tight image-text grounding tasks. It offers a 32k-token context window, enabling rich multimodal...
arcee-ai
Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-parameter sparse Mixture-of-Experts with 13B active parameters per token using 4-of-256 expert routing. It excels in creative writing,...
arcee-ai
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7
arcee-ai
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...
arcee-ai
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...
arcee-ai
Virtuoso-Large is Arcee's top-tier general-purpose LLM at 72B parameters, tuned to tackle cross-domain reasoning, creative writing, and enterprise QA. Unlike many 70B peers, it retains the 128k...
openrouter
Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...
baidu
A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...
baidu
ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.
baidu
ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in...
baidu
A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing....
baidu
ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...
openrouter
Transform your natural language requests into structured OpenRouter API request objects. Describe what you want to accomplish with AI models, and Body Builder will construct the appropriate API calls. Example:...
bytedance-seed
Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates multimodal capabilities and adaptive deep thinking with a 256K context window.
bytedance-seed
Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporting both text and visual understanding. It features a 256k context window and can generate outputs of...
bytedance-seed
Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimodal and agent capabilities while offering noticeably lower latency, making it a practical default choice for most production workloads across...
bytedance-seed
Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...
bytedance
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
cohere
Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...
cohere
command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...
cohere
command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...
30 free models available in current view
These models are available at zero cost through Zen routing. No API key required. Updated weekly.
MiniMax
Alibaba
Xiaomi
Moonshot
Zhipu
Zhipu
Meta
OpenAI
NVIDIA
Every model we track, sorted by SWE-Bench Verified score. Pricing is per million tokens.
| # | Model | Provider | Context | SWE-Bench | Input | Output | Speed | Category |
|---|---|---|---|---|---|---|---|---|
| 1 | 🟤 Claude Opus 4 | Anthropic | 200K | 90% | $15 | $75 | 40 t/s | frontier |
| 2 | 🟣 MiniMax M2.5 (free) | MiniMax | 200K | 80.2% | Free | Free | 75 t/s | free tier |
| 3 | 🟠 Qwen 3.6 Plus (free) | Alibaba | 1M | 78.8% | Free | Free | 85 t/s | free tier |
| 4 | 🔶 MiMo V2 Pro (free) | Xiaomi | 1M | 78% | Free | Free | 80 t/s | free tier |
| 5 | 🌙 Kimi K2.5 (free) | Moonshot | 260K | 76.8% | Free | Free | 80 t/s | free tier |
| 6 | 🟡 GLM 4.7 (free) | Zhipu | 200K | 73.8% | Free | Free | 70 t/s | free tier |
| 7 | 🟤 Claude Sonnet 4 | Anthropic | 200K | 72.7% | $3 | $15 | 80 t/s | frontier |
| 8 | 🟡 Big Pickle (GLM-4.6) (free) | Zhipu | 200K | 70% | Free | Free | 65 t/s | free tier |
| 9 | 🦙 Llama 4 Maverick (free) | Meta | 1M | 50% | Free | Free | 70 t/s | open source |
| 10 | 🔵 Gemini 2.0 Pro | Google | 1M | 48% | $1.25 | $5 | 70 t/s | frontier |
| 11 | 🐋 DeepSeek V3 | DeepSeek | 64K | 42% | $0.27 | $1.1 | 60 t/s | open source |
| 12 | 🟤 Claude Haiku 3.5 | Anthropic | 200K | 40.6% | $0.25 | $1.25 | 150 t/s | frontier |
| 13 | 🟢 GPT-4o | OpenAI | 128K | 38.4% | $2.5 | $10 | 90 t/s | frontier |
| 14 | 🔵 Gemini 2.0 Flash | Google | 1M | 33% | $0.075 | $0.3 | 160 t/s | frontier |
| 15 | 🟢 GPT-4o Mini | OpenAI | 128K | 23.7% | $0.15 | $0.6 | 130 t/s | frontier |
| 16 | 🟢 GPT-5 Nano (free) | OpenAI | 128K | -- | Free | Free | 200 t/s | free tier |
| 17 | 💚 Nemotron 3 Super (free) | NVIDIA | 1M | -- | Free | Free | 90 t/s | free tier |
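The ordering above (SWE-Bench Verified descending, with unscored "--" models pushed to the bottom) can be sketched in a few lines; the model list here is a small excerpt from the table:

```python
# Rank models by SWE-Bench Verified score, descending; models with no
# published score (None, shown as "--" in the table) sort last.
models = [
    ("Claude Opus 4", 90.0),
    ("GPT-5 Nano", None),      # no published score
    ("MiniMax M2.5", 80.2),
    ("Claude Sonnet 4", 72.7),
]

# Sort key: (has-no-score, negated score) -- False sorts before True,
# so scored models come first, highest score first.
ranked = sorted(models, key=lambda m: (m[1] is None, -(m[1] or 0)))
print([name for name, _ in ranked])
# → ['Claude Opus 4', 'MiniMax M2.5', 'Claude Sonnet 4', 'GPT-5 Nano']
```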
Estimate monthly costs for any use case with real pricing from OpenRouter. Choose a preset or enter custom token counts to compare models side by side.
~500 input + 200 output tokens per request
3,000 requests/month
30 free models available for this use case: Google: Gemma 4 26B A4B (free), Google: Gemma 4 31B (free), Qwen: Qwen3.6 Plus (free), Google: Lyria 3 Pro Preview, Google: Lyria 3 Clip Preview, NVIDIA: Nemotron 3 Super (free), MiniMax: MiniMax M2.5 (free), Free Models Router, StepFun: Step 3.5 Flash (free), Arcee AI: Trinity Large Preview (free), LiquidAI: LFM2.5-1.2B-Thinking (free), LiquidAI: LFM2.5-1.2B-Instruct (free), NVIDIA: Nemotron 3 Nano 30B A3B (free), Arcee AI: Trinity Mini (free), NVIDIA: Nemotron Nano 12B 2 VL (free), Qwen: Qwen3 Next 80B A3B Instruct (free), NVIDIA: Nemotron Nano 9B V2 (free), OpenAI: gpt-oss-120b (free), OpenAI: gpt-oss-20b (free), Z.ai: GLM 4.5 Air (free), Qwen: Qwen3 Coder 480B A35B (free), Venice: Uncensored (free), Google: Gemma 3n 2B (free), Google: Gemma 3n 4B (free), Google: Gemma 3 4B (free), Google: Gemma 3 12B (free), Google: Gemma 3 27B (free), Meta: Llama 3.3 70B Instruct (free), Meta: Llama 3.2 3B Instruct (free), Nous: Hermes 3 405B Instruct (free)
| # | Model | Provider | Monthly | Per Request |
|---|---|---|---|---|
| 1 | Body Builder (beta) | openrouter | <$0.01 | <$0.0001 |
| 2 | Auto Router | openrouter | <$0.01 | <$0.0001 |
| 3 | Google: Gemma 3n 4B | google | $0.05 | <$0.0001 |
| 4 | Mistral: Mistral Nemo | mistralai | $0.05 | <$0.0001 |
| 5 | Meta: Llama 3.1 8B Instruct | meta-llama | $0.06 | <$0.0001 |
| 6 | Llama Guard 3 8B | meta-llama | $0.07 | <$0.0001 |
| 7 | Meta: Llama 3 8B Instruct | meta-llama | $0.07 | <$0.0001 |
| 8 | Sao10K: Llama 3 8B Lunaris | sao10k | $0.09 | <$0.0001 |
| 9 | IBM: Granite 4.0 Micro | ibm-granite | $0.10 | <$0.0001 |
| 10 | Qwen: Qwen2.5 Coder 7B Instruct | qwen | $0.10 | <$0.0001 |
For 100 chat app requests/day, the cheapest paid model is Body Builder (beta) at <$0.01/month. 30 models are free.
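The calculator's arithmetic is linear in request volume and per-million-token prices. A minimal sketch of the estimate, using the chat-app preset above and GPT-4o Mini's listed $0.15/$0.60 rates (the `monthly_cost` helper is illustrative, not the site's actual code):

```python
def monthly_cost(requests_per_month, input_tokens, output_tokens,
                 input_price_per_m, output_price_per_m):
    """Estimate monthly spend from per-million-token pricing."""
    per_request = (input_tokens * input_price_per_m
                   + output_tokens * output_price_per_m) / 1_000_000
    return per_request * requests_per_month

# Chat-app preset: ~500 input + 200 output tokens, 3,000 requests/month,
# at GPT-4o Mini's listed $0.15 / $0.60 per million tokens.
print(round(monthly_cost(3000, 500, 200, 0.15, 0.60), 4))  # → 0.585
```

At these volumes even a mid-priced model stays under a dollar a month, which is why the top of the ranking is dominated by sub-$0.10 options and free models.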
How Arcanea routes models to specialized agents in production. Each agent is assigned to an Arcanean Gate with a primary model and fallback chain.
Orchestrator needs relentless persistence and 1M context to hold the full project state. Qwen 3.6 Plus offers the best free agentic reasoning with massive context.
Coder needs the highest SWE-Bench score available. MiniMax M2.5 leads at 80.2% — the divine forge of code.
Architecture decisions require deep reasoning over complex systems. Big Pickle excels at slow, deliberate analysis — seeing the whole picture.
Research demands vast context for ingesting papers, docs, and codebases. 1M context + strong reasoning makes Qwen the fire-bringer of knowledge.
Strategy and planning need long-context reasoning to weigh trade-offs across the entire system. Qwen delivers wisdom at scale.
Code review requires deep comprehension of implementation patterns. M2.5 at 80.2% SWE-Bench catches what others miss — the honest critic.
Coordination and frontend integration need broad context and strong UI/UX understanding. Kimi K2.5 carries the world of integrations.
Documentation and research benefit from strong multilingual capabilities. GLM 4.7 excels at structured knowledge extraction.
Navigation and exploration need speed above all else. GPT-5 Nano is the fastest free model — instant wayfinding through the codebase.
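The primary-model-plus-fallback-chain pattern described above can be sketched as follows; the agent names, model IDs, and the `call_model` callback are illustrative assumptions, not Arcanea's actual implementation:

```python
# Hypothetical sketch of "primary model + fallback chain" routing.
# Chains list the primary model first, then fallbacks in preference order.
FALLBACK_CHAINS = {
    "orchestrator": ["qwen/qwen3.6-plus", "xiaomi/mimo-v2-pro"],
    "coder":        ["minimax/m2.5", "qwen/qwen3.6-plus", "zhipu/glm-4.7"],
    "scout":        ["openai/gpt-5-nano", "zhipu/glm-4.7"],
}

def route(agent: str, prompt: str, call_model) -> str:
    """Try the agent's primary model, then each fallback in order."""
    errors = []
    for model_id in FALLBACK_CHAINS[agent]:
        try:
            return call_model(model_id, prompt)
        except Exception as exc:  # rate limit, provider outage, timeout...
            errors.append((model_id, exc))
    raise RuntimeError(f"all models in the {agent} chain failed: {errors}")
```

The design choice here is simple sequential failover: an agent only degrades to a fallback when its primary model errors out, so routing stays deterministic and easy to audit.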
Strengths, weaknesses, and recommended use cases for the top models.
Track changes to the model roster, free tier availability, and routing decisions.
All Zen-routed models remain free this week. MiniMax M2.5 continues to lead SWE-Bench at 80.2%. Qwen 3.6 Plus remains the recommended orchestrator model for its 1M context + 78.8% SWE-Bench combination. MiMo V2 Pro is gaining traction as a strong 1M-context alternative. Image generation models are now tracked in the Arena — 8 models across frontier, open-source, and specialized categories.
Compare FLUX.2, Grok Image, DALL-E 3, Stable Diffusion, and more. Pricing, speed, text rendering quality, and Arcanea pipeline routing.
Every model in the Arena is available through Arcanea. Free models run on Zen routing. Premium models run through your own API keys.