Skip to content
AI Viewer
Live catalog

Track what changed across the frontier model market.

Follow launches, pricing, context windows, and benchmark snapshots without mixing everything into a fake universal score.

Release tracking Pricing snapshots Capability coverage Benchmark context
Fresh data: Catalog synced Mar 10, 2026 View all changes

Tracked models

37

Models actively covered in the live catalog.

Providers

7

Major labs and platforms tracked in one place.

Recent launches

16

Models first seen in the last 30 days.

Cheapest input

$0.100/M

Lowest listed input rate per million tokens.

Coverage

Filter the market by provider.

Use these filters to isolate releases and model cards by lab. The dataset is refreshed from structured sources and benchmark rows stay clearly labeled.

Last sync: Mar 10, 2026

Methodology

Fresh facts, explicit limits.

OpenRouter API

Primary source for pricing, context windows, and capability flags across providers.

AIViewer editorial layer

Strengths, watchouts, and contextual summaries written by humans, not auto-generated rankings.

AIViewer keeps pricing, capabilities, and benchmark rows separate so the data stays interpretable.

Latest releases

What changed most recently.

Active filter: All providers

OpenAI

OpenAI: GPT-5.4 Pro

Mar 5, 2026

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token...

Context: 1,050,000 tokens Input: $30/M Output: $180/M
OpenAI

OpenAI: GPT-5.4

Mar 5, 2026

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for ...

Context: 1,050,000 tokens Input: $2.50/M Output: $15/M
OpenAI

OpenAI: GPT-5.3 Chat

Mar 3, 2026

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with be...

Context: 128,000 tokens Input: $1.75/M Output: $14/M
Google

Google: Gemini 3.1 Flash Lite Preview

Mar 3, 2026

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2...

Context: 1,048,576 tokens Input: $0.250/M Output: $1.50/M
Google

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Feb 26, 2026

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. ...

Context: 65,536 tokens Input: $0.500/M Output: $3.00/M
Qwen

Qwen: Qwen3.5-35B-A3B

Feb 25, 2026

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, ...

Context: 262,144 tokens Input: $0.163/M Output: $1.30/M

Tracked models

Catalog view for decision-making.

Showing 37 models across all providers.

Benchmarked models: 0
OpenAI

OpenAI: GPT-5.4 Pro

Mar 5, 2026

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922...

Context
1,050,000 tokens
Benchmark
Not listed yet
Input
$30/M
Output
$180/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.4

Mar 5, 2026

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image input...

Context
1,050,000 tokens
Benchmark
Not listed yet
Input
$2.50/M
Output
$15/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.3 Chat

Mar 3, 2026

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualizati...

Context
128,000 tokens
Benchmark
Not listed yet
Input
$1.75/M
Output
$14/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
Google

Google: Gemini 3.1 Flash Lite Preview

Mar 3, 2026

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance...

Context
1,048,576 tokens
Benchmark
Not listed yet
Input
$0.250/M
Output
$1.50/M
Tool useVisionAudioReasoning
Last refreshed Mar 10, 2026
Open model page
Google

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Feb 26, 2026

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced...

Context
65,536 tokens
Benchmark
Not listed yet
Input
$0.500/M
Output
$3.00/M
VisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5-35B-A3B

Feb 25, 2026

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inf...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.163/M
Output
$1.30/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5-27B

Feb 25, 2026

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities a...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.195/M
Output
$1.56/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5-122B-A10B

Feb 25, 2026

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference eff...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.260/M
Output
$2.08/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5-Flash

Feb 25, 2026

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference effic...

Context
1,000,000 tokens
Benchmark
Not listed yet
Input
$0.100/M
Output
$0.400/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Google

Google: Gemini 3.1 Pro Preview Custom Tools

Feb 25, 2026

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party or user-defined fu...

Context
1,048,576 tokens
Benchmark
Not listed yet
Input
$2.00/M
Output
$12/M
Tool useVisionAudioReasoning
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.3-Codex

Feb 24, 2026

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilitie...

Context
400,000 tokens
Benchmark
Not listed yet
Input
$1.75/M
Output
$14/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Google

Google: Gemini 3.1 Pro Preview

Feb 19, 2026

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows...

Context
1,048,576 tokens
Benchmark
Not listed yet
Input
$2.00/M
Output
$12/M
Tool useVisionAudioReasoning
Last refreshed Mar 10, 2026
Open model page
Anthropic

Anthropic: Claude Sonnet 4.6

Feb 17, 2026

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, ...

Context
1,000,000 tokens
Benchmark
Not listed yet
Input
$3.00/M
Output
$15/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5 Plus 2026-02-15

Feb 16, 2026

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference e...

Context
1,000,000 tokens
Benchmark
Not listed yet
Input
$0.260/M
Output
$1.56/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3.5 397B A17B

Feb 16, 2026

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher infere...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.390/M
Output
$2.34/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3 Max Thinking

Feb 9, 2026

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and re...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.780/M
Output
$3.90/M
Tool useReasoning
Last refreshed Mar 10, 2026
Open model page
Anthropic

Anthropic: Claude Opus 4.6

Feb 4, 2026

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially eff...

Context
1,000,000 tokens
Benchmark
Not listed yet
Input
$5.00/M
Output
$25/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Qwen

Qwen: Qwen3 Coder Next

Feb 4, 2026

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per to...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.120/M
Output
$0.750/M
Tool use
Last refreshed Mar 10, 2026
Open model page
Kimi / Moonshot AI

MoonshotAI: Kimi K2.5

Jan 27, 2026

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over appr...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.450/M
Output
$2.20/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT Audio

Jan 19, 2026

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is p...

Context
128,000 tokens
Benchmark
Not listed yet
Input
$2.50/M
Output
$10/M
Audio
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT Audio Mini

Jan 19, 2026

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million token...

Context
128,000 tokens
Benchmark
Not listed yet
Input
$0.600/M
Output
$2.40/M
Audio
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.2-Codex

Jan 14, 2026

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution ...

Context
400,000 tokens
Benchmark
Not listed yet
Input
$1.75/M
Output
$14/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Google

Google: Gemini 3 Flash Preview

Dec 17, 2025

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool use performance ...

Context
1,048,576 tokens
Benchmark
Not listed yet
Input
$0.500/M
Output
$3.00/M
Tool useVisionAudioReasoning
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Mistral Small Creative

Dec 16, 2025

Mistral Small Creative is an experimental small model designed for creative writing, narrative generation, roleplay and character-driven dialogue, general-purpose instruction following, and conversati...

Context
32,768 tokens
Benchmark
Not listed yet
Input
$0.100/M
Output
$0.300/M
Tool use
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.2 Chat

Dec 10, 2025

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “thi...

Context
128,000 tokens
Benchmark
Not listed yet
Input
$1.75/M
Output
$14/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.2 Pro

Dec 10, 2025

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reas...

Context
400,000 tokens
Benchmark
Not listed yet
Input
$21/M
Output
$168/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.2

Dec 10, 2025

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamicall...

Context
400,000 tokens
Benchmark
Not listed yet
Input
$1.75/M
Output
$14/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Devstral 2 2512

Dec 9, 2025

Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports e...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.400/M
Output
$2.00/M
Tool use
Last refreshed Mar 10, 2026
Open model page
OpenAI

OpenAI: GPT-5.1-Codex-Max

Dec 4, 2025

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained ...

Context
400,000 tokens
Benchmark
Not listed yet
Input
$1.25/M
Output
$10/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Ministral 3 14B 2512

Dec 2, 2025

The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language ...

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.200/M
Output
$0.200/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Ministral 3 8B 2512

Dec 2, 2025

A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language model with vision capabilities....

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.150/M
Output
$0.150/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Ministral 3 3B 2512

Dec 2, 2025

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities....

Context
131,072 tokens
Benchmark
Not listed yet
Input
$0.100/M
Output
$0.100/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
Mistral

Mistral: Mistral Large 3 2512

Dec 1, 2025

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license....

Context
262,144 tokens
Benchmark
Not listed yet
Input
$0.500/M
Output
$1.50/M
Tool useVision
Last refreshed Mar 10, 2026
Open model page
DeepSeek

DeepSeek: DeepSeek V3.2 Speciale

Dec 1, 2025

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context proce...

Context
163,840 tokens
Benchmark
Not listed yet
Input
$0.400/M
Output
$1.20/M
Reasoning
Last refreshed Mar 10, 2026
Open model page
DeepSeek

DeepSeek: DeepSeek V3.2

Dec 1, 2025

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fin...

Context
163,840 tokens
Benchmark
Not listed yet
Input
$0.250/M
Output
$0.400/M
Tool useReasoning
Last refreshed Mar 10, 2026
Open model page
Anthropic

Anthropic: Claude Opus 4.5

Nov 24, 2025

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competit...

Context
200,000 tokens
Benchmark
Not listed yet
Input
$5.00/M
Output
$25/M
Tool useVisionReasoning
Last refreshed Mar 10, 2026
Open model page
Google

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

Nov 20, 2025

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world gr...

Context
65,536 tokens
Benchmark
Not listed yet
Input
$2.00/M
Output
$12/M
Capability flags pending
Last refreshed Mar 10, 2026
Open model page

Newsletter

Stay ahead of the AI curve.

One email per week. No spam, no hype — just the most useful AI developments, tools, and tactics.