OpenAI Released August 14, 2025 Synced Apr 19, 2026

OpenAI: GPT-4o Audio

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

Tool useAudio

Why it stands out

128K-token context window handles longer documents and multi-turn conversations without truncation.

Tool use support makes it viable for function-calling and agentic pipelines.

Multimodal input (audio) extends it beyond text-only workflows.

What to watch

No benchmark score currently tracked — evaluate using task-specific testing alongside pricing and capability data.

Release timeline

Tracked events for OpenAI: GPT-4o Audio.

Back to model tracker

release

OpenAI: GPT-4o Audio entered the tracked catalog

August 14, 2025

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

Nearby alternatives

Other OpenAI models worth checking.

Need a recommendation instead?

OpenAI: GPT-5.4 Nano

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. I

Context 400,000

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports

Context 400,000

OpenAI: GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, hi

Context 1,050,000

OpenAI: GPT-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (92

Context 1,050,000

Recent changes

LaunchAug 14

OpenAI launched OpenAI: GPT-4o Audio

Compare

See how OpenAI: GPT-4o Audio stacks up.

All comparisons

vs Anthropic: Claude 3.5 Sonnet

Side-by-side pricing, context, and capabilities

vs Anthropic: Claude 3.7 Sonnet (thinking)

Side-by-side pricing, context, and capabilities

vs Anthropic: Claude 3.7 Sonnet

Side-by-side pricing, context, and capabilities

vs Anthropic: Claude Opus 4.1

Side-by-side pricing, context, and capabilities

vs Anthropic: Claude Opus 4.5

Side-by-side pricing, context, and capabilities

vs Anthropic: Claude Opus 4.6

Side-by-side pricing, context, and capabilities