Skip to content
AI Viewer
OpenAI Released August 14, 2025 Synced Apr 19, 2026

OpenAI: GPT-4o Audio

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

Tool useAudio

Why it stands out

128K-token context window handles longer documents and multi-turn conversations without truncation.
Tool use support makes it viable for function-calling and agentic pipelines.
Multimodal input (audio) extends it beyond text-only workflows.

What to watch

No benchmark score currently tracked — evaluate using task-specific testing alongside pricing and capability data.

Release timeline

Tracked events for OpenAI: GPT-4o Audio.

Back to model tracker

release

OpenAI: GPT-4o Audio entered the tracked catalog

August 14, 2025

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

View source

Nearby alternatives

Other OpenAI models worth checking.

Need a recommendation instead?

Recent changes

LaunchAug 14

OpenAI launched OpenAI: GPT-4o Audio

Compare

See how OpenAI: GPT-4o Audio stacks up.

All comparisons