The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
Modalities
In / Out Price
$2.50 / $10per 1M
Context
128K
Weekly Rank
#346on OpenRouter
Knowledge Cutoff
Oct 31, 2023