The text-to-speech and speech-to-text tools are all based on GPT-4o. OpenAI hinted it may take a similar path with video.
OpenAI offers new text-to-speech and speech-to-text models in the API. These are designed to outperform Whisper.