Skip to content

Generates audio from the input text.

POST
/audio/speech

Authorizations

Request Body required

object
model
required
Any of:
string
input
required

The text to generate audio for. The maximum length is 4096 characters.

string
<= 4096 characters
voice
required

The voice to use when generating the audio. Supported voices are alloy, echo, fable, onyx, nova, and shimmer. Previews of the voices are available in the Text to speech guide.

string
Allowed values: alloy echo fable onyx nova shimmer
response_format

The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.

string
default: mp3
Allowed values: mp3 opus aac flac wav pcm
speed

The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

number
default: 1 >= 0.25 <= 4

Responses

200

OK

string format: binary

Headers

Transfer-Encoding
string

Chunked