Data about a previous audio response from the model.
Base64 encoded audio bytes generated by the model, in the format specified in the request.
The Unix timestamp (in seconds) for when this audio response will no longer be accessible on the server for use in multi-turn conversations.
Unique identifier for this audio response.
Transcript of the audio generated by the model.