TTS Model Test



TTS Latency Test Methodology

TTS providers return audio in different ways: streaming chunk sizes vary, and so does the amount of silence that precedes meaningful audio. This tool makes the comparison fair by measuring the time to first byte, then running voice activity detection on the generated audio to measure the length of the leading silence. The sum of the two, the time to first real audio, is the delay users actually experience.
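The calculation can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: it assumes 16-bit mono little-endian PCM, and it substitutes a simple RMS energy threshold for a real voice activity detector. The function names, the 10 ms frame size, and the threshold of 500 are all hypothetical choices made for the example.

```python
import math
import struct


def leading_silence_ms(pcm: bytes, sample_rate: int,
                       frame_ms: int = 10, rms_threshold: float = 500.0) -> float:
    """Milliseconds of audio before the first frame whose RMS reaches the threshold.

    Assumes 16-bit mono little-endian PCM. An energy threshold stands in
    for a real voice activity detector here (an assumption of this sketch).
    """
    samples_per_frame = sample_rate * frame_ms // 1000
    n_samples = len(pcm) // 2
    samples = struct.unpack(f"<{n_samples}h", pcm[: n_samples * 2])
    for start in range(0, n_samples, samples_per_frame):
        frame = samples[start:start + samples_per_frame]
        rms = math.sqrt(sum(s * s for s in frame) / len(frame))
        if rms >= rms_threshold:
            return start * 1000.0 / sample_rate
    # No frame crossed the threshold: the whole clip counts as silence.
    return n_samples * 1000.0 / sample_rate


def time_to_first_real_audio_ms(ttfb_ms: float, pcm: bytes, sample_rate: int) -> float:
    """Time to first byte plus the leading silence in the returned audio."""
    return ttfb_ms + leading_silence_ms(pcm, sample_rate)


# Usage: 200 ms of silence followed by 100 ms of a 440 Hz tone at 16 kHz.
rate = 16000
silence = [0] * (rate * 200 // 1000)
tone = [int(10000 * math.sin(2 * math.pi * 440 * i / rate))
        for i in range(rate * 100 // 1000)]
values = silence + tone
pcm = struct.pack(f"<{len(values)}h", *values)

# With a measured time to first byte of 150 ms:
total = time_to_first_real_audio_ms(150.0, pcm, rate)
print(total)
```

Under these assumptions, a provider with a fast first byte but a long silent lead-in can still rank behind one that responds later and starts speaking immediately.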


ElevenLabs

Inworld

Inworld

Gemini

Inworld

Hume AI

Cartesia