Realtime speech-to-text transcription service. This WebSocket API enables streaming audio input and receiving transcription results.
## Event Flow
- Audio chunks are sent as `input_audio_chunk` messages
- Transcription results are streamed back in various formats (partial, committed, with timestamps)
- Supports manual commit or VAD-based automatic commit strategies
Authentication is done either by providing a valid API key in the `xi-api-key` header or by providing a valid token in the `token` query parameter. Tokens can be generated from the [single use token endpoint](/docs/api-reference/tokens/create). Use tokens if you want to transcribe audio from the client side.
inputAudioChunkobjectRequired
Audio data chunk sent from client to server for transcription.
sessionStartedobjectRequired
Sent when the transcription session is successfully started.
partialTranscriptobjectRequired
Interim transcription result that may change.
committedTranscriptobjectRequired
Committed transcription result that will not change.
committedTranscriptWithTimestampsobjectRequired
Committed transcription result with word-level timestamps.
scribeErrorobjectRequired
Error event during transcription.
scribeAuthErrorobjectRequired
Authentication error during transcription session.
scribeQuotaExceededErrorobjectRequired
Quota exceeded error during transcription session.
scribeThrottledErrorobjectRequired
Throttled error during transcription session.
scribeUnacceptedTermsErrorobjectRequired
Unaccepted terms error during transcription session.
scribeRateLimitedErrorobjectRequired
Rate limited error during transcription session.
scribeQueueOverflowErrorobjectRequired
Queue overflow error during transcription session.
scribeResourceExhaustedErrorobjectRequired
Resource exhausted error during transcription session.
scribeSessionTimeLimitExceededErrorobjectRequired
Session time limit exceeded error during transcription session.
scribeInputErrorobjectRequired
Input error during transcription session.
scribeChunkSizeExceededErrorobjectRequired
Chunk size exceeded error during transcription session.
scribeInsufficientAudioActivityErrorobjectRequired
Insufficient audio activity error during transcription session.
scribeTranscriberErrorobjectRequired
Transcriber error during transcription session.