Overview

Transcription is a core part of any voice conversation. Vocode lets developers configure the transcriber to suit their use case.

TranscriberConfig Class

The base TranscriberConfig class defines several key fields that all transcriber implementations use.
sampling_rate (int, required): The sampling rate of the audio to be transcribed, in Hz.
audio_encoding (AudioEncoding, required): The encoding format of the audio to be transcribed.
chunk_size (int, required): The size of the audio chunks to be transcribed.
endpointing_config (Optional[EndpointingConfig]): Optional configuration that selects which endpointing implementation to use.
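
As a rough illustration of how the base fields fit together, the sketch below mirrors them with plain dataclasses. These classes are stand-ins written for this example, not Vocode's own definitions: the AudioEncoding member names, the empty EndpointingConfig, and the sample values (16000, 2048) are all assumptions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class AudioEncoding(str, Enum):
    # Stand-in enum; member names and values are assumptions for illustration.
    LINEAR16 = "linear16"
    MULAW = "mulaw"


@dataclass
class EndpointingConfig:
    # Stand-in; a real endpointing config would carry its own settings.
    pass


@dataclass
class TranscriberConfig:
    # The three required fields have no defaults, so omitting any of them
    # raises a TypeError at construction time.
    sampling_rate: int
    audio_encoding: AudioEncoding
    chunk_size: int
    # Optional field: None means no endpointing configuration was supplied.
    endpointing_config: Optional[EndpointingConfig] = None


# Illustrative values only; pick ones that match your audio source.
config = TranscriberConfig(
    sampling_rate=16000,
    audio_encoding=AudioEncoding.LINEAR16,
    chunk_size=2048,
)
```

In this sketch the required fields are enforced simply by leaving them without defaults, which matches the required/optional split described above.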

Transcriber Implementations

DeepgramTranscriberConfig

model (Optional[str]): Specifies which Deepgram model to use for transcription.
should_warmup_model (bool): Whether to warm up the model before starting transcription.
version (Optional[str]): The version of the Deepgram transcription API to use. If not specified, the latest version will be used.
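
The Deepgram-specific fields can be pictured with a small stand-in class. This is a hypothetical sketch, not the library's DeepgramTranscriberConfig: the class name DeepgramOptions, the describe helper, the False default for should_warmup_model, and the sample model and version strings are assumptions. Only the "unset version means latest" behavior comes from the description above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeepgramOptions:
    # Hypothetical stand-in holding only the Deepgram-specific fields.
    model: Optional[str] = None
    should_warmup_model: bool = False  # default value is an assumption
    version: Optional[str] = None      # None: the latest version is used


def describe(options: DeepgramOptions) -> str:
    """Summarize the options, spelling out what an unset field means."""
    model = options.model or "unspecified"
    version = options.version or "latest"
    return f"model={model}, version={version}, warmup={options.should_warmup_model}"


# Placeholder model and version strings, chosen only for illustration.
pinned = DeepgramOptions(model="nova", version="1", should_warmup_model=True)
unpinned = DeepgramOptions()
```

Pinning version trades automatic upgrades for reproducible behavior; leaving it unset follows whatever version is latest.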

GoogleTranscriberConfig

model (Optional[str]): Specifies which Google model to use for transcription.
should_warmup_model (bool): Whether to warm up the model before starting transcription.

AssemblyAITranscriberConfig

should_warmup_model (bool): Whether to warm up the model before starting transcription.
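
Since the three implementations accept different provider-specific fields, it can help to see them side by side. The sketch below records the fields listed on this page in a lookup table and flags options a given provider does not document. The provider keys and the unsupported_fields helper are hypothetical names for this example, not part of Vocode.

```python
from typing import Dict, FrozenSet, Iterable, Set

# Provider-specific fields as listed on this page (base fields excluded).
# The dictionary keys are illustrative labels, not library identifiers.
PROVIDER_FIELDS: Dict[str, FrozenSet[str]] = {
    "deepgram": frozenset({"model", "should_warmup_model", "version"}),
    "google": frozenset({"model", "should_warmup_model"}),
    "assemblyai": frozenset({"should_warmup_model"}),
}


def unsupported_fields(provider: str, fields: Iterable[str]) -> Set[str]:
    """Return the requested fields that the provider does not list."""
    return set(fields) - PROVIDER_FIELDS[provider]


# "model" is flagged because AssemblyAI lists only should_warmup_model.
extra = unsupported_fields("assemblyai", ["model", "should_warmup_model"])
```

A check like this is mainly useful when configuration arrives from an external source, such as a settings file, and needs validating before a config object is built.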