Overview

Transcription is a core part of any voice conversation. Vocode lets developers configure transcription to suit their use case.

TranscriberConfig Class

The base TranscriberConfig class defines the fields shared by all transcriber implementations.

sampling_rate
int
required

The sampling rate, in Hz, of the audio to be transcribed.

audio_encoding
AudioEncoding
required

The encoding format of the audio to be transcribed.

chunk_size
int
required

The size of the audio chunks to be transcribed.

endpointing_config
Optional[EndpointingConfig]

Optional configuration that selects which endpointing implementation to use. Endpointing determines when a speaker has finished an utterance.
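To make the shape of these fields concrete, here is a minimal sketch that mirrors them with standard-library dataclasses. It is a stand-in for illustration, not Vocode's own class definition: the AudioEncoding members, the empty EndpointingConfig, and the example values are all assumptions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class AudioEncoding(str, Enum):
    # Hypothetical members; the real enum's values are not listed in this document.
    LINEAR16 = "linear16"
    MULAW = "mulaw"


@dataclass
class EndpointingConfig:
    # Placeholder: the endpointing fields are not documented in this section.
    pass


@dataclass
class TranscriberConfig:
    # The three required fields have no default, so omitting one raises TypeError.
    sampling_rate: int
    audio_encoding: AudioEncoding
    chunk_size: int
    # The only optional field on the base class.
    endpointing_config: Optional[EndpointingConfig] = None


# Example values are illustrative, not recommended settings.
config = TranscriberConfig(
    sampling_rate=16000,
    audio_encoding=AudioEncoding.LINEAR16,
    chunk_size=2048,
)
```

Leaving out endpointing_config is valid in this sketch because it is the only field without the required marker.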

Transcriber Implementations

DeepgramTranscriberConfig

model
Optional[str]

Specifies which Deepgram model to use for transcription.

should_warmup_model
bool

Whether to warm up the model before starting transcription.

version
Optional[str]

The version of the Deepgram transcription API to use. If not specified, the latest version will be used.
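As a rough illustration of how the Deepgram-specific fields sit on top of the base fields, the sketch below models both with dataclasses. The classes are simplified stand-ins rather than Vocode's definitions; the default of False for should_warmup_model and the model name are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TranscriberConfig:
    # Simplified stand-in for the base class; audio_encoding is a plain string here.
    sampling_rate: int
    audio_encoding: str
    chunk_size: int
    endpointing_config: Optional[object] = None


@dataclass
class DeepgramTranscriberConfig(TranscriberConfig):
    model: Optional[str] = None
    should_warmup_model: bool = False  # assumed default, not stated in this document
    version: Optional[str] = None  # None is read as "use the latest version"


deepgram_config = DeepgramTranscriberConfig(
    sampling_rate=16000,
    audio_encoding="linear16",
    chunk_size=2048,
    model="nova-2",  # illustrative model name
    should_warmup_model=True,
)
```

Here version is left unset, which corresponds to the documented behavior of falling back to the latest version.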

GoogleTranscriberConfig

model
Optional[str]

Specifies which Google model to use for transcription.

should_warmup_model
bool

Whether to warm up the model before starting transcription.

AssemblyAITranscriberConfig

should_warmup_model
bool

Whether to warm up the model before starting transcription.
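The three implementations differ only in which extra fields they add to the base class. The sketch below summarizes that with stand-in dataclasses and a hypothetical helper, provider_specific_fields, which is not part of Vocode. Field defaults are assumptions.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class TranscriberConfig:
    # Simplified stand-in for the shared base fields.
    sampling_rate: int
    audio_encoding: str
    chunk_size: int
    endpointing_config: Optional[object] = None


@dataclass
class DeepgramTranscriberConfig(TranscriberConfig):
    model: Optional[str] = None
    should_warmup_model: bool = False  # assumed default
    version: Optional[str] = None


@dataclass
class GoogleTranscriberConfig(TranscriberConfig):
    model: Optional[str] = None
    should_warmup_model: bool = False  # assumed default


@dataclass
class AssemblyAITranscriberConfig(TranscriberConfig):
    should_warmup_model: bool = False  # assumed default


def provider_specific_fields(config_cls: type) -> List[str]:
    """Return the field names a config class adds on top of the base class."""
    base_names = {f.name for f in fields(TranscriberConfig)}
    return [f.name for f in fields(config_cls) if f.name not in base_names]


deepgram_fields = provider_specific_fields(DeepgramTranscriberConfig)
google_fields = provider_specific_fields(GoogleTranscriberConfig)
assemblyai_fields = provider_specific_fields(AssemblyAITranscriberConfig)
```

Switching providers in this sketch means dropping any field the target class does not define, such as version when moving from Deepgram to Google.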