Interface: CustomRealtimeModelConfig

Defined in: packages/sdk/src/types/model-config-types.ts:93

Base configuration for all realtime models. Contains common fields that apply to all realtime model providers.

Extends

BaseRealtimeModelConfig

Properties

inferenceModel

inferenceModel: string

Defined in: packages/sdk/src/types/model-config-types.ts:97

inferenceModelConfig?

optional inferenceModelConfig: ChatModelConfig | null

Defined in: packages/sdk/src/types/model-config-types.ts:98

interruptionSound?

optional interruptionSound: InterruptionSound | null

Defined in: packages/sdk/src/types/model-config-types.ts:69

Sound to play when user stops speaking after interrupting the assistant. Provides immediate auditory feedback that input was received. Default: none.

Inherited from

BaseRealtimeModelConfig.interruptionSound

interruptionSoundVolume?

optional interruptionSoundVolume: number | null

Defined in: packages/sdk/src/types/model-config-types.ts:71

Volume level for the interruption sound (0-100). 0 means muted. Missing or null means full volume (100).

Inherited from

BaseRealtimeModelConfig.interruptionSoundVolume

sttConfig?

optional sttConfig: Record<string, unknown> | null

Defined in: packages/sdk/src/types/model-config-types.ts:95

sttModel?

optional sttModel: string | null

Defined in: packages/sdk/src/types/model-config-types.ts:94

thinkingSound?

optional thinkingSound: ThinkingSound | null

Defined in: packages/sdk/src/types/model-config-types.ts:61

Sound to play while the agent is processing tool calls.

Inherited from

BaseRealtimeModelConfig.thinkingSound

thinkingSoundInitialDelay?

optional thinkingSoundInitialDelay: number | null

Defined in: packages/sdk/src/types/model-config-types.ts:65

Initial delay in milliseconds before playing thinking sound. Prevents needless interruption on short tool calls. Default: 500ms.

Inherited from

BaseRealtimeModelConfig.thinkingSoundInitialDelay

thinkingSoundVolume?

optional thinkingSoundVolume: number | null

Defined in: packages/sdk/src/types/model-config-types.ts:63

Volume level for the thinking sound (0-100). 0 means muted. Missing or null means full volume (100).

Inherited from

BaseRealtimeModelConfig.thinkingSoundVolume

thinkingSpeechPhrases?

optional thinkingSpeechPhrases: string[] | null

Defined in: packages/sdk/src/types/model-config-types.ts:67

When thinkingSound is 'speech', one of these phrases is spoken while the agent runs a tool. Phrases rotate round-robin per session.

Inherited from

BaseRealtimeModelConfig.thinkingSpeechPhrases

ttsModel?

optional ttsModel: TTSModel | null

Defined in: packages/sdk/src/types/model-config-types.ts:73

TTS model ID in provider:tts:model format (e.g. 'elevenlabs:tts:eleven_flash_v2_5'), or 'native'.

Inherited from

BaseRealtimeModelConfig.ttsModel

voiceModel?

optional voiceModel: string | null

Defined in: packages/sdk/src/types/model-config-types.ts:75

Canonical voice identifier (e.g., elevenlabs:voice:catt) that selects the specific agent voice.

Inherited from

BaseRealtimeModelConfig.voiceModel

Extends​

Properties​

inferenceModel​

inferenceModelConfig?​

interruptionSound?​

Inherited from​

interruptionSoundVolume?​

Inherited from​

sttConfig?​

sttModel?​

thinkingSound?​

Inherited from​

thinkingSoundInitialDelay?​

Inherited from​

thinkingSoundVolume?​

Inherited from​

thinkingSpeechPhrases?​

Inherited from​

ttsModel?​

Inherited from​

voiceModel?​

Inherited from​

Extends

Properties

inferenceModel

inferenceModelConfig?

interruptionSound?

Inherited from

interruptionSoundVolume?

Inherited from

sttConfig?

sttModel?

thinkingSound?

Inherited from

thinkingSoundInitialDelay?

Inherited from

thinkingSoundVolume?

Inherited from

thinkingSpeechPhrases?

Inherited from

ttsModel?

Inherited from

voiceModel?

Inherited from