Interface: CustomRealtimeModelConfig
Defined in: packages/sdk/src/types/model-config-types.ts:93
Base configuration for all realtime models. Contains common fields that apply to all realtime model providers.
Extends
BaseRealtimeModelConfig
Properties
inferenceModel
inferenceModel: string
Defined in: packages/sdk/src/types/model-config-types.ts:97
inferenceModelConfig?
optional inferenceModelConfig: ChatModelConfig | null
Defined in: packages/sdk/src/types/model-config-types.ts:98
interruptionSound?
optional interruptionSound: InterruptionSound | null
Defined in: packages/sdk/src/types/model-config-types.ts:69
Sound to play when the user stops speaking after interrupting the assistant. Provides immediate auditory feedback that the input was received. Default: none.
Inherited from
BaseRealtimeModelConfig.interruptionSound
interruptionSoundVolume?
optional interruptionSoundVolume: number | null
Defined in: packages/sdk/src/types/model-config-types.ts:71
Volume level for the interruption sound (0-100). 0 means muted. Missing or null means full volume (100).
Inherited from
BaseRealtimeModelConfig.interruptionSoundVolume
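The volume rule above (0 to 100, with missing or null meaning full volume) can be sketched as a small normalizer. `resolveVolume` and `toGain` are hypothetical helpers, not SDK functions, and clamping out-of-range values is an assumption; the SDK may validate or reject them instead.

```typescript
// Hypothetical helper, not part of the SDK.
// Maps an optional 0-100 volume field to a concrete value.
function resolveVolume(volume?: number | null): number {
  // Missing or null means full volume (100), per the field description.
  if (volume === undefined || volume === null) return 100;
  // Assumption: out-of-range values are clamped rather than rejected.
  return Math.min(100, Math.max(0, volume));
}

// Hypothetical helper: converts the 0-100 scale to a 0..1 gain factor.
function toGain(volume?: number | null): number {
  return resolveVolume(volume) / 100;
}
```

The same rule applies to thinkingSoundVolume, so one helper could serve both fields.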
sttConfig?
optional sttConfig: Record<string, unknown> | null
Defined in: packages/sdk/src/types/model-config-types.ts:95
sttModel?
optional sttModel: string | null
Defined in: packages/sdk/src/types/model-config-types.ts:94
thinkingSound?
optional thinkingSound: ThinkingSound | null
Defined in: packages/sdk/src/types/model-config-types.ts:61
Sound to play while the agent is processing tool calls.
Inherited from
BaseRealtimeModelConfig.thinkingSound
thinkingSoundInitialDelay?
optional thinkingSoundInitialDelay: number | null
Defined in: packages/sdk/src/types/model-config-types.ts:65
Initial delay in milliseconds before the thinking sound starts playing. Prevents needless interruption on short tool calls. Default: 500 ms.
Inherited from
BaseRealtimeModelConfig.thinkingSoundInitialDelay
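One way to picture the initial delay is a timer that is cancelled if the tool call finishes first. This is only a sketch of the idea, not the SDK's implementation: `scheduleThinkingSound` and its `play` callback are hypothetical names, and stopping a sound that has already started is out of scope here.

```typescript
// Hypothetical helper, not part of the SDK.
// Schedules `play` after the initial delay and returns a cancel function.
function scheduleThinkingSound(
  play: () => void,
  initialDelayMs?: number | null,
): () => void {
  // Missing or null falls back to the documented default of 500 ms.
  const delay = initialDelayMs ?? 500;
  const timer = setTimeout(play, delay);
  // Call this when the tool call completes. If it completes before the
  // delay elapses, the sound never starts.
  return () => clearTimeout(timer);
}
```

A caller would invoke `scheduleThinkingSound` when a tool call begins and call the returned function when the result arrives.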
thinkingSoundVolume?
optional thinkingSoundVolume: number | null
Defined in: packages/sdk/src/types/model-config-types.ts:63
Volume level for the thinking sound (0-100). 0 means muted. Missing or null means full volume (100).
Inherited from
BaseRealtimeModelConfig.thinkingSoundVolume
thinkingSpeechPhrases?
optional thinkingSpeechPhrases: string[] | null
Defined in: packages/sdk/src/types/model-config-types.ts:67
When thinkingSound is 'speech', one of these phrases is spoken while the agent runs a tool. Phrases rotate round-robin per session.
Inherited from
BaseRealtimeModelConfig.thinkingSpeechPhrases
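Round-robin rotation per session can be illustrated with a counter that wraps around the phrase list. `PhraseRotator` is a hypothetical class written for this illustration; how the SDK tracks rotation internally is not documented here. One instance per session is assumed.

```typescript
// Hypothetical illustration of round-robin phrase selection.
// Assumption: one rotator instance is created per session.
class PhraseRotator {
  private readonly phrases: readonly string[];
  private index = 0;

  constructor(phrases: readonly string[]) {
    this.phrases = phrases;
  }

  // Returns the next phrase in order, wrapping to the start after the last.
  // Returns undefined when no phrases are configured.
  nextPhrase(): string | undefined {
    if (this.phrases.length === 0) return undefined;
    const phrase = this.phrases[this.index];
    this.index = (this.index + 1) % this.phrases.length;
    return phrase;
  }
}
```

With two configured phrases, successive tool calls would get the first, the second, then the first again.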
ttsModel?
optional ttsModel: TTSModel | null
Defined in: packages/sdk/src/types/model-config-types.ts:73
TTS model ID in provider:tts:model format (e.g. 'elevenlabs:tts:eleven_flash_v2_5'), or 'native'.
Inherited from
BaseRealtimeModelConfig.ttsModel
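The provider:tts:model format can be split into its parts with a small parser. This is a sketch under assumptions: `parseTtsModel` and `ParsedTtsModel` are hypothetical names, and the parser assumes neither the provider nor the model segment contains a colon.

```typescript
// Hypothetical result type for this illustration.
type ParsedTtsModel =
  | { kind: "native" }
  | { kind: "provider"; provider: string; model: string };

// Hypothetical parser, not part of the SDK.
// Returns null when the ID matches neither documented form.
function parseTtsModel(id: string): ParsedTtsModel | null {
  if (id === "native") return { kind: "native" };
  const parts = id.split(":");
  if (parts.length !== 3) return null;
  const [provider, marker, model] = parts;
  // The middle segment must be the literal "tts"; both ends must be non-empty.
  if (!provider || marker !== "tts" || !model) return null;
  return { kind: "provider", provider, model };
}
```

For the documented example 'elevenlabs:tts:eleven_flash_v2_5', the provider segment is elevenlabs and the model segment is eleven_flash_v2_5.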
voiceModel?
optional voiceModel: string | null
Defined in: packages/sdk/src/types/model-config-types.ts:75
Canonical voice identifier (e.g., elevenlabs:voice:catt) that selects the specific agent voice.
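Putting the fields together, a configuration object might look like the sketch below. To keep the example self-contained it declares a local stand-in interface that mirrors a subset of the documented fields; the real `CustomRealtimeModelConfig`, `ThinkingSound`, and `TTSModel` types come from the SDK. The `inferenceModel` value is a placeholder, since its expected format is not described on this page.

```typescript
// Local stand-in mirroring a subset of the documented fields.
// The real interface is exported by the SDK; this copy is for illustration only.
interface RealtimeConfigSketch {
  inferenceModel: string;
  sttModel?: string | null;
  sttConfig?: Record<string, unknown> | null;
  thinkingSound?: string | null; // stand-in for ThinkingSound
  thinkingSoundVolume?: number | null;
  thinkingSoundInitialDelay?: number | null;
  thinkingSpeechPhrases?: string[] | null;
  ttsModel?: string | null; // stand-in for TTSModel
  voiceModel?: string | null;
}

const exampleConfig: RealtimeConfigSketch = {
  inferenceModel: "example-inference-model", // placeholder, not a real model ID
  thinkingSound: "speech",
  thinkingSpeechPhrases: ["One moment.", "Let me check that."],
  thinkingSoundInitialDelay: 500, // matches the documented default
  thinkingSoundVolume: 60,
  ttsModel: "elevenlabs:tts:eleven_flash_v2_5",
  voiceModel: "elevenlabs:voice:catt",
};
```

Only inferenceModel is required; every other field may be omitted or set to null.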