The open source voice gateway for conversational AI providers

Release 0.9.0

Info

Release Date: April 20, 2024

New Features

Add support for google v2 STT api
Add support for additional TTS vendors: PlayHT, RimeLabs, and Deepgram
Add support for streaming TTS (reduces latency) for Deepgram, ElevenLabs, Microsoft, PlayHT, RimeLabs, and Whisper
Add support for bidirectional audio in listen verb
Add new verb: dub to insert additional audio tracks into the conversation; see here for example usage.
Add boostAudioSignal to config verb allowing the volume of a conversation to be increased or lowered.
Add support for "filler" audio to the gather verb allowing brief audio to be played to a caller while the user application is processing a user utterance or dtmf collection, this can be useful in scenarios where an AI bot is expected to take a lengthy time to process a request
Add coach mode to conferencing, allowing for instance a manager to "whisper" to an agent on a 3-way conference with a customer
Add support for sending outbound OPTIONS pings to configured SIP trunks
If Deepgram endpointing is enabled, default utterance_end_ms to 1000 if none specified by the application (per Deepgram recommendation)
various improvements and enhancements to node-client-ws

Bug fixes

various fixes for Deepgram STT
714 bargein "sticky" only works twice
710 fix for actionHookDelay action
671 handling of siprec invite failure
666 transcribe on dial verb does not transcribe B leg by default
fix for precaching of TTS
check if sip gateway is in blacklist before sending outbound call

SQL changes

ALTER TABLE sip_gateways ADD COLUMN send_options_ping BOOLEAN NOT NULL DEFAULT 0
ALTER TABLE applications MODIFY COLUMN speech_synthesis_voice VARCHAR(256)
ALTER TABLE applications MODIFY COLUMN fallback_speech_synthesis_voice VARCHAR(256)

Availability

Available now on jambonz.cloud
devops scripts (packer, cloudformation, helm) available now for subscription customers

Questions? Contact us at support@jambonz.org