Release 0.9.0
Info
  • Release Date: April 20, 2024
New Features
  • Add support for google v2 STT api
  • Add support for additional TTS vendors: PlayHT, RimeLabs, and Deepgram
  • Add support for streaming TTS (reduces latency) for Deepgram, ElevenLabs, Microsoft, PlayHT, RimeLabs, and Whisper
  • Add support for bidirectional audio in listen verb
  • Add new verb: dub to insert additional audio tracks into the conversation; see here for example usage.
  • Add boostAudioSignal to config verb allowing the volume of a conversation to be increased or lowered.
  • Add support for "filler" audio to the gather verb allowing brief audio to be played to a caller while the user application is processing a user utterance or dtmf collection, this can be useful in scenarios where an AI bot is expected to take a lengthy time to process a request
  • Add coach mode to conferencing, allowing for instance a manager to "whisper" to an agent on a 3-way conference with a customer
  • Add support for sending outbound OPTIONS pings to configured SIP trunks
  • If Deepgram endpointing is enabled, default utterance_end_ms to 1000 if none specified by the application (per Deepgram recommendation)
  • various improvements and enhancements to node-client-ws
Bug fixes
  • various fixes for Deepgram STT
  • 714 bargein "sticky" only works twice
  • 710 fix for actionHookDelay action
  • 671 handling of siprec invite failure
  • 666 transcribe on dial verb does not transcribe B leg by default
  • fix for precaching of TTS
  • check if sip gateway is in blacklist before sending outbound call
SQL changes
ALTER TABLE sip_gateways ADD COLUMN send_options_ping BOOLEAN NOT NULL DEFAULT 0
ALTER TABLE applications MODIFY COLUMN speech_synthesis_voice VARCHAR(256)
ALTER TABLE applications MODIFY COLUMN fallback_speech_synthesis_voice VARCHAR(256)
Availability
  • Available now on jambonz.cloud
  • devops scripts (packer, cloudformation, helm) available now for subscription customers

Questions? Contact us at support@jambonz.org