IBM and Deepgram partner to advance voice capabilities for enterprise AI, targeting highly performant, enterprise-grade transcription and real-time captioning.

“Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale,” said Scott Stephenson, CEO and co-founder, Deepgram. “By embedding Deepgram inside watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on top of a real-time foundation that has been developed and refined over more than a decade.”

Voice capabilities for enterprise AI
The collaboration entails integrating Deepgram’s speech-to-text (STT) and text-to-speech (TTS) capabilities into IBM’s watsonx Orchestrate generative AI solution, making Deepgram IBM’s first voice partner. The new voice capabilities aim to help enterprises automate their operations and enable users to interact with digital agents using natural speech.

“Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernising their operations,” said Nick Holda, vice president of AI Technology Partnerships at IBM. “This collaboration aims to help enterprise organisations accelerate their AI initiatives and reinforces IBM’s open ecosystem, bringing choice and cutting-edge voice technology to partners and customers.”
