Agora and OpenAI have formed an extended partnership, leveraging the growing conversational AI market. This collaboration brings a host of advanced features to Agora’s Conversational AI engine. The key highlight is the integration of OpenAI’s Real-time API, empowering developers and businesses to create AI agents that mimic human interaction across voice, text, and other modalities in real-time.

“Real-time multimodal interaction is the missing piece for AI agents to feel truly human,” said Tony Zhao, CEO of Agora.“ By integrating OpenAI’s Real-time API into our Conversational AI Engine, we’re giving developers the tools to build experiences that are faster, smarter, and more natural than ever before.”
Conversational AI Engine
Agora’s Conversational AI Engine now features a suite of user-friendly enhancements, meticulously designed to facilitate natural interaction with AI agents. These features include automated greetings for instant session awareness, a natural onboarding experience, and mixed-modality interaction for seamless switching between voice and text input within a single interactive session.
It also offers a flexible turn-detection option, empowering developers to steer the conversational flow and turn-taking behaviour, as well as provide uninterrupted input. Additionally, it features Agora’s proprietary Selective Attention Locking technology, which can filter out noise and interfering voices.
The integration of OpenAI’s real-time language model with Agora’s global real-time network infrastructure (SDRTN) and purpose-built developer toolkit aims to enable the acceleration of time to market, simplify application development, and deliver enhanced conversational AI experiences.