News

Enabling real-time video analysis allows users to interact with their environment in novel ways, from diagnosing mechanical issues to creating personalized bedtime stories.
Creating voice agents just got a whole lot easier, thanks to OpenAI's latest speech-to-speech model, GPT-Realtime.
One of OpenAI’s new features, dubbed the Realtime API, will give developers the chance to build nearly real-time, speech-to-speech experiences in their apps, with the choice of using six voices ...
OpenAI now lets users video chat with ChatGPT in advanced voice mode, and the chatbot will respond to real-time images.
OpenAI's Realtime API is now generally available, featuring the new gpt-realtime model for more natural voice agents at a 20% lower cost for developers.