OpenAI Releases GPT-realtime, a Dedicated Model for Voice AI Agents

AI Daily News updated 5d ago dongdong
13 0

OpenAI has released the speech model GPT-realtime, a multimodal model designed specifically for voice AI agents. It can generate natural and fluent speech, accurately mimicking human intonation, emotions, and pace, while also supporting image understanding combined with voice or text interactions. The update introduces two new voices—Marin and Cedar—and enhances the existing eight voices. Equipped with advanced intelligence, reasoning, and comprehension abilities, GPT-realtime can capture non-verbal cues, switch languages, and adjust tone seamlessly.

© Copyright Notice

Related Posts

No comments yet...

none
No comments yet...