刚刚,OpenAI 放出了三个全新的实时语音模型,其中一个翻译模型,能把 70 多种语言实时翻译成 13 种语言输出,每分钟成本 2 毛钱。 GPT-Realtime-2,是 OpenAI 目前最强的语音模型,具备 GPT-5 ...
OpenAI has introduced the public beta of its Realtime API, offering developers a tool to integrate natural, low-latency, multimodal interactions into their applications. Now available to all paid ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
最强实时语音模型支持笑声捕捉、无缝切换语言。 智东西8月29日消息,今天凌晨,OpenAI发布为开发人员打造的语音转语音模型GPT-RealTime,并同步更新了包括远程MCP服务器支持、图像输入和SIP(通过会话发起协议)电话呼叫支持的API功能。 OpenAI称这是其迄今为止 ...
Agora's Conversational AI Engine offers key enhancements to the Realtime API for more natural communication and interaction. SANTA CLARA, Calif., Sept. 4, 2025 /PRNewswire/ -- Agora (NASDAQ: API), the ...
AI thrives on data but feeding it the right data is harder than it seems. As enterprises scale their AI initiatives, they face the challenge of managing diverse data pipelines, ensuring proximity to ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Perplexity Labs has recently introduced a new, fast, and efficient API for open-source Large Language Models (LLMs) known as pplx-api. This innovative tool is designed to provide quick access to ...
GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...