Zhipu BigModel launches its first end-to-end voice API
According to Jinshi Data on January 22, Zhipu AI has officially launched GLM-4-Voice, its first end-to-end voice model, on its open platform. The model can directly understand and generate Chinese and English speech, hold real-time voice conversations, and adjust the emotion, tone, speed, and dialect of its voice according to user instructions, making voice interaction more natural and vivid.