模型规格
模态
模型能力
性能
模型价格
模型优势
精品音色库
内置多款高质量精品音色,覆盖多种性别、年龄与风格,开箱即用,无需额外配置即可满足有声剧、播客、配音等场景的多样音色需求。
风格指令遵循
从一句话风格描述到导演式分层剧本均可精准落地,支持对语速、情绪、语气、气声等进行精细化控制,复杂角色气质稳定还原。
内联情绪标签控制
支持在文本中任意位置嵌入内联音频标签,对逐字情绪进行精确控制,多段情绪过渡自然可靠,适合高强度情绪表达与有声剧制作场景。
零提示词文本理解
无需任何风格提示,模型直接从文本内容中捕捉情绪弧线与人物气质,多角色对话中自动推断不同人物声音并自然切换,字里行间的角色感自然呈现。
真实任务中的表现
风格指令:沧桑老前辈
Instruct
声音低沉沙哑一点,像个历经沧桑的老前辈在讲述传奇人物。语气里带点由衷的敬佩,娓娓道来。
Text
街口那个老周啊,媳妇走得早,一个人拉扯俩娃,白天蹬三轮,晚上还去夜市摆摊修鞋。现在俩孩子都有出息喽,想接他去城里享福——他不去,就守着那间小铺子。哎,人哪,骨头硬,心里头就踏实。
结构化剧本:神祇灭世独白
Text
你们求我垂怜,求我降下甘霖洗净这浊世。可这世间的沉疴,唯有烈火能剔骨刮毒。闭上眼吧。这业火烧起来的时候,一点也不疼。
内联标签:葬礼悼词情绪弧
Text
[crying] She's gone... she's really gone...[pause] but you know what's funny? [sniffles] She always said she'd outlive us all. [crying] God, I miss her so much.
文本理解:多角色对话
Text
The five-year-old squealed, "Look, Grandpa! A PUPPY!" The old man squinted and grumbled, "That ain't a puppy, that's a raccoon." The teenager rolled her eyes: "It's OBVIOUSLY a cat, you're both blind." The police officer stepped forward: "Ma'am, sir, I'm going to need everyone to step back slowly." The little boy whimpered, "Is it gonna bite me?"
选择适合你的接入方式
按量计费 API 接入
示例代码
通过 messages 传递风格指令和文本内容,指定音频格式与音色即可调用。
import os
from openai import OpenAI
import base64
client = OpenAI(
api_key=os.environ.get("MIMO_API_KEY"),
base_url="https://api.xiaomimimo.com/v1"
)
completion = client.chat.completions.create(
model="mimo-v2.5-tts",
messages=[
{
"role": "user",
"content": "Bright, bouncy, slightly sing-song tone — like you're bursting with good news you can barely hold in. Fast pace, rising pitch at the end."
},
{
"role": "assistant",
"content": "Hey boss — guess what, guess what? I just got the results back and I actually passed! Not just passed, I got a distinction! I know, I know — you told me I was cutting it close, but hey, here we are. Drinks are on me tonight, okay?"
}
],
audio={
"format": "wav",
"voice": "Chloe"
}
)
message = completion.choices[0].message
audio_bytes = base64.b64decode(message.audio.data)
with open("audio_file.wav", "wb") as f:
f.write(audio_bytes)在 MiMo Claw 中使用
MiMo Claw 搭载 mimo-v2.5-pro 最新旗舰模型、mimo-v2.5 多模态理解与 mimo-v2.5-tts 语音大模型,可限时免费体验。