MiMo-V2.5-TTS

精品音色语音合成,风格指令精准遵循。

内联情绪标签逐字控制,零提示词捕捉角色气质。

模型规格

模态

输入文本
输出音频

模型能力

语音合成
流式输出

性能

上下文长度8K tokens
最大输出8K tokens
RPM100
TPM10M

模型价格

价格
限时免费

模型优势

精品音色库

内置多款高质量精品音色,覆盖多种性别、年龄与风格,开箱即用,无需额外配置即可满足有声剧、播客、配音等场景的多样音色需求。

风格指令遵循

从一句话风格描述到导演式分层剧本均可精准落地,支持对语速、情绪、语气、气声等进行精细化控制,复杂角色气质稳定还原。

内联情绪标签控制

支持在文本中任意位置嵌入内联音频标签,对逐字情绪进行精确控制,多段情绪过渡自然可靠,适合高强度情绪表达与有声剧制作场景。

零提示词文本理解

无需任何风格提示,模型直接从文本内容中捕捉情绪弧线与人物气质,多角色对话中自动推断不同人物声音并自然切换,字里行间的角色感自然呈现。

真实任务中的表现

风格指令:沧桑老前辈

Instruct

声音低沉沙哑一点,像个历经沧桑的老前辈在讲述传奇人物。语气里带点由衷的敬佩,娓娓道来。

Text

街口那个老周啊,媳妇走得早,一个人拉扯俩娃,白天蹬三轮,晚上还去夜市摆摊修鞋。现在俩孩子都有出息喽,想接他去城里享福——他不去,就守着那间小铺子。哎,人哪,骨头硬,心里头就踏实。

Audio
0:00

结构化剧本:神祇灭世独白

Text

你们求我垂怜,求我降下甘霖洗净这浊世。可这世间的沉疴,唯有烈火能剔骨刮毒。闭上眼吧。这业火烧起来的时候,一点也不疼。

Audio
0:00

内联标签:葬礼悼词情绪弧

Text

[crying] She's gone... she's really gone...[pause] but you know what's funny? [sniffles] She always said she'd outlive us all. [crying] God, I miss her so much.

Audio
0:00

文本理解:多角色对话

Text

The five-year-old squealed, "Look, Grandpa! A PUPPY!" The old man squinted and grumbled, "That ain't a puppy, that's a raccoon." The teenager rolled her eyes: "It's OBVIOUSLY a cat, you're both blind." The police officer stepped forward: "Ma'am, sir, I'm going to need everyone to step back slowly." The little boy whimpered, "Is it gonna bite me?"

Audio
0:00

选择适合你的接入方式

按量计费 API 接入

1

获取 API Key

在控制台创建账号,生成专属 API Key(TTS 系列当前限时免费)。

2

示例代码

通过 messages 传递风格指令和文本内容,指定音频格式与音色即可调用。

import os
from openai import OpenAI
import base64

client = OpenAI(
    api_key=os.environ.get("MIMO_API_KEY"),
    base_url="https://api.xiaomimimo.com/v1"
)

completion = client.chat.completions.create(
    model="mimo-v2.5-tts",
    messages=[
        {
            "role": "user",
            "content": "Bright, bouncy, slightly sing-song tone — like you're bursting with good news you can barely hold in. Fast pace, rising pitch at the end."
        },
        {
            "role": "assistant",
            "content": "Hey boss — guess what, guess what? I just got the results back and I actually passed! Not just passed, I got a distinction! I know, I know — you told me I was cutting it close, but hey, here we are. Drinks are on me tonight, okay?"
        }
    ],
    audio={
        "format": "wav",
        "voice": "Chloe"
    }
)

message = completion.choices[0].message
audio_bytes = base64.b64decode(message.audio.data)
with open("audio_file.wav", "wb") as f:
    f.write(audio_bytes)

Token Plan 订阅

1

购买套餐

包月 / 包年订阅,覆盖 MiMo V2.5 全系模型,高用量场景性价比显著优于按量计费(TTS 系列当前限时免费,不消耗 credits)。

在 MiMo Claw 中使用

MiMo Claw 搭载 mimo-v2.5-pro 最新旗舰模型、mimo-v2.5 多模态理解与 mimo-v2.5-tts 语音大模型,可限时免费体验。

Copyright©2026 Xiaomi. All Rights Reserved | Cookie Policy | Cookie Preferences

We use cookies and similar technologies of our own to ensure the proper functioning of the website, customize content according to user preferences and analyze users' interactions on the website, as well as their browsing habits. You can find more information in our Cookie Policy. Select an option or go to Cookie Settings to manage your preferences. Learn More.