@@ -28,9 +28,11 @@ <h4 style="font-size: 1.3rem; line-height: 1; text-align: center;">Intrinsic Zer
2828 Encoder</ h4 >
2929 < p class ="author ">
3030 MiniMax Team < span class ="date "> May 2025</ span > < br />
31- < a style ="font-size: 1.1rem; " target ="_blank "
32- href ="https://minimax-ai.github.io/tts_tech_report/MiniMax_Speech.pdf "> [Tech
31+ < a style ="font-size: 1.1rem; " target ="_blank " href ="https://arxiv.org/abs/2505.07916 "> [Tech
3332 Report]</ a >
33+ < a style ="font-size: 1.1rem; margin-left: 1rem; " target ="_blank "
34+ href ="https://huggingface.co/datasets/MiniMaxAI/TTS-Multilingual-Test-Set "> [Multilingual Test Set]</ a >
35+ < a style ="font-size: 1.1rem; margin-left: 1rem; " target ="_blank " href ="https://github.com/MiniMax-AI "> [GitHub]</ a >
3436 </ p >
3537 </ header >
3638
@@ -57,13 +59,16 @@ <h2>Abstract</h2>
5759 control
5860 via LoRA; text to voice (T2V) by synthesizing timbre features directly from text description; and professional
5961 voice
60- cloning (PVC) by fine-tuning timbre features with additional data. Welcome to visit
61- < a href ="https://www.minimax.io/audio "> MiniMax Audio</ a > and
62- explore our powerful TTS features.
62+ cloning (PVC) by fine-tuning timbre features with additional data.
6363 </ p >
6464 </ div >
6565
6666 < nav role ="navigation " class ="toc ">
67+ < h2 > Explore MiniMax-Speech</ h2 >
68+ < p > Welcome to visit
69+ < a href ="https://www.minimax.io/audio "> MiniMax Audio</ a > and
70+ explore our powerful TTS features.
71+ </ p >
6772 < h2 > Contents</ h2 >
6873 < ol >
6974 < li >
@@ -232,9 +237,8 @@ <h2 id="zero-shot-vs-one-shot-demonstrations">Zero-Shot vs. One-Shot Demonstrati
232237 features based
233238 on the text content, whereas OneShot adheres more strictly to the speaker characteristics (prosody, speech
234239 rate,
235- emotions, etc.) demonstrated in the audio prompt (The additional input that OneShot has compared to ZeroShot,
236- see
237- technical report for details).
240+ emotions, etc.). For details of Zero-Shot and One-Shot, refer to the < a
241+ href ="https://arxiv.org/abs/2505.07916 " target ="_blank "> technical report</ a > .
238242 </ p >
239243 < div class ="scroll-wrapper " style ="margin-top: 2rem; ">
240244 < table style ="width: 100%; ">
0 commit comments