We need more of these open-source TTS and STT models. Thank you StepFun!
🚀 Step-Audio-EditX is now open source!!
✨ Zero-Shot TTS with high timbre similarity
✨ Iterative editing of dozens of audio emotions and speaking styles
✨ Fine-grained control over paralinguistic features
Whether for audio editing, interactive design, or personalized scenarios, it unlocks unprecedented audio expression for you!
🌟 GitHub: github.com/stepfun-ai/Step-A…
📑 arXiv: arxiv.org/abs/2511.03601
🔥 Demo Page: stepaudiollm.github.io/step-…
🎮 HF playground: huggingface.co/spaces/stepfu…
Nov 7, 2025 · 8:48 PM UTC
