Original Reddit post

A few days ago, Qwen released a new text-to-speech model: Qwen3-TTS-12Hz-0.6B-Base. I built a simple web app so you can test it instantly:

- No registration required
- Free to use
- Up to 500 characters per conversion
- Upload a voice sample and enter text, and it generates cloned speech

Honestly, the quality is surprisingly good for a 0.6B model.

Model: https://github.com/QwenLM/Qwen3-TTS

Web app where you can test the model for free: https://imiteo.com/

Supports 10 major languages: English, Chinese, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian.

It runs on an NVIDIA L4 GPU, and the app also shows the conversion time and useful generation stats.

Originally posted by u/OneMoreSuperUser on r/ArtificialInteligence