GPT-SoVITS
GPT-SoVITS is a widely used WebUI and pipeline for few-shot TTS and voice conversion. It supports zero-shot voice cloning from a reference clip as short as 5 seconds, and few-shot fine-tuning on about 1 minute of audio for a closer match. It also bundles dataset prep tools (vocal/accompaniment separation, ASR, labeling) and supports multiple languages (EN/JA/KO/ZH/Cantonese). If you’re experimenting with custom voices, VTuber-style content, or rapid TTS prototyping on consumer GPUs, this is one of the community's go-to toolkits. ([github.com](https://github.com/RVC-Boss/GPT-SoVITS?utm_source=openai))