Awesome Zero-Shot Voice Cloning Papers and Source Codes

Spark-TTS: Zero-Shot, Controllable Text-to-Speech with a Single LLM—No Vocoder, No Flow Matching 10840

Overview In the rapidly evolving landscape of AI-powered speech synthesis, complexity has long been the price of quality. Traditional text-to-speech…