Skip to content

PaperCodex

Subscribe

Voice Conversion

HierSpeech++: Human-Level Zero-Shot Speech Synthesis with Fast Inference and High Fidelity

HierSpeech++: Human-Level Zero-Shot Speech Synthesis with Fast Inference and High Fidelity 1232

In the rapidly evolving field of speech synthesis, achieving natural-sounding, speaker-consistent voice generation without speaker-specific training data has long been…

12/17/2025Speech Super-Resolution, Voice Conversion, Zero-shot Text-to-Speech
Amphion: A Unified Open-Source Toolkit for Zero-Shot Speech, Singing, and Audio Generation

Amphion: A Unified Open-Source Toolkit for Zero-Shot Speech, Singing, and Audio Generation 9539

Amphion is an open-source toolkit purpose-built for audio, music, and speech generation that dramatically lowers the entry barrier for junior…

12/15/202512/15/2025Singing Voice Conversion, Text-to-Speech, Voice Conversion
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex