PaperCodex

Zero-shot Text-to-Speech

ZipVoice-Dialog: Generate Realistic Spoken Dialogues Instantly—No Fine-Tuning, No Templates

Creating natural-sounding spoken dialogues between two people has long been a pain point in AI-driven voice applications. Traditional approaches either…

01/05/2026 | Non-autoregressive TTS, Spoken Dialogue Generation, Zero-shot Text-to-Speech

HierSpeech++: Human-Level Zero-Shot Speech Synthesis with Fast Inference and High Fidelity

In the rapidly evolving field of speech synthesis, achieving natural-sounding, speaker-consistent voice generation without speaker-specific training data has long been…

12/17/2025 | Speech Super-Resolution, Voice Conversion, Zero-shot Text-to-Speech

Copyright © 2026 PaperCodex.