Skip to content

PaperCodex

Subscribe

Text-to-Image Generation

StoryDiffusion: Generate Consistent Long-Form Visual Stories from Text Without Retraining Models

StoryDiffusion: Generate Consistent Long-Form Visual Stories from Text Without Retraining Models 6351

Creating visually coherent sequences of images or videos from text prompts has long been a bottleneck in AI-powered storytelling. While…

12/17/2025Text-to-Image Generation, Video Generation, Visual Storytelling
MMaDA: One Unified Model for Text Reasoning, Multimodal Understanding, and Image Generation

MMaDA: One Unified Model for Text Reasoning, Multimodal Understanding, and Image Generation 1518

Imagine running a single model that can answer complex reasoning questions, understand images and text together, and generate high-quality images…

12/17/2025Diffusion Language Models, Multimodal Reasoning, Text-to-Image Generation
InstantCharacter: Generate Consistent, High-Fidelity Character Images from a Single Photo—No Fine-Tuning Required

InstantCharacter: Generate Consistent, High-Fidelity Character Images from a Single Photo—No Fine-Tuning Required 1044

Creating personalized, visually consistent characters is a common need across gaming, animation, virtual avatars, and digital storytelling—but until recently, doing…

12/11/202512/15/2025Character Personalization, Diffusion Transformer Adaptation, Text-to-Image Generation

Posts pagination

Previous 1 2 3
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex