Skip to content

PaperCodex

Subscribe

Image Editing

Qwen-Image: Generate and Edit Images with Perfect Text—Even in Chinese

Qwen-Image: Generate and Edit Images with Perfect Text—Even in Chinese 6339

If you’ve ever struggled to generate marketing visuals with legible multilingual text—or tried to edit a product image only to…

12/26/2025Image Editing, Multimodal Text Rendering, Text-to-Image Generation
DragDiffusion: Precise, Interactive Image Editing for Real and AI-Generated Photos Using Diffusion Models

DragDiffusion: Precise, Interactive Image Editing for Real and AI-Generated Photos Using Diffusion Models 1234

DragDiffusion is an open-source framework that brings pixel-precise, point-based image manipulation to both real-world photographs and AI-generated images—without requiring users…

12/19/2025Diffusion Models, Image Editing, Interactive Manipulation
OmniGen: One Unified Model for All Image Generation Tasks—No Plugins, No Preprocessing, Just Prompts

OmniGen: One Unified Model for All Image Generation Tasks—No Plugins, No Preprocessing, Just Prompts 4282

Modern image generation is powerful—but fragmented. Depending on your goal—generating from text, editing existing images, preserving a person’s identity, or…

12/19/2025Image Editing, Subject-driven Generation, Text-to-Image Generation
Lumina-mGPT 2.0: A Standalone Autoregressive Image Generator That Unifies Multimodal Tasks Without Diffusion Dependencies

Lumina-mGPT 2.0: A Standalone Autoregressive Image Generator That Unifies Multimodal Tasks Without Diffusion Dependencies 1076

In the ever-evolving landscape of generative AI, image synthesis has long been dominated by diffusion models—powerful, yet often complex, resource-intensive,…

12/19/2025Controllable Image Synthesis, Image Editing, Text-to-Image Generation
Step1X-Edit: Open-Source Image Editing That Matches GPT-4o and Gemini2 Flash

Step1X-Edit: Open-Source Image Editing That Matches GPT-4o and Gemini2 Flash 1954

Overview Step1X-Edit is a state-of-the-art open-source framework for general-purpose image editing that delivers performance comparable to leading proprietary models like…

12/11/2025Image Editing, Instruction-following Image Generation, Multimodal Reasoning
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex