Text-to-Image

Policy Optimized Text-to-Image Pipeline Design

Make It Count: Text-to-Image Generation with an Accurate Number of Objects

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Training-free method to add objects to real or generated images from a text prompt, leveraging diffusion model attention with weighted fusion and latent blending.