Free Reads
Sign in to view your remaining parses.
Tag Filter
Multimodal Image Generation
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Published:8/13/2023
Text-to-Image Diffusion ModelsImage Prompt GenerationDecoupled Cross-Attention MechanismLightweight Adapter DesignMultimodal Image Generation
The paper introduces IPAdapter, a lightweight adapter that enhances pretrained texttoimage diffusion models' image prompt capabilities through a decoupled crossattention mechanism, achieving performance comparable to fully finetuned models with only 22M parameters.
02
Canvas-to-Image: Compositional Image Generation with Multimodal Controls
Published:11/27/2025
Multimodal Image GenerationJoint Training of Diffusion ModelsText-to-Image Generation FrameworkCanvas Image GenerationMulti-Task Datasets
The paper presents CanvastoImage, a unified framework for highfidelity compositional image generation with multimodal controls, encoding diverse signals into a single composite canvas image. It introduces a MultiTask Canvas Training strategy, enhancing the model's understandi
07