๐ HiDream Img2Img ComfyUI Workflow
Advanced image-to-image generation with HiDream model suite and Florence-2 prompt generator
๐ Overview
This workflow combines the power of HiDream diffusion models with Florence-2 captioning for enhanced image-to-image generation in ComfyUI:
- โจ Image-to-image generation with the state-of-the-art HiDream diffusion model
- ๐ฎ Optional Florence-2 intelligent prompt generation and image captioning
- ๐ผ๏ธ VAE encoding/decoding and advanced CLIP-based text encoding
- ๐ซ Customizable negative prompts for artifact reduction
- ๐ป Low VRAM mode available for systems with limited resources
๐ Try It Now!
You can test this workflow directly on Replicate:
โถ๏ธ Run on Replicate
๐ฅ Required Models & Setup
๐จ Diffusion Model
The workflow supports two HiDream model variants:
Full Model (Default)
hidream_i1_full_fp16.safetensors
๐ Place in:ComfyUI/models/diffusion_models
๐ฆ Download
Dev Model (Alternative)
hidream_i1_dev_bf16.safetensors
๐ Place in:ComfyUI/models/diffusion_models
๐ฆ Download
Credit: HiDream.ai
๐ Text Encoders
๐ Place all in: ComfyUI/models/text_encoders
clip_g_hidream.safetensors
๐ฆ Downloadclip_l_hidream.safetensors
๐ฆ Downloadllama_3.1_8b_instruct_fp8_scaled.safetensors
๐ฆ Downloadt5xxl_fp8_e4m3fn_scaled.safetensors
๐ฆ Download
๐ผ๏ธ VAE
ae.safetensors
๐ Place in:ComfyUI/models/vae
๐ฆ Download
๐ Florence-2 Prompt Generator
Florence-2-large
โก Automatic download at runtime
๐ฆ Microsoft Florence-2Credit: MiaoshouAI for the optimized implementation
๐ก Usage Guide
- Download all required models and place them in the correct directories as listed above
- Import the workflow into ComfyUI
- Load your input image, adjust settings as needed
- Choose whether to use Florence-2 automatic captioning:
- With Florence-2: Provide a brief prefix that will be combined with the AI-generated caption
- Without Florence-2: Enter your complete custom prompt directly
- Customize the negative prompt to avoid unwanted elements
- Generate new images with enhanced quality
๐ป Low VRAM Mode (< 24GB VRAM)
For systems with limited VRAM, use this alternative setup:
- Install city96/ComfyUI-GGUF custom node
- Replace the standard Diffusion Model Loader with the Unet LOADER node
- Download the optimized HiDream-I1 Full or DEV GGUF model:
- ๐ฆ HiDream-I1-Full-gguf
- ๐ฆ HiDream-I1-DEV-gguf
- ๐ Place in:
ComfyUI/models/unet
๐ Workflow Diagram

๐ Acknowledgements
- HiDream.ai for the remarkable diffusion model and encoders
- Microsoft for the Florence-2 vision-language model
- MiaoshouAI for the Florence-2 prompt generator implementation
- ComfyUI team for the intuitive workflow engine
- city96 for the GGUF optimization for low VRAM systems
โญ If you find this workflow useful, please consider starring the repository! โญ
Model tree for hofixD/comfyui-hidream-l1-full-img2img
Base model
HiDream-ai/HiDream-I1-Full