🌟 HiDream Img2Img ComfyUI Workflow

License: MIT | Hugging Face | Replicate

Advanced image-to-image generation with the HiDream model suite and the Florence-2 prompt generator

📋 Overview

This workflow pairs the HiDream diffusion models with Florence-2 captioning for image-to-image generation in ComfyUI:

  • ✨ Image-to-image generation with the state-of-the-art HiDream diffusion model
  • 🔮 Optional Florence-2 intelligent prompt generation and image captioning
  • 🖼️ VAE encoding/decoding and advanced CLIP-based text encoding
  • 🚫 Customizable negative prompts for artifact reduction
  • 💻 Low VRAM mode available for systems with limited resources

🚀 Try It Now!

You can test this workflow directly on Replicate:
▶️ Run on Replicate

📥 Required Models & Setup

🎨 Diffusion Model

The workflow supports two HiDream model variants:

Full Model (Default)

  • hidream_i1_full_fp16.safetensors
    📁 Place in: ComfyUI/models/diffusion_models
    📦 Download

Dev Model (Alternative)

  • hidream_i1_dev_bf16.safetensors
    📁 Place in: ComfyUI/models/diffusion_models
    📦 Download

Credit: HiDream.ai

📝 Text Encoders

📁 Place all in: ComfyUI/models/text_encoders

  • clip_g_hidream.safetensors
    📦 Download

  • clip_l_hidream.safetensors
    📦 Download

  • llama_3.1_8b_instruct_fp8_scaled.safetensors
    📦 Download

  • t5xxl_fp8_e4m3fn_scaled.safetensors
    📦 Download

🖼️ VAE

  • ae.safetensors
    📁 Place in: ComfyUI/models/vae
    📦 Download
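
Before importing the workflow, it can help to confirm that every file is where the loaders will look for it. The following is a minimal sketch, not part of the workflow itself: the file names and folder names come from the lists above, while the helper `find_missing_models` and the `ComfyUI` root path are illustrative assumptions.

```python
from pathlib import Path

# File and folder names are copied from the lists above (Full model variant).
# Swap in hidream_i1_dev_bf16.safetensors if you use the Dev model.
REQUIRED_MODELS = {
    "diffusion_models": ["hidream_i1_full_fp16.safetensors"],
    "text_encoders": [
        "clip_g_hidream.safetensors",
        "clip_l_hidream.safetensors",
        "llama_3.1_8b_instruct_fp8_scaled.safetensors",
        "t5xxl_fp8_e4m3fn_scaled.safetensors",
    ],
    "vae": ["ae.safetensors"],
}


def find_missing_models(comfyui_root, required=REQUIRED_MODELS):
    """Return 'folder/file' entries for required models that are not on disk."""
    models_dir = Path(comfyui_root) / "models"
    missing = []
    for folder, filenames in required.items():
        for name in filenames:
            if not (models_dir / folder / name).is_file():
                missing.append(f"{folder}/{name}")
    return missing


if __name__ == "__main__":
    # "ComfyUI" is a hypothetical install location; point this at your own checkout.
    for entry in find_missing_models("ComfyUI"):
        print("missing:", entry)
```

An empty result means all six files were found; anything printed is a file that still needs to be downloaded or moved.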

🔍 Florence-2 Prompt Generator

💡 Usage Guide

  1. Download all required models and place them in the correct directories as listed above
  2. Import the workflow into ComfyUI
  3. Load your input image and adjust the settings as needed
  4. Choose whether to use Florence-2 automatic captioning:
    • With Florence-2: Provide a brief prefix that will be combined with the AI-generated caption
    • Without Florence-2: Enter your complete custom prompt directly
  5. Customize the negative prompt to avoid unwanted elements
  6. Run the workflow to generate the new image
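
The two prompt modes in step 4 can be pictured with a small sketch. This only illustrates the idea of combining a prefix with a generated caption. The function `build_prompt` is hypothetical, and the `", "` separator is an assumption, since the workflow's nodes may join the two parts differently.

```python
def build_prompt(user_text, florence_caption=None):
    """Combine a short prefix with a Florence-2 caption, or pass a full prompt through."""
    user_text = user_text.strip()
    if florence_caption is None:
        # Without Florence-2: the user text is the complete prompt.
        return user_text
    caption = florence_caption.strip()
    # With Florence-2: prefix first, then the generated caption.
    # The ", " separator is an assumption made for this sketch.
    return f"{user_text}, {caption}" if user_text else caption


# With Florence-2: a brief prefix plus the generated caption.
print(build_prompt("oil painting", "a cat sitting on a windowsill"))
# prints: oil painting, a cat sitting on a windowsill

# Without Florence-2: the complete custom prompt is used as-is.
print(build_prompt("an oil painting of a cat on a windowsill at dusk"))
```

Keeping the prefix short leaves the caption to describe the image content, while the prefix steers style.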

💻 Low VRAM Mode (< 24 GB VRAM)

For systems with limited VRAM, use this alternative setup:

  1. Install the city96/ComfyUI-GGUF custom node
  2. Replace the standard diffusion model loader with the Unet Loader (GGUF) node
  3. Download the optimized HiDream-I1 Full or Dev GGUF model
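
To decide whether this setup applies to your machine, you can compare your GPU's total memory with the 24 GB figure in the heading. The sketch below is an assumption-laden helper, not part of the workflow: it works only on NVIDIA GPUs with `nvidia-smi` on the PATH, and the function names are hypothetical. Memory is rounded to whole GiB because cards sold as 24 GB often report slightly less.

```python
import subprocess

LOW_VRAM_THRESHOLD_GB = 24  # taken from the section heading above


def needs_low_vram_mode(total_vram_mib, threshold_gb=LOW_VRAM_THRESHOLD_GB):
    """Return True when the nominal VRAM size (rounded to GiB) is below the threshold."""
    return round(total_vram_mib / 1024) < threshold_gb


def query_total_vram_mib():
    """Read the total memory of the first GPU, in MiB, via nvidia-smi."""
    output = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return int(output.splitlines()[0].strip())


if __name__ == "__main__":
    try:
        total = query_total_vram_mib()
    except (FileNotFoundError, subprocess.CalledProcessError, ValueError, IndexError):
        print("Could not query nvidia-smi; check your VRAM manually.")
    else:
        setup = "GGUF (low VRAM)" if needs_low_vram_mode(total) else "standard"
        print(f"{total} MiB detected, suggested setup: {setup}")
```

The threshold is only a rule of thumb from this README; actual memory use also depends on resolution and on which encoders are loaded.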

📊 Workflow Diagram

[Image: HiDream Workflow Diagram]

🙏 Acknowledgements

  • HiDream.ai for the remarkable diffusion model and encoders
  • Microsoft for the Florence-2 vision-language model
  • MiaoshouAI for the Florence-2 prompt generator implementation
  • ComfyUI team for the intuitive workflow engine
  • city96 for the GGUF optimization for low VRAM systems

⭐ If you find this workflow useful, please consider starring the repository! ⭐
