← See All Custom Node Packs

ComfyUI-InstantID

1444

Run ComfyUI Easily with InstaSD

Skip the complex setup. InstaSD helps creative professionals build workflows and deploy them to the world:

  • One-click deployment
  • Any model, any node
  • Powerful GPUs for rapid iteration
Get Started

IDGenerationNode

IDGenerationNode Documentation

Overview

The IDGenerationNode is part of the ComfyUI-InstantID package, which provides an interface for generating instant identifiers using the InstantID technology. This node focuses on generating images based on face references, style prompts, and various model-based enhancements. It leverages the capabilities of the Stable Diffusion XL model to synthesize high-quality images conditioned by input prompts and face data.

Features

  • Face Image Conditioning: Uses a reference face image to guide the image generation process, ensuring that outputs can be customized based on facial features.
  • Prompt-based Styling: Incorporates both positive and negative prompts to define the style and characteristics of the generated images.
  • Advanced Controls: Offers multiple user-adjustable parameters such as IP Adapter scale, controlnet conditioning scale, number of steps, and guidance scale.
  • Pose Reference Option: Allows the inclusion of a pose reference image for more customized generation.
  • Face Region Enhancement: Optionally enhances the facial region to improve output quality.

Inputs

The IDGenerationNode accepts the following inputs:

  1. face_image: An image input that provides the facial features to be used as a reference for generation.
  2. pipe: The pre-loaded Stable Diffusion XL model pipeline used for generation.
  3. insightface: The model used for facial feature extraction and analysis.
  4. positive: A string containing the positive prompt, which describes desired features or styles for the generated image.
  5. negative: A string containing the negative prompt, which lists undesired features to avoid in the generated image.
  6. ip_adapter_scale: A float value (0 to 1) that controls the influence of the Image Processing Adapter on image generation.
  7. controlnet_conditioning_scale: A float value (0 to 1) that determines the strength of ControlNet's impact on the image.
  8. steps: An integer defining the number of inference steps, which affects the quality and rendering time of the image.
  9. guidance_scale: A float (typically ranging from 0 to 10) that controls the strength of the guidance provided by text prompts.
  10. enhance_face_region: A boolean to enable or disable specific enhancements of the face region in the output image.
  11. seed: An integer seed value to ensure reproducibility of the generated results.
  12. pose_image_optional: An optional image input used to provide pose reference guidance during generation.

Output

  • IMAGE: The output is an image tensor produced based on the provided inputs, representing the generated visual content with consideration of all input prompts and controls.

Usage in ComfyUI Workflows

The IDGenerationNode can be integrated into ComfyUI workflows to create tailored image outputs for different applications. It can be used in conjunction with nodes that load models or stylize prompts, such as IDBaseModelLoader or ID_Prompt_Styler, to provide comprehensive style and content generation pipelines from human reference images. This makes it particularly useful in projects where customized art creation based on real-world references is desired, such as avatar creation, concept art, or personalized digital scenography.

Special Features and Considerations

  • Pose Reference: Distinctly from the typical OpenPose, the pose reference in this node is specifically focused on the facial area and improves generation quality around facial features.
  • Enhancement Options: Users have the option to enhance the face region, which can be particularly beneficial for portraiture or image outputs where facial fidelity is paramount.
  • Seed Control: The use of a seed allows for deterministic results, enabling users to replicate or fine-tune results efficiently. Adjusting the seed can provide diverse outputs from the same setup.

This node functions as a cornerstone in artistic pipelines requiring both human likeness and creative expression, offering a blend of technical prowess with aesthetic flexibility.