
ComfyUI-Lumina-mGPT-Wrapper

Overview

This custom node integrates the Lumina-mGPT model into ComfyUI, enabling high-quality image generation using the advanced Lumina text-to-image pipeline. It offers a robust implementation with support for various model sizes and advanced features, including improved latent space handling and flexible crop sizes.

Features

Harnesses the power of the Lumina-mGPT model for state-of-the-art image generation
Supports multiple model sizes: 512, 768, 768-Omni, and 1024
Offers a range of generation parameters for fine-tuned control
Implements Lumina-specific features including cfg-scale and image top-k sampling
Automatic model downloading for seamless setup
Outputs both generated images and latent representations
Includes a converter node for ComfyUI compatibility
Provides a decoder node for latent-to-image conversion
Improved latent space handling for better compatibility with ComfyUI
Flexible crop sizes with automatic adjustment to valid dimensions
Enhanced error handling and logging for easier troubleshooting

Preparation

The Chameleon implementation in transformers does not include the VQ-VAE decoder, so the original VQ-VAE weights provided by Meta must be placed in the directory shown below.

NOTE: I've added the files to the repository at the following path; however, they need to be extracted from the zip file before running the node:

"ComfyUI-Lumina-mGPT-Wrapper/Lumina-mGPT/lumina_mgpt/ckpts/chameleon/tokenizer.zip"

After extraction, the directory should look like this:

Lumina-mGPT
- lumina_mgpt/
    - ckpts/
        - chameleon/
            - tokenizer/
                - text_tokenizer.json
                - vqgan.yaml
                - vqgan.ckpt
- xllmx/
- ...

You can download the required files from Meta's Chameleon Downloads.
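
The extraction step can also be scripted. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and it assumes the archive unpacks into a tokenizer/ folder next to the zip file; if your archive is laid out differently, adjust the destination accordingly.

```python
import zipfile
from pathlib import Path

# File names taken from the directory layout shown in this README.
EXPECTED_FILES = ("text_tokenizer.json", "vqgan.yaml", "vqgan.ckpt")


def missing_tokenizer_files(tokenizer_dir):
    """Return the expected files that are not present in tokenizer_dir."""
    tokenizer_dir = Path(tokenizer_dir)
    return [name for name in EXPECTED_FILES if not (tokenizer_dir / name).is_file()]


def extract_tokenizer(zip_path):
    """Extract tokenizer.zip next to itself and report any missing files.

    Assumption: the archive contains a top-level tokenizer/ folder.
    """
    zip_path = Path(zip_path)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(zip_path.parent)
    return missing_tokenizer_files(zip_path.parent / "tokenizer")
```

Calling extract_tokenizer with the path to tokenizer.zip returns an empty list when all three files are in place, and otherwise lists the file names that still need to be downloaded.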

Installation

Ensure you have ComfyUI installed and properly set up.

Clone this repository into your ComfyUI custom nodes directory:

git clone https://github.com/Excidos/ComfyUI-Lumina-mGPT-Wrapper.git

The required dependencies will be installed automatically.
If you run into trouble, you may have to install the xllmx package independently by copying the path of your python_embedded python.exe and running the following in the Lumina-mGPT directory:

Go to the root path of the project:

cd Lumina-mGPT

Install it as a package:

"Path to your embedded python.exe" -m pip install -e .

Usage

Launch ComfyUI.
Locate the "Load Lumina-mGPT Model" node in the node selection menu.
Add the node to your workflow and connect it to a "Lumina-mGPT Image Generate" node.
(Optional) Use the "Lumina-mGPT Crop Selector" to choose a specific resolution.
Configure the node parameters as desired.
Connect the output to either a "Lumina-mGPT Converter" or "Lumina-mGPT Decoder" node for further processing or display.
Execute your workflow to generate images.

Nodes and Parameters

Load Lumina-mGPT Model

model: Choose from available model sizes (512, 768, 768-Omni, 1024)
precision: Select precision (bf16 or fp32)

Lumina-mGPT Crop Selector

target_size: Select from 512, 768, or 1024
aspectRatio: Choose from various aspect ratios (1:1, 4:3, 16:9, etc.)
crop_size: Select from a list of valid crop sizes or "None" for automatic selection
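
To illustrate what "automatic selection" of a valid crop size could look like, here is a minimal sketch. It is not taken from this node's source. The validity rule is an assumption: both sides are multiples of a 32-pixel patch, the area does not exceed target_size squared, and the aspect ratio is at most 4:1. The function names are hypothetical.

```python
import math


def candidate_crop_sizes(target_size, patch=32, max_ratio=4.0):
    """Enumerate (width, height) pairs under the assumed validity rule.

    Assumptions: sides are multiples of `patch`, width * height does not
    exceed target_size ** 2, and the longer side is at most `max_ratio`
    times the shorter side.
    """
    max_patches = (target_size // patch) ** 2
    sizes = []
    for w in range(1, max_patches + 1):
        h = max_patches // w  # tallest height that keeps the area in budget
        if max(w, h) / min(w, h) <= max_ratio:
            sizes.append((w * patch, h * patch))
    return sizes


def snap_to_valid(width, height, target_size, patch=32):
    """Pick the candidate whose aspect ratio is closest to width:height."""
    wanted = width / height
    return min(
        candidate_crop_sizes(target_size, patch),
        key=lambda s: abs(math.log((s[0] / s[1]) / wanted)),
    )
```

Under these assumptions, a 16:9 request such as snap_to_valid(1920, 1080, 768) resolves to (1024, 576), and a square request at target size 512 resolves to (512, 512). The list offered by the actual node may differ.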

Lumina-mGPT Image Generate

lumina_mgpt_model: Connected from the Load Lumina-mGPT Model node
prompt: Text prompt for image generation
resolution: Image resolution (can be connected from Crop Selector)
cfg: Classifier-free guidance scale
seed: Random seed for generation (0 for random)
image_top_k: Top-k sampling parameter for image generation
temperature: Controls randomness in generation
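
The cfg, image_top_k, and temperature parameters interact when each image token is sampled. The sketch below shows the generic textbook formulation in plain Python; it is an illustration under that assumption, not the code Lumina-mGPT runs, and the function name and arguments are hypothetical.

```python
import math
import random


def sample_token(cond_logits, uncond_logits, cfg, top_k, temperature, rng):
    """Sample one token index using guidance, top-k filtering, and temperature."""
    # Classifier-free guidance: move away from the unconditional prediction
    # toward the prompt-conditioned one. cfg = 1 leaves cond_logits unchanged.
    guided = [u + cfg * (c - u) for c, u in zip(cond_logits, uncond_logits)]

    # Top-k: keep only the k highest-scoring token indices.
    keep = sorted(range(len(guided)), key=guided.__getitem__, reverse=True)[:top_k]

    # Temperature: values below 1 sharpen the distribution, values above 1
    # flatten it. Assumes temperature > 0.
    scaled = [guided[i] / temperature for i in keep]
    peak = max(scaled)
    weights = [math.exp(s - peak) for s in scaled]  # unnormalized softmax

    return rng.choices(keep, weights=weights, k=1)[0]
```

In this formulation, a higher cfg makes the output follow the prompt more closely, a smaller top-k restricts sampling to the most likely tokens, and a lower temperature makes the choice among them more deterministic.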

Lumina-mGPT Converter

image: Input image to convert to ComfyUI-compatible format

Lumina-mGPT Decoder

latent: Input latent representation to decode into an image
output_type: Choose between "IMAGE" or "LATENT" output

Outputs

IMAGE: Generated or decoded image
LATENT: Latent representation of the generated image

Examples

512 x 512

768 x 768

Workflow

Known Features and Limitations

Supports multiple model sizes for different use cases
Implements cfg and image top-k parameters for controlling the generation process
Outputs both images and latent representations
Includes converter and decoder nodes for enhanced compatibility and flexibility
Improved latent space handling ensures better compatibility with ComfyUI's expectations
Automatic adjustment of crop sizes to valid dimensions for each model size
Enhanced error handling and logging for easier troubleshooting and debugging

Troubleshooting

If you encounter any issues, please check the console output for error messages. Common issues include:

Insufficient GPU memory
Missing dependencies
Incorrect model or tokenizer path
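
A quick way to narrow down a missing-dependency problem is to ask Python which modules it cannot find. This is a minimal sketch using the standard library; the helper name is hypothetical, and the module names in the usage note are assumptions based on the packages mentioned in this README.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found by the interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Run it with the same interpreter that ComfyUI uses, for example missing_modules(["torch", "transformers", "xllmx"]). Any name in the returned list is not importable from that environment.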

For further assistance, please open an issue on the GitHub repository.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Acknowledgements

Lumina-mGPT for the Lumina-mGPT model
The ComfyUI community for their continuous support and inspiration
Meta for providing the Chameleon VQ-VAE weights
