Generate high-quality square images from text with LongCat Image.
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning — you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy
Open preloaded workflow on RunComfy (browser)
Why RunComfy first
- Fewer missing-node surprises — run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout — useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON — the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense — you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
Expectations — First run may pull large weights; cloud runs may require a free RunComfy account.
Overview
LongCat Image text to image is a straightforward ComfyUI workflow for square prompt-to-image generation. It combines the LongCat-Image model, the Qwen 2.5 VL text encoder, and the AE VAE in a compact graph that is easy to iterate on. The default setup generates 1024x1024 images in 20 steps, and you can increase the step count if you want to compare against the original recommended setting.
Important nodes:
Resolution SelectorText to Image (LongCat Image)Save Image
Notes
LongCat Image ComfyUI Workflow | Square Text to Image Generation — see RunComfy page for the latest node requirements.

