Files gathered from https://huggingface.co/bertbobson/Ideogram-4-INT8-ConvRot
This is a double quantization, unavoidable given the release state of Ideogram4: the source weights were already FP8.
The FP8 weights were dequantized to FP32 using their FP8 scales, then downcast to BF16, and finally converted to INT8.
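A rough sketch of that FP8 -> FP32 -> BF16 -> INT8 path, in plain Python so it runs without any dependencies. It is not the actual conversion script. The E4M3FN byte layout, the single per-tensor FP8 scale, the per-row absmax INT8 scale, and all function names here are assumptions made for illustration, and the ConvRot rotation step is left out entirely.

```python
import struct

def fp8_e4m3_to_float(byte: int) -> float:
    """Decode one FP8 byte, assuming E4M3FN (1 sign, 4 exponent, 3 mantissa bits, bias 7)."""
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0x0F
    man = byte & 0x07
    if exp == 0x0F and man == 0x07:
        return float("nan")  # E4M3FN has no infinities, only NaN
    if exp == 0:
        return sign * (man / 8.0) * 2.0 ** -6  # subnormal range
    return sign * (1.0 + man / 8.0) * 2.0 ** (exp - 7)

def to_fp32(x: float) -> float:
    """Round a Python float (64-bit) to FP32 precision."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

def to_bf16(x: float) -> float:
    """Round an FP32 value to BF16: keep the top 16 bits, round to nearest even."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits += 0x7FFF + ((bits >> 16) & 1)
    bits &= 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def quantize_int8(row):
    """Symmetric absmax INT8 quantization of one row (assumed scheme); returns (values, scale)."""
    absmax = max(abs(v) for v in row)
    scale = absmax / 127.0 if absmax > 0 else 1.0
    q = [max(-127, min(127, round(v / scale))) for v in row]
    return q, scale

def requantize(fp8_bytes, fp8_scale):
    """FP8 bytes -> FP32 (apply FP8 scale) -> BF16 -> INT8 values plus INT8 scale."""
    fp32 = [to_fp32(fp8_e4m3_to_float(b) * fp8_scale) for b in fp8_bytes]
    bf16 = [to_bf16(v) for v in fp32]
    return quantize_int8(bf16)

# Hypothetical example row: four FP8 bytes sharing one FP8 scale of 0.5.
int8_values, int8_scale = requantize([0x40, 0x3C, 0xBC, 0x30], 0.5)
```

Each stage rounds again, which is where the extra error of a double quantization comes from: the INT8 grid is fitted to values that already sit on the FP8 grid rather than to the original full-precision weights.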
For use in ComfyUI with https://github.com/BobJohnson24/ComfyUI-INT8-Fast
Speed: 2.03 s/it versus 3.62 s/it for FP8 on a 3090, without torch compile (about 1.78x faster).
4.4-6.2 s/it on my 3060 12 GB.
~2x faster with torch compile.

