home models images videos articles comics bounties challenges updates shop

HDR VAE (Anima - QWEN Image)

Name: HDR VAE (Anima - QWEN Image)
Rating: 5 (35 reviews)
Author: Felldude

484

Updated: Jun 21, 2026

style

hdr fp32

Download

1 variant available

SafeTensor

484.08 MB

Verified: 4 days ago

Download (484.08 MB)

Details

Type

VAE

Stats

357

Reviews

Positive

(31)

Published

Jun 21, 2026

Base Model

Anima

Hash

AutoV2

C3C0D4F85D

Tensors

Felldude

License:

Anima

The Anima Model is licensed by CircleStone Labs LLC. Copyright CircleStone Labs LLC. IN NO EVENT SHALL CIRCLESTONE LABS LLC BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH USE OF THIS MODEL.

Built on NVIDIA Cosmos

Qwen Image VAE

Full FP32 Training of Decoder
Works in ComfyUI

Feel free to to suggest onsite support, to civitai staff. I don't think they have any agreements like with FLUX

Overview

This model is a fine-tuned variant of the base Qwen Image VAE, modified to emphasize high-frequency detail preservation and expanded color representation, following an HDR-style reconstruction objective.

The evaluation compares the base and HDR-tuned models using perceptual, structural, distributional, and photometric metrics over identical input data.

Evaluation Summary

Perceptual Fidelity (LPIPS)

Base: 0.0177
HDR: 0.0786

The HDR model exhibits a significant increase in perceptual distance, indicating reduced strict identity reconstruction under deep feature similarity metrics and a shift toward detail-enhancing reconstruction behavior.

Structural Energy (Gradient Magnitude)

Ground Truth: 404.02 (both models)
Base Reconstruction: 313.46
HDR Reconstruction: 687.97

The base model demonstrates strong low-pass behavior with reduced high-frequency content. In contrast, the HDR model exhibits high-frequency amplification, exceeding the structural energy of the original inputs.

Color Distribution Support

Ground Truth: 33150.61 (both models)
Base Reconstruction: 35004.49
HDR Reconstruction: 40133.37

The HDR model produces a substantially expanded color support space, indicating increased chromatic dispersion and reduced quantization collapse.

Photometric Stability

Brightness Bias

Base: 0.000351
HDR: 0.0000098

Contrast Gain

Base: 0.9984
HDR: 0.99999

Both models preserve global photometric consistency, with the HDR variant showing near-perfect affine stability.

Channel Drift

Red Shift:
- Base: +0.0116
- HDR: +0.0104
Green Shift:
- Base: -0.0606
- HDR: -0.1856
Blue Shift:
- Base: +0.0187
- HDR: +0.0219

The HDR model introduces a significantly stronger negative bias in the green channel, while maintaining comparable red and blue stability.

Interpretation

The base Qwen VAE behaves as a contractive perceptual projection operator, prioritizing smooth reconstructions and suppression of high-frequency components.

The HDR-tuned variant transitions into a detail-amplifying reconstruction operator, characterized by:

Increased high-frequency energy
Expanded color manifold coverage
Higher perceptual divergence under LPIPS
Preserved global photometric invariance

This represents a functional shift from a smoothing autoencoder regime toward a high-frequency preserving (HDR-like) reconstruction regime.