Hybrid-Sensitivity-Weighted-Quantization (HSWQ)

High-fidelity FP8 quantization for SDXL, Flux1.dev, and Z Image Turbo diffusion models. HSWQ uses sensitivity and importance analysis instead of naive uniform cast. It offers two modes: standard-compatible (V1) and high-performance scaled (V2). V2 requires a dedicated loader and is not usable at the current time.

Technical details: md/HSWQ_ Hybrid Sensitivity Weighted Quantization.md

SDXL models: Hugging Face — Hybrid-Sensitivity-Weighted-Quantization-SDXL-fp8e4m3

How to quantize

SDXL: How to quantize SDXL
Z Image Turbo: How to quantize Z Image Turbo

Benchmark results: SDXL (MSE / SSIM)

Overview

Feature	V1: Standard Compatible	V2: High Performance Scaled
Compatibility	Full (100%), any FP8 loader	Requires dedicated loader — not usable at present
File format	Standard FP8 (`torch.float8_e4m3fn`)	Extended FP8 (weights + `.scale` metadata)
Image quality (SSIM)	~0.98 (max)	Unmeasurable (no dedicated loader)
Mechanism	Optimal clipping (smart clipping)	Full-range scaling (dynamic scaling)
Benchmark	Measurable	Currently unmeasurable (no dedicated loader)
Use case	Distribution, general users	Unavailable until a dedicated loader exists

File size is reduced by about 30–40% vs FP16 while keeping best quality per use case.

Architecture

Dual Monitor System — During calibration, two metrics are collected:
- Sensitivity (output variance): layers that hurt image quality most if corrupted → top 10–25% kept in FP16 (for SDXL and ZIT, 10% often gives sufficient quality).
- Importance (input mean absolute value): per-channel contribution → used as weights in the weighted histogram. Technical details: Dual Monitor System — Technical Guide.
Rigorous FP8 Grid Simulation — Uses a physical grid (all 0–255 values cast to torch.float8_e4m3fn) instead of theoretical formulas, so MSE matches real runtime.
Weighted MSE Optimization — Finds parameters that minimize quantization error using the importance histogram. Technical details: Weighted Histogram MSE — Technical Guide.

Modes

V1 (scaled=False): No scaling; only the clipping threshold (amax) is optimized. Output is standard FP8 weights. Use this mode — full compatibility with any FP8 loader.
V2 (scaled=True): Weights are scaled to FP8 range, quantized, and inverse scale S is stored in Safetensors (.scale). Requires a dedicated loader; not usable at the current time.

Recommended Parameters

Samples: 32 (recommended) — number of calibration samples.
Steps: 25 — number of inference steps per sample during calibration.
Keep ratio: 10–25% — keeps critical layers in FP16. For SDXL and ZIT, 10% often gives sufficient quality.
Latent: 32–256, default 128 — calibration latent size (H/W). Use --latent 32 for faster calibration, --latent 256 for higher fidelity.

Benchmark (Reference)

Model	SSIM (Avg)	File size	Compatibility
Original FP16	1.0000	100%	High
Naive FP8	0.75–0.93	50%	High
HSWQ V1	0.86–0.98	60-70% (FP16 mixed)	High
HSWQ V2	— (currently unmeasurable)	60-70% (FP16 mixed)	Not usable (no dedicated loader)

HSWQ V1 gives a clear gain over Naive FP8 with full compatibility. V2 would offer higher quality but requires a dedicated loader; benchmark is currently unmeasurable and V2 is not usable at the current time.

Changelog

Version history and release notes are in CHANGELOG.md.

Base Repositories

This project is built upon the following repositories:

ComfyUI — The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface by @Comfy-Org.

Name		Name	Last commit message	Last commit date
Latest commit History 210 Commits
ComfyUI-master		ComfyUI-master
analyze		analyze
archives		archives
benchmark		benchmark
clip		clip
histogram		histogram
md		md
sample		sample
test		test
.commitmsg		.commitmsg
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
icon.png		icon.png
logo.png		logo.png
native_convert_fp8.py		native_convert_fp8.py
pyproject.toml		pyproject.toml
quantize_flux_hswq_v1.2.py		quantize_flux_hswq_v1.2.py
quantize_flux_hswq_v1.6.py		quantize_flux_hswq_v1.6.py
quantize_sdxl_hswq_v1.3.py		quantize_sdxl_hswq_v1.3.py
quantize_zib_hswq_v1.92.py		quantize_zib_hswq_v1.92.py
quantize_zit_hswq_v1.6.py		quantize_zit_hswq_v1.6.py
requirements.txt		requirements.txt
verify_fp8_grid.py		verify_fp8_grid.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid-Sensitivity-Weighted-Quantization (HSWQ)

How to quantize

Overview

Architecture

Modes

Recommended Parameters

Benchmark (Reference)

Changelog

Base Repositories

About

Uh oh!

Releases 11

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hybrid-Sensitivity-Weighted-Quantization (HSWQ)

How to quantize

Overview

Architecture

Modes

Recommended Parameters

Benchmark (Reference)

Changelog

Base Repositories

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages