Added nvfp4 official support. #438

Open

BuffMcBigHuge wants to merge 6 commits into main from marco/feat/nvfp4
Conversation

@BuffMcBigHuge BuffMcBigHuge commented Feb 11, 2026

Summary

Adds official support for NVFP4 quantization, enabling ~4x weight memory reduction on Blackwell GPUs (SM >= 10.0). NVFP4 uses NVIDIA's E2M1 format and hardware-accelerated Tensor Core kernels via comfy-kitchen.

What's new

  • NVFP4 quantization option – Users with Blackwell GPUs (RTX 50xx, B100, etc.) can select nvfp4 (Blackwell) in the quantization dropdown when their hardware supports it.
  • Shared quantization pipeline – FP8 and NVFP4 logic is centralized in quantization_utils.py, replacing duplicated FP8 logic across pipelines.
  • Hardware detection – The server exposes supports_nvfp4 in the hardware info API based on CUDA device capability (SM >= 10.0). The UI only shows the NVFP4 option when supported.
  • VACE compatibility – VACE components now support both FP8 and NVFP4 quantization when enabled.
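The capability gate described above can be sketched as a pure-Python check (hypothetical helper names; the actual server would read the CUDA device capability, e.g. via torch.cuda.get_device_capability()):

```python
def supports_nvfp4(capability: tuple[int, int]) -> bool:
    """NVFP4 needs Blackwell-class Tensor Cores (SM >= 10.0)."""
    return capability >= (10, 0)


def supports_fp8_e4m3fn(capability: tuple[int, int]) -> bool:
    """FP8 via torchao needs Ada or newer (SM >= 8.9)."""
    return capability >= (8, 9)


# An Ada GPU (SM 8.9) qualifies for FP8 but not NVFP4;
# a Blackwell GPU (SM >= 10.0) qualifies for both.
```

Tuple comparison makes the SM threshold check read the same way NVIDIA writes compute capabilities (major, minor).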

Technical details

  • NVFP4 uses comfy-kitchen's QuantizedTensor with TensorCoreNVFP4Layout for hardware-accelerated matmul on Blackwell.
  • FP8 (fp8_e4m3fn) continues to use torchao for Ada+ GPUs (SM >= 8.9), with ~2x weight memory reduction.
  • Dependencies: Adds comfy-kitchen[cublas]>=0.1.0 (Linux/Windows) and torchaudio==2.9.1 for future audio support.
  • Fallback: If a user selects NVFP4 and later switches to a non-Blackwell GPU (e.g. from persisted state), the UI resets to fp8_e4m3fn.
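The fallback rule above is a small pure function. A sketch with assumed names (the real logic lives in the TypeScript frontend against persisted state):

```python
def resolve_quantization(persisted: str, supports_nvfp4: bool) -> str:
    """Reset a persisted NVFP4 selection when the current GPU can't run it."""
    if persisted == "nvfp4" and not supports_nvfp4:
        return "fp8_e4m3fn"
    return persisted
```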

Pipelines updated

All pipelines that support quantization now use the shared apply_quantization():

  • krea_realtime_video
  • longlive
  • memflow
  • reward_forcing
  • streamdiffusionv2
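The log output later in this thread ("Skipped 600 LoRA adapter layers", "Quantizing 301 Linear layers to NVFP4") suggests a selection step inside the shared helper. A hypothetical sketch of that step (the real code would walk model.named_modules(); the dict here is a stand-in):

```python
def select_layers_to_quantize(layers: dict[str, str]) -> tuple[list[str], list[str]]:
    """Split module names into (to_quantize, skipped).

    `layers` maps a module name to its type name. LoRA adapter
    layers are left in high precision; remaining Linear layers
    are candidates for NVFP4/FP8 quantization.
    """
    to_quantize, skipped = [], []
    for name, kind in layers.items():
        if "lora" in name:  # leave LoRA adapters unquantized
            skipped.append(name)
        elif kind == "Linear":
            to_quantize.append(name)
    return to_quantize, skipped


layers = {
    "blocks.0.attn.q": "Linear",
    "blocks.0.attn.q.lora_A": "Linear",
    "blocks.0.norm": "LayerNorm",
}
quant, skipped = select_layers_to_quantize(layers)
```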

Files changed

| Area | Changes |
| --- | --- |
| Backend | New quantization_utils.py, enum update, 6 pipelines refactored, VACE mixin, hardware info API |
| Frontend | TypeScript types, supportsNvfp4 wiring, quantization dropdown with conditional NVFP4 option, persisted state reset |
| Deps | comfy-kitchen[cublas], torchaudio in pyproject.toml |

Signed-off-by: BuffMcBigHuge <marco@bymar.co>
@BuffMcBigHuge BuffMcBigHuge marked this pull request as ready for review February 12, 2026 02:19
yondonfu commented Feb 13, 2026

I'll look into this more later, but on the first run with this branch with default LongLive settings:

```
2026-02-13 16:32:10,997 - scope.server.pipeline_manager - INFO - Initial load params: {'height': 320, 'width': 576, 'quantization': 'nvfp4', 'vace_enabled': False}
2026-02-13 16:32:10,998 - scope.server.pipeline_manager - INFO - VACE disabled by load_params, skipping VACE configuration
2026-02-13 16:32:11,804 - scope.core.pipelines.wan2_1.vace.mixin - INFO - _init_vace: No vace_path provided, VACE disabled
Loaded diffusion LoRA in 2.525s
2026-02-13 16:32:14,329 - scope.core.pipelines.wan2_1.lora.mixin - INFO - _init_loras: Found 0 LoRA configs to load
2026-02-13 16:32:15,419 - scope.core.pipelines.quantization_utils - INFO - Skipped 600 LoRA adapter layers
2026-02-13 16:32:15,419 - scope.core.pipelines.quantization_utils - INFO - Quantizing 301 Linear layers to NVFP4
2026-02-13 16:32:15,555 - scope.core.pipelines.quantization_utils - INFO - GPU memory before NVFP4 quantization: 3.30 GB
2026-02-13 16:32:15,562 - scope.server.pipeline_manager - ERROR - Failed to load pipeline longlive: CUDA kernel launch failed: CUDA driver version is insufficient for CUDA runtime version. If this error persists, consider removing the models directory 'C:\Users\yondo\.daydream-scope\models' and re-downloading models.
2026-02-13 16:32:15,568 - scope.server.pipeline_manager - ERROR - Failed to load pipeline: longlive
2026-02-13 16:32:15,568 - scope.server.pipeline_manager - ERROR - Some pipelines failed to load
```

Would be helpful to note the CUDA driver version required.

EDIT: I updated to the latest driver version on my PC (see details below) and it runs now. Will share test results separately.

```
NVIDIA-SMI 591.74                 Driver Version: 591.74         CUDA Version: 13.1
```
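A pre-flight check could turn that opaque kernel-launch failure into an actionable message. A hypothetical sketch (the actual minimum driver version would need to be confirmed against comfy-kitchen's CUDA build, so it is taken as a parameter rather than hard-coded):

```python
def check_driver(current: tuple[int, int], minimum: tuple[int, int]) -> None:
    """Fail fast with a clear message instead of
    'CUDA driver version is insufficient for CUDA runtime version'."""
    if current < minimum:
        raise RuntimeError(
            f"NVFP4 kernels require NVIDIA driver >= {minimum[0]}.{minimum[1]}, "
            f"found {current[0]}.{current[1]}; please update your driver."
        )
```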

```python
from .enums import Quantization as Quantization  # noqa: PLC0414
from .enums import VaeType as VaeType  # noqa: PLC0414

# Re-export quantization utilities
```
why re-export?
