Conversation

@nvmbreughe (Contributor) commented Sep 29, 2025

📌 Description

This PR adds is_*_supported checks for backend and compute capability, implemented through decorators.

  1. This lets callers check support before running a kernel.
  2. It also wraps the original function so the support check runs automatically before execution.

Example of (1):
[Screenshot omitted: shows a support check being queried before running a kernel.]
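Roughly, the usage pattern for (1) looks like the following sketch. The decorator name, the capabilities parameter, and the is_backend_supported attribute come from this PR's diff; the kernel name, arguments, and the exact is_backend_supported signature are illustrative assumptions.

    # Hypothetical usage sketch; mm_fp4's arguments and the exact
    # is_backend_supported signature are assumptions, not copied from the diff.
    @supports_backends(["cudnn", "trtllm", "cutlass"], capabilities=["100", "103"])
    def mm_fp4(a, b, backend="cudnn"):
        ...

    # (1) Query support up front, without running the kernel:
    mm_fp4.is_backend_supported("trtllm")  # -> True or False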

Note that, to keep behavior uniform across APIs, this mechanism is limited to backend and compute-capability checks. Any other kind of check remains part of the regular function implementation.

Note further that compute-capability checks are optional; by default, all compute capabilities are assumed to pass unless specified otherwise.

🔍 Related Issues

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

  • I have installed pre-commit by running pip install pre-commit (or used your preferred method).
  • I have installed the hooks with pre-commit install.
  • I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

  • Tests have been added or updated as needed.
  • All tests are passing (unittest, etc.).

Reviewer Notes



On the diff:

    @supports_backends(
        ["cudnn", "trtllm", "cutlass"],

Member: Is this redundant with the declaration of the backend parameter?

    backend: Literal["cudnn", "trtllm", "cutlass"] = "cudnn",
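One way to avoid the duplication the reviewer points at (a hypothetical sketch, not something in this PR) is to derive the allowed backends from the parameter's Literal annotation:

    from typing import Literal, get_args, get_type_hints

    def backends_from_signature(func):
        # Read the allowed values out of the `backend` parameter's
        # Literal[...] annotation instead of repeating them in the decorator.
        hints = get_type_hints(func)
        return list(get_args(hints["backend"]))

    def mm_fp4(a, b, backend: Literal["cudnn", "trtllm", "cutlass"] = "cudnn"):
        ...

    backends_from_signature(mm_fp4)  # -> ['cudnn', 'trtllm', 'cutlass']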



On the diff:

    def supports_backends(
        backends, capabilities=None, anti_capabilities=None, capability_tensor_arg=None

Member: nit: I wonder if "cc" or "compute_capabilities" would be clearer than "capabilities" -- or did you mean to signal something more generic than compute capabilities?

On the diff:

    wrapper.is_backend_supported = is_backend_supported
    wrapper.is_problem_size_supported = is_problem_size_supported
    wrapper.__name__ = func.__name__
    wrapper.__doc__ = func.__doc__

Contributor: Can use functools.wraps for more robust wrapping and a standardized interface.
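A minimal sketch of the suggested change, assuming a simplified version of this PR's decorator (the default-backend handling and error message are illustrative):

    import functools

    def supports_backends(backends, capabilities=None, anti_capabilities=None,
                          capability_tensor_arg=None):
        def decorator(func):
            def is_backend_supported(backend):
                return backend in backends

            # functools.wraps copies __name__, __doc__, __module__, __qualname__,
            # and sets __wrapped__ in one step, instead of assigning them manually.
            @functools.wraps(func)
            def wrapper(*args, **kwargs):
                backend = kwargs.get("backend", backends[0])
                if not is_backend_supported(backend):
                    raise ValueError(f"backend {backend!r} is not supported")
                return func(*args, **kwargs)

            # Expose the check so callers can query support before running.
            wrapper.is_backend_supported = is_backend_supported
            return wrapper

        return decorator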

On the diff:

    backend = kwargs.get("backend")
    capability = None
    if capability_tensor_arg:
        tensor = kwargs.get(capability_tensor_arg)

Contributor: Is there a reason why we need capability_tensor_arg instead of finding torch.Tensors automatically and getting the capability from them? We could also assert that they're all on the same device.
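A rough sketch of the suggested automatic discovery (a hypothetical helper, not part of the PR):

    import torch

    def _infer_capability(args, kwargs):
        # Collect all CUDA tensors among the arguments, assert they share a
        # device, and read the compute capability from that device.
        tensors = [v for v in (*args, *kwargs.values())
                   if isinstance(v, torch.Tensor) and v.is_cuda]
        if not tensors:
            return None
        devices = {t.device for t in tensors}
        assert len(devices) == 1, f"tensors span multiple devices: {devices}"
        major, minor = torch.cuda.get_device_capability(tensors[0].device)
        return f"{major}{minor}"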

On the diff:

    capabilities=None,
    anti_capabilities=None,
    capability_tensor_arg=None,
    problem_size_check=None,

@nvjullin (Contributor) commented Oct 1, 2025: Add type hints, perhaps excluding problem_size_check if it's too tedious, or just use typing.Callable.
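For example, the signature could be annotated along these lines (a sketch; the parameter types are inferred from their names and defaults, so treat them as assumptions):

    from typing import Callable, List, Optional

    def supports_backends(
        backends: List[str],
        capabilities: Optional[List[str]] = None,
        anti_capabilities: Optional[List[str]] = None,
        capability_tensor_arg: Optional[str] = None,
        problem_size_check: Optional[Callable[..., bool]] = None,
    ):
        ...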

@nvjullin (Contributor) commented Oct 1, 2025

The checks currently live very far away from the implementation and updating them to be consistent with each other can eventually become a maintenance problem. The conditional checks are also quite tricky to get correct. For example, it's not easy to tell if the mxfp4 checks are correct.

    if not use_nvfp4 and block_size != 32:
        raise ValueError("mxfp4 supports block_size = 32.")

    if backend != "cudnn" and not use_nvfp4:
        raise ValueError("Only cudnn FP4 GEMM supports mxfp4 quantization.")

Shouldn't the checks be reordered to avoid confusing error messages? (See the reordered sketch after this list.)

  1. User tries trtllm + block_size=16 and is rejected with "mxfp4 supports block_size = 32."
  2. User then tries trtllm + block_size=32 and is only then rejected with "Only cudnn FP4 GEMM supports mxfp4 quantization."
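For instance, swapping the two checks quoted above surfaces the backend restriction first, so the trtllm user gets the relevant message on the first try:

    # Reordered sketch of the two checks above: report the backend
    # restriction before the block_size restriction.
    if backend != "cudnn" and not use_nvfp4:
        raise ValueError("Only cudnn FP4 GEMM supports mxfp4 quantization.")

    if not use_nvfp4 and block_size != 32:
        raise ValueError("mxfp4 supports block_size = 32.")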

Instead of having one top-level supports_backends, perhaps consider a two-level design:

  1. A local requirement decorator, with a requirement function written for each backend entrypoint
  2. A top-level backend_requirement decorator that composes those requirements

For example:

    def cudnn_gemm_fp4_requirement(
        # ...
    ):
        if (
            not use_nvfp4
            and _match_sm_version(a.device, ["120"])
            and cudnn.backend_version() < 91400
        ):
            raise LibraryError(
                "cudnn FP4 GEMM with mxfp4 quantization is not supported on SM120 with cuDNN backend version < 9.14.0."
            )

        _check_cudnn_fp4_availability()
        # ...


    @requirement(cudnn_gemm_fp4_requirement, capability=["100", "101", "102"])
    def execute_cudnn_gemm_fp4_graph(
        # ...


    @backend_requirement({
        "cudnn": execute_cudnn_gemm_fp4_graph.requirement,
        "trtllm": # ...
    })
    def mm_fp4(
        # ...
This also means that all requirements are enforced to be local to the backend and won't affect each other.
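As a rough sketch of how the composing decorator could work, assuming each requirement function raises on failure (this is an illustration of the idea, not the proposed implementation):

    import functools

    def backend_requirement(requirements):
        # Dispatch to the per-backend requirement check, then run the kernel.
        def decorator(func):
            @functools.wraps(func)
            def wrapper(*args, **kwargs):
                backend = kwargs.get("backend", "cudnn")
                check = requirements.get(backend)
                if check is None:
                    raise ValueError(f"unsupported backend: {backend!r}")
                check(*args, **kwargs)  # raises if this backend's requirements fail
                return func(*args, **kwargs)
            return wrapper
        return decorator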
