Roofline quantized conv3d/2d layer #3419
Conversation
```python
    and not is_sm_at_least_100()
)

if skip_conv_benchmarks:
```
I feel the conditions are a bit convoluted here. Maybe:

```python
if do_benchmarks:
    if op_name in ("conv2d", "conv3d") and not is_sm_at_least_100():
        # print warning
    else:
        # can also move this part to a function to make it clearer
        ...
```
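A runnable version of that restructure might look like the sketch below. This is an illustration only: the `run_benchmarks` helper is hypothetical, and the import path for `is_sm_at_least_100` is an assumption, not necessarily the PR's actual code.

```python
# Sketch only: assumes is_sm_at_least_100 lives in torchao.utils and that a
# hypothetical run_benchmarks helper wraps the existing benchmark body.
from torchao.utils import is_sm_at_least_100


def run_benchmarks(op_name: str) -> None:
    ...  # placeholder for the existing benchmark body


def maybe_run_benchmarks(op_name: str, do_benchmarks: bool) -> None:
    if not do_benchmarks:
        return
    if op_name in ("conv2d", "conv3d") and not is_sm_at_least_100():
        # warn instead of silently skipping conv benchmarks on older GPUs
        print(f"skipping {op_name} benchmarks: requires SM 10.0 or newer")
        return
    run_benchmarks(op_name)
```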
```python
r_speedup = None
# use roofline model to estimate gemm time using equivalent GEMM dims
r_bf16_gemm_time_s = float(
    bf16_gemm_time_sympy.subs(M, gemm_M).subs(K, gemm_K).subs(N, gemm_N)
```
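For context, the `.subs` chain above evaluates a symbolic roofline expression at concrete GEMM dims. A toy, self-contained example of the pattern; the cost formula and numbers here are made up for illustration:

```python
import sympy

M, K, N = sympy.symbols("M K N")
# Toy compute-bound time model: 2*M*K*N FLOPs at 1e15 FLOP/s (illustrative only).
bf16_gemm_time_sympy = 2 * M * K * N / 1e15

gemm_M, gemm_K, gemm_N = 4096, 1024, 2048
r_bf16_gemm_time_s = float(
    bf16_gemm_time_sympy.subs(M, gemm_M).subs(K, gemm_K).subs(N, gemm_N)
)
print(r_bf16_gemm_time_s)  # ~1.7e-5 s under this toy model
```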
Are the memory operations of conv the same as for linear as well?
ao/torchao/testing/training/roofline_utils.py, line 332 in 0975a40:

```python
mem_gemm_time_s = (
```
As conv is an implicit GEMM, I'm assuming the memory operations for gemm and conv should be the same.
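For reference, the standard implicit-GEMM dimension mapping, which is presumably what the equivalent GEMM dims above encode. The NCDHW layout and variable names here are assumptions for illustration, not taken from the PR:

```python
# Sketch of the usual implicit-GEMM dimension mapping for conv3d (NCDHW layout).
def conv3d_equivalent_gemm_dims(n, c_out, c_in, d_out, h_out, w_out, k_d, k_h, k_w):
    gemm_M = n * d_out * h_out * w_out  # one GEMM row per output position
    gemm_K = c_in * k_d * k_h * k_w     # reduction over each input patch
    gemm_N = c_out                      # one GEMM column per output channel
    return gemm_M, gemm_K, gemm_N


# e.g. batch 8, 128 out channels, 64 in channels, 16x56x56 output, 3x3x3 kernel
print(conv3d_equivalent_gemm_dims(8, 128, 64, 16, 56, 56, 3, 3, 3))
```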
```python
    print()
else:
    # TODO: enable roofline analysis for conv
    pass
```
If we share the same logic, we should remove this if/else branch and inline the code from the `if` branch here, I think.
I kept the if/else just in case of an unexpected op_name. We can either make the code verify op_name at the beginning to avoid any errors, or assume the input for op_name will always be correct.
We already verify that at L264, I think.
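For what it's worth, the up-front check being discussed could be as small as the following; the accepted set of op names is an assumption based on this PR's scope:

```python
# Hypothetical early validation of op_name; the valid set is assumed.
VALID_OP_NAMES = ("linear", "conv2d", "conv3d")


def validate_op_name(op_name: str) -> None:
    if op_name not in VALID_OP_NAMES:
        raise ValueError(
            f"unexpected op_name {op_name!r}, expected one of {VALID_OP_NAMES}"
        )
```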
```python
# real gemm benchmark time, also not added yet
# if enabled, also measured observed gemm time
# gemm benchmarks for conv not implemented, as conv uses implicit GEMM
```
We should run the conv ops, I think?
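If conv benchmarks do get added, one way to time the underlying op directly is `torch.utils.benchmark`; the shapes and dtype below are illustrative, not taken from the PR:

```python
# Sketch: timing a bf16 conv3d with torch.utils.benchmark (CUDA assumed).
import torch
import torch.nn.functional as F
from torch.utils import benchmark

x = torch.randn(8, 64, 16, 56, 56, device="cuda", dtype=torch.bfloat16)
w = torch.randn(128, 64, 3, 3, 3, device="cuda", dtype=torch.bfloat16)

timer = benchmark.Timer(
    stmt="F.conv3d(x, w, padding=1)",
    globals={"F": F, "x": x, "w": w},
)
print(timer.timeit(100))  # Measurement object with per-run timing stats
```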