
Conversation

@sanketpurandare (Contributor) commented Nov 18, 2025:

Stacked PRs:


Enable DualPipeV by adding a multiplexed graph. The multiplexed graph does not yet do any overlapping; it executes the backward and forward of different stages and microbatches within a single graph.

stack-info: PR: #258, branch: sanketpurandare/stack/1
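
A minimal, hypothetical sketch (not this PR's implementation) of the multiplexing idea: copy a backward graph and a forward graph into one torch.fx graph so a single call executes the backward of one stage/microbatch and then the forward of another, with no overlap. The helper name multiplex and the toy traced functions are invented for illustration.

import torch
from torch import fx

def multiplex(bw_gm: fx.GraphModule, fw_gm: fx.GraphModule) -> fx.GraphModule:
    g = fx.Graph()
    val_map: dict[fx.Node, fx.Node] = {}
    bw_out = g.graph_copy(bw_gm.graph, val_map)  # backward nodes/placeholders copied first
    fw_out = g.graph_copy(fw_gm.graph, val_map)  # forward nodes appended after them
    g.output((bw_out, fw_out))                   # flat outputs: (bw_outs, fw_outs)
    return fx.GraphModule({}, g)

bw_gm = fx.symbolic_trace(lambda grad: grad + 1)  # stand-in "backward" graph
fw_gm = fx.symbolic_trace(lambda x: x * 2)        # stand-in "forward" graph
combined = multiplex(bw_gm, fw_gm)
print(combined(torch.ones(2), torch.ones(2)))     # (tensor([2., 2.]), tensor([2., 2.]))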
@meta-cla bot added the CLA Signed label Nov 18, 2025
@sanketpurandare changed the title from "Enable DualPipeV by adding a multiplxed graph" to "Enable DualPipeV by adding a multiplexed graph" Nov 18, 2025
insert_point = n
if n.op == "output":
multiplexed_graph_op_node = n
multiplxed_gm_placeholders = multiplexed_gm.graph.find_nodes(op="placeholder")
Contributor:
typo multiplxed_

# Collect output arguments from forward graph
# Collect output arguments from multiplexed graph (will contain only fwd_outs)
multiplexed_graph_outputs = multiplexed_gm.graph.find_nodes(op="output")
assert len(multiplexed_graph_outputs) == 1
Contributor:
Why this assert? Is this a tuple of forward outs, or are you hardcoding the model arch into the infra?

@sanketpurandare (Contributor, Author) replied Nov 19, 2025:
graph.find_nodes() returns a list of nodes matching the filter criteria; this assert is a sanity check that there is exactly one output node. It says nothing about the args of the output node.
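
A minimal sketch (not from this PR) of the point: find_nodes() returns a list, and a traced fx graph has exactly one output node no matter how many values the module returns.

import torch
from torch import fx

class M(torch.nn.Module):
    def forward(self, x):
        return x + 1, x * 2            # two returned values

gm = fx.symbolic_trace(M())
outputs = gm.graph.find_nodes(op="output")
assert len(outputs) == 1               # sanity check: a single output node
print(outputs[0].args)                 # that node's args hold all returned values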

) -> tuple[list[Any], list[Any], Any, tuple[tuple[Any], tuple[Any]]]:
multiplexed_outs = fx.Interpreter(multiplexed_fw_bw_module).boxed_run(bw_fw_args)

num_params_buffers = bw_graph_meta.num_params + bw_graph_meta.num_buffers
Contributor:
Maybe in a later PR, it would be nice if the decoding of the multiplexed_outs could be done by a module generated in the same place that generated the multiplexed graph. I mostly don't like how this function hardcodes a specific 'API' that is defined somewhere else, with no enforcement that the two stay consistent.
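
A hypothetical sketch of that suggestion (names invented, not this PR's code): the code that builds the multiplexed graph also emits a small spec that knows how to decode its flat outputs, so producer and consumer cannot drift apart.

from dataclasses import dataclass
from typing import Any

@dataclass
class MultiplexedOutSpec:
    num_param_buffer_grads: int
    num_input_grads: int

    def decode(self, flat_outs: list[Any]) -> tuple[list[Any], list[Any], list[Any]]:
        a = self.num_param_buffer_grads
        b = a + self.num_input_grads
        # (param/buffer grads, input grads, forward outputs)
        return flat_outs[:a], flat_outs[a:b], flat_outs[b:]

spec = MultiplexedOutSpec(num_param_buffer_grads=2, num_input_grads=1)
print(spec.decode([0, 1, 2, 3, 4]))    # ([0, 1], [2], [3, 4])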

numerics_logs: Optional[list[str]] = None,
) -> None:
) -> tuple[
_PipelineScheduleRuntime,
Contributor:
It's a bit odd to me that this util function is (a) not a class method on the runtime, and (b) returns a runtime object among other things. (From the description, mostly doing setup stuff and waiting on ops, it seems this function could plausibly have a null return type.)

mb_index,
is_next_stage_on_this_rank,
is_prev_stage_on_this_rank,
) = _prepare_fwd_common(action, ctx)
Contributor:
I see why it's this way now. But maybe you can:
(a) get any widely used objects off ctx into local vars inside stage_forward and pass those into the util functions;
(b) just access the less widely used objects directly off ctx inside the utils, to reduce the number of passed-around locals / API surface.
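
A rough sketch of (a)/(b) with invented names (not this PR's code): stage_forward pulls the widely used objects off ctx into locals and passes them explicitly, while the util reads the rarely used fields directly off ctx.

from dataclasses import dataclass, field
from typing import Any

@dataclass
class Ctx:
    stages: dict[int, Any] = field(default_factory=dict)     # widely used
    numerics_logs: list[str] = field(default_factory=list)   # rarely used

def _run_fwd(stage: Any, mb_index: int, ctx: Ctx) -> None:
    ctx.numerics_logs.append(f"fwd stage={stage} mb={mb_index}")  # (b) rare field read off ctx

def stage_forward(stage_index: int, mb_index: int, ctx: Ctx) -> None:
    stage = ctx.stages[stage_index]    # (a) widely used object -> local, passed explicitly
    _run_fwd(stage, mb_index, ctx)

ctx = Ctx(stages={0: "stage0"})
stage_forward(0, 3, ctx)
print(ctx.numerics_logs)               # ['fwd stage=stage0 mb=3']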

@sanketpurandare (Contributor, Author) replied:
Will try some of this refactoring in the next PR.

@wconstab (Contributor) left a comment:
mostly lgtm!

), f"num_params_buffers: {num_params_buffers}, num_params: {num_params}, num_buffers: {num_buffers}"
num_input_grads = (
len(bw_module.graph.find_nodes(op="output")[0].args[0]) - num_params_buffers
)
@xmfan (Member) Nov 19, 2025:
I'm probably getting tripped up by the naming convention here; num_input_grads is generally filtered from num_fwd_outputs, right?

@sanketpurandare (Contributor, Author) replied Nov 19, 2025:
You can get the number of input_grads either from the placeholders of the fwd_graph or the outputs of the bwd_graph. num_fwd_outputs will give you the num_out_grads (tangents) that will be passed in as bwd_graph inputs.
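
A toy illustration of the two counts (assumed AOT-partitioner-style output layout, not this PR's code): the bwd graph's single output node returns grads for params and buffers first, then input grads, so subtracting the param/buffer count from the bwd output arity gives the number of input grads; num_fwd_outputs instead counts the tangents fed into the bwd graph.

num_params, num_buffers = 2, 1
num_params_buffers = num_params + num_buffers         # 3
len_bw_output_args = 5                                # 3 param/buffer grads + 2 input grads
num_input_grads = len_bw_output_args - num_params_buffers
assert num_input_grads == 2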

@sanketpurandare merged commit 06d5f49 into main Nov 19, 2025
6 checks passed