chore(refactor): simplify graph ops #102

folded · 2025-05-26T04:01:47Z

Simplify the implementations of graph walking.
Update comments and move only_stages application

cpg-software-ci-bot · 2025-05-26T04:04:46Z

📊 SonarQube Summary

Metric	This PR	Main Branch
✅ Coverage	75.7%	75.7%
💨 Code Smells	15	17
🐞 Bugs	0	0
🔐 Vulnerabilities	0	0
🚨 Security Hotspots	1	0
🌟 Quality Gate	✅ OK	✅ OK

🔗 View Main Branch Report
🔗 View PR Report

MattWellie

V interesting, looks much simpler. Shadow compute test is compelling.

src/cpg_flow/workflow.py

cpg-software-ci-bot · 2025-05-26T04:32:08Z

📊 SonarQube Summary

Metric	This PR	Main Branch
✅ Coverage	75.7%	75.7%
💨 Code Smells	15	17
🐞 Bugs	0	0
🔐 Vulnerabilities	0	0
🚨 Security Hotspots	1	0
🌟 Quality Gate	✅ OK	✅ OK

🔗 View Main Branch Report
🔗 View PR Report

cpg-software-ci-bot · 2025-05-26T04:35:46Z

📊 SonarQube Summary

Metric	This PR	Main Branch
✅ Coverage	76.3%	76.3%
💨 Code Smells	44	47
🐞 Bugs	0	0
🔐 Vulnerabilities	0	0
🚨 Security Hotspots	1	0
📝 New Issues	0	0
🌟 Quality Gate	✅ OK	✅ OK

🔗 View Main Branch Report
🔗 View PR Report

Bring only_stages closer to first/last_stages.

violetbrina · 2025-06-16T00:22:30Z

Apologies for my delayed response.

Feel free to go ahead and bump the requests version to pass the security check.

If you could install the pre-commit hooks as well, that would be great. I should have said this earlier, but you can check it all out in the contributors file.

https://github.com/populationgenomics/cpg-flow/blob/main/CONTRIBUTING.md

If this branch contains a fix, include at least one commit of the form fix: ...; a new feature is feat: ...; and for a breaking change add an exclamation mark after the type, e.g. feat!: new breaking change

See https://www.conventionalcommits.org/en/v1.0.0/ for a full breakdown of the convention
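
As a rough illustration of the convention described above, here is a minimal sketch that classifies a commit header. The `classify` helper and the regular expression are hypothetical and deliberately simplified: the full specification also allows a `BREAKING CHANGE:` footer, which this sketch ignores.

```python
import re

# Simplified header shape: type, optional (scope), optional "!", then ": description".
# This is an illustrative pattern, not the official grammar.
HEADER = re.compile(
    r"^(?P<type>[a-z]+)(\((?P<scope>[^)]+)\))?(?P<breaking>!)?: (?P<description>.+)$"
)


def classify(header: str) -> "str | None":
    """Return 'breaking', the commit type, or None if the header does not match."""
    match = HEADER.match(header)
    if match is None:
        return None
    if match["breaking"]:
        return "breaking"
    return match["type"]


fix_kind = classify("fix: handle empty graph")
breaking_kind = classify("feat!: new breaking change")
scoped_kind = classify("chore(refactor): simplify graph ops")
unmatched = classify("simplify graph ops")
```

Under this sketch, the PR's original title ("simplify graph ops") does not match, while the renamed title does.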

violetbrina · 2025-06-17T01:45:28Z

Just need to merge so it's not out of sync with main.

github-actions · 2025-10-02T01:29:31Z

🐳 Docker Image Built

A new Docker image has been built for this PR:

Image: australia-southeast1-docker.pkg.dev/cpg-common/images-dev/cpg_flow:dfa7e69708407784fd683434db446ff80d390725

Pull command:

docker pull australia-southeast1-docker.pkg.dev/cpg-common/images-dev/cpg_flow:dfa7e69708407784fd683434db446ff80d390725

🔗 View in Google Cloud Console

This comment was automatically generated by the Docker workflow.

MattWellie

I like this, compelling test case, much more easily explained logical flow

rameshka

Hey @folded, thanks for taking on this work to improve the workflow logic—and sorry for taking too long to review it.
Overall, the proposed improvements look good. I’ve left a few comments for you to take a look at. Since this logic is fairly complex, adding some inline comments would also help make it easier to follow and maintain.

rameshka · 2026-01-23T10:07:47Z

src/cpg_flow/workflow.py

+        stages_dict: dict[str, 'Stage'] = {}  # noqa: UP037
+
+        def _make_once(cls) -> tuple['Stage', bool]:
+            try:


Instead of catching the KeyError, we can simplify this by using a safe lookup on stages_dict. Something like:

    instance = stages_dict.get(cls.__name__)
    if instance is not None:
        return instance, False
    instance = stages_dict[cls.__name__] = cls()
    return instance, True
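
To make the suggestion above concrete, here is a self-contained sketch of the "make once" pattern with a safe dictionary lookup. The `Stage` and `Align` classes are hypothetical stand-ins; in cpg-flow the argument would be a stage decorator rather than a bare class, so treat this only as an illustration of the lookup itself.

```python
from __future__ import annotations


class Stage:
    """Hypothetical stand-in for the real Stage class."""


class Align(Stage):
    """Hypothetical example stage."""


stages_dict: dict[str, Stage] = {}


def make_once(cls: type[Stage]) -> tuple[Stage, bool]:
    """Return (instance, created); created is True only the first time a class is seen."""
    instance = stages_dict.get(cls.__name__)
    if instance is not None:
        return instance, False
    # First sighting: build the instance and cache it under the class name.
    instance = stages_dict[cls.__name__] = cls()
    return instance, True


first, created_first = make_once(Align)
second, created_second = make_once(Align)
```

The second call returns the cached instance, so each stage class is instantiated at most once without a try/except around the lookup.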

rameshka · 2026-01-23T10:45:34Z

src/cpg_flow/workflow.py

-            stages_dict |= implicit_stages
+    def _instantiate_stages(
+        requested_stages: list['StageDecorator'], skip_stages: list[str], only_stages: list[str]
+    ) -> dict[str, 'Stage']:


This method is nice and removes some inefficiencies in the previous logic (e.g., processing the same stage more than once).

rameshka · 2026-01-23T11:33:14Z

src/cpg_flow/workflow.py

+                if not instance.skipped:
+                    instance.required_stages.extend(
+                        filter(None, map(_recursively_make_stage, instance.required_stages_classes)),
+                    )


I think, if we move the only_stages logic here, we can avoid re-iterating over the stages_dict logic (between lines 543-546).
Something like:

    if only_stages:
        if cls.__name__ not in only_stages:
            instance.skipped = True
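
One way this could look in context is sketched below. Everything here is an assumption for illustration: the `Stage` stand-in, the `requires` class attribute, and the `instantiate` helper are hypothetical, and the exact placement of the check in the real `_instantiate_stages` may differ.

```python
from __future__ import annotations


class Stage:
    """Hypothetical stand-in; `requires` lists the classes this stage depends on."""

    requires: list[type[Stage]] = []

    def __init__(self) -> None:
        self.skipped = False
        self.required_stages: list[Stage] = []


class A(Stage):
    pass


class B(Stage):
    requires = [A]


class C(Stage):
    requires = [B]


def instantiate(requested: list[type[Stage]], only_stages: list[str]) -> dict[str, Stage]:
    """Instantiate each stage once, marking stages outside only_stages as skipped."""
    stages: dict[str, Stage] = {}

    def make(cls: type[Stage]) -> Stage:
        name = cls.__name__
        if name in stages:
            return stages[name]
        instance = stages[name] = cls()
        # Apply only_stages at creation time, so no second pass over the dict is needed.
        if only_stages and name not in only_stages:
            instance.skipped = True
        # Only walk into the dependencies of stages that will actually run.
        if not instance.skipped:
            instance.required_stages.extend(make(dep) for dep in cls.requires)
        return instance

    for cls in requested:
        make(cls)
    return stages


result = instantiate([C], only_stages=["C"])
```

In this sketch, B is created but marked skipped, and A is never instantiated because the walk stops at a skipped stage.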

rameshka · 2026-01-27T06:46:12Z

src/cpg_flow/workflow.py

    return out


+def _compute_shadow(graph: nx.DiGraph, shadow_casters: set[str]) -> set[str]:


This logic looks more concise than the previous implementation. @folded, I've included a few differences I've noticed between the new logic and the previous implementation. Here, I did a 1:1 comparison, but if these changes are intentional, feel free to skip my comment.

(In the workflow examples, -> points to the execution order and not the edge direction in the DAG object)

When last_stages contains multiple stages on the same path, the previous logic picks the downstream stage (to skip the stages further downstream).
Let's say we have a workflow A -> B -> C -> D. If we define last_stages = [B, C], the previous logic skips only D, but the new logic will skip C, D.

This happens when B becomes a shadow caster with shadowed={C, D}.
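
The difference in the first case can be reproduced with a small sketch. It uses a plain adjacency dict rather than the `nx.DiGraph` from the PR, and both helper functions are hypothetical: `shadow_all` mirrors the behaviour described for the new logic, while `shadow_keep_casters` is just one possible way to recover the previous result for this example, not the PR's implementation.

```python
# Workflow A -> B -> C -> D, stored as node -> list of downstream nodes.
chain = {"A": ["B"], "B": ["C"], "C": ["D"], "D": []}


def descendants(graph: dict, node: str) -> set:
    """All nodes reachable from `node`, excluding `node` itself."""
    seen = set()
    stack = list(graph[node])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(graph[current])
    return seen


def shadow_all(graph: dict, casters: set) -> set:
    """Every caster shadows all of its descendants, including other casters."""
    shadowed = set()
    for caster in casters:
        shadowed |= descendants(graph, caster)
    return shadowed


def shadow_keep_casters(graph: dict, casters: set) -> set:
    """Same as shadow_all, except a stage named as a caster is never shadowed."""
    return shadow_all(graph, casters) - set(casters)


new_skipped = shadow_all(chain, {"B", "C"})
previous_skipped = shadow_keep_casters(chain, {"B", "C"})
```

With last_stages = [B, C], the first function skips C and D, while the second skips only D.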

Stage skipping when both last_stages and first_stages are defined.
Let's say we have a workflow with first_stages = B and last_stages=F

A->C
->B->D
->E->F->G
->G

The previous logic will result in,

B->D
->E->F

But in the new logic,

A will not be skipped: even though B is a shadow caster, C will light up A, and the last_stage kept logic will include A.

G will not be skipped: even though F is a shadow caster, E will light up G, and the first_stage kept logic will include G.
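
The expected result of the previous logic in this second case could be pinned down as a regression check along these lines. This is a sketch under stated assumptions: the edge list is one reading of the workflow diagram above (A feeds C and B; B feeds D and E; E feeds F and G; F feeds G), and `kept_stages` is a hypothetical helper that encodes "keep the first stages and everything downstream of them, then drop everything downstream of the last stages". It is not taken from the PR.

```python
# Assumed edges, stored as node -> list of downstream nodes.
workflow = {
    "A": ["C", "B"],
    "B": ["D", "E"],
    "C": [],
    "D": [],
    "E": ["F", "G"],
    "F": ["G"],
    "G": [],
}


def descendants(graph: dict, node: str) -> set:
    """All nodes reachable from `node`, excluding `node` itself."""
    seen = set()
    stack = list(graph[node])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(graph[current])
    return seen


def kept_stages(graph: dict, first_stages: set, last_stages: set) -> set:
    """Stages that run: first stages and their descendants, minus descendants of last stages."""
    kept = set(first_stages)
    for stage in first_stages:
        kept |= descendants(graph, stage)
    for stage in last_stages:
        kept -= descendants(graph, stage)
    return kept


kept = kept_stages(workflow, first_stages={"B"}, last_stages={"F"})
```

Under these assumptions the kept set is B, D, E and F, which matches the previous result quoted above; A, C and G are skipped.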

folded requested a review from a team as a code owner May 26, 2025 04:01

folded temporarily deployed to production May 26, 2025 04:01 — with GitHub Actions Inactive

folded requested review from MattWellie and violetbrina May 26, 2025 04:03

folded temporarily deployed to production May 26, 2025 04:03 — with GitHub Actions Inactive

MattWellie reviewed May 26, 2025

folded temporarily deployed to production May 26, 2025 04:29 — with GitHub Actions Inactive

folded temporarily deployed to production May 26, 2025 04:31 — with GitHub Actions Inactive

folded temporarily deployed to production May 26, 2025 04:33 — with GitHub Actions Inactive

folded temporarily deployed to production May 26, 2025 04:34 — with GitHub Actions Inactive

violetbrina marked this pull request as draft May 28, 2025 05:21

folded added 3 commits June 10, 2025 12:11

Simplify the implementations of graph walking.

a021789

Update comments and move only_stages application

a8bd619

Bring only_stages closer to first/last_stages.

Fix grammatical error in comment

07cb65c

folded force-pushed the simplify-graph-ops branch from e59422a to 07cb65c Compare June 10, 2025 02:25

folded temporarily deployed to production June 10, 2025 02:25 — with GitHub Actions Inactive

folded had a problem deploying to production June 10, 2025 02:27 — with GitHub Actions Failure

Improve type info

d3afaad

folded temporarily deployed to production June 10, 2025 02:30 — with GitHub Actions Inactive

folded added 2 commits June 10, 2025 12:33

remove incorrect merge conflict resolution

43fd2ef

Improve type info

06821a4

folded temporarily deployed to production June 10, 2025 02:45 — with GitHub Actions Inactive

whitespace

7a39af0

folded temporarily deployed to production June 10, 2025 02:46 — with GitHub Actions Inactive

folded temporarily deployed to production June 10, 2025 02:48 — with GitHub Actions Inactive

folded temporarily deployed to production June 10, 2025 03:02 — with GitHub Actions Inactive

folded temporarily deployed to production June 10, 2025 03:04 — with GitHub Actions Inactive

folded marked this pull request as ready for review June 10, 2025 03:08

disable lint: Remove quotes from type annotation

ecb4f45

folded force-pushed the simplify-graph-ops branch from 275a341 to ecb4f45 Compare June 10, 2025 04:07

folded temporarily deployed to production June 10, 2025 04:07 — with GitHub Actions Inactive

folded temporarily deployed to production June 10, 2025 04:09 — with GitHub Actions Inactive

Merge branch 'main' into simplify-graph-ops

2557185

folded temporarily deployed to production June 16, 2025 04:16 — with GitHub Actions Inactive

folded temporarily deployed to production June 16, 2025 04:18 — with GitHub Actions Inactive

Merge branch 'main' into simplify-graph-ops

bb2a3f1

folded temporarily deployed to production June 17, 2025 01:46 — with GitHub Actions Inactive

folded temporarily deployed to production June 17, 2025 01:48 — with GitHub Actions Inactive

folded changed the title from "simplify graph ops" to "chore(refactor): simplify graph ops" Jun 19, 2025

Merge branch 'main' into simplify-graph-ops

057a3c9

folded temporarily deployed to production June 19, 2025 01:33 — with GitHub Actions Inactive

folded temporarily deployed to production June 19, 2025 01:35 — with GitHub Actions Inactive

Merge branch 'main' into simplify-graph-ops

c2147cb

folded temporarily deployed to production June 20, 2025 05:13 — with GitHub Actions Inactive

folded temporarily deployed to production June 20, 2025 05:14 — with GitHub Actions Inactive

Merge branch 'main' into simplify-graph-ops

ab3b6ef

MattWellie temporarily deployed to production October 2, 2025 01:25 — with GitHub Actions Inactive

MattWellie temporarily deployed to production October 2, 2025 01:27 — with GitHub Actions Inactive

MattWellie approved these changes Oct 2, 2025

Merge branch 'main' into simplify-graph-ops

dfa7e69

MattWellie temporarily deployed to production November 27, 2025 02:58 — with GitHub Actions Inactive

MattWellie temporarily deployed to production November 27, 2025 02:59 — with GitHub Actions Inactive

vivbak requested a review from rameshka December 12, 2025 00:17

rameshka reviewed Jan 27, 2026
