Skip to content

Conversation

@philip-paul-mueller
Copy link
Contributor

@philip-paul-mueller philip-paul-mueller commented Jan 22, 2026

This is for experimental purposes.
Can be force pushed any time.

Instead you should use PR#2457 that is made to be public.

NOT WORKING: 5.7612245082855225
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 831fb2d to 9cade71 Compare January 22, 2026 07:41
DOES NOT WORK: 5.77982020
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from bcb3fd5 to 41385a5 Compare January 22, 2026 07:58
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 41385a5 to 8a389ee Compare January 22, 2026 08:12
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 9e499a4 to 244dc10 Compare January 22, 2026 08:48
…tion.

But it also has an additional simplify that was present when TF was run in stage 1, but not in the other version.

PERFORMANCE: 4.5106589s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from eb6ebf3 to ac2c5ce Compare January 22, 2026 09:11
DOES NOT WORK: 5.90685105s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 9837be5 to b68bae5 Compare January 22, 2026 10:10
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 3347571 to e05939e Compare January 22, 2026 10:54
The inliner ignores Tasklets with empty Memlets and the PR also moved the `TaskletIntoMap` before the TF.
This is like the version that did not worked, i.e. was super fast, but with the new TF we should recover it.

WORKING: 4.604942321777344s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from e05939e to aa6afa2 Compare January 22, 2026 11:57
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 2faa11d to c404174 Compare January 22, 2026 12:45
… Tasklet fusion again.

I have noticed that in the normal/fast version the Tasklets with the literals are not inlined into other Tasklets.
Now let's do that.

DOES NOT WORK: 6.070111036300659s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from b417158 to cef7ebb Compare January 22, 2026 13:16
…sion run Tasklet fusion again."

This reverts commit c404174.

WORKS: 4.4253344535s
…fa3b1b754e`.

But this time it is run right after the inlining.
I think there will be a next iteration.

DOWS NOT WORK: 6.1715042
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from a7435da to eb8c62a Compare January 22, 2026 13:33
…he currently disacbled code and see if it is slow.

As expected it is kind of fast.
WORKS: 4.620257
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from eb8c62a to 2cf8002 Compare January 22, 2026 13:45
I would have expected that it is slow now but it is not.
WORKS: 4.66531s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 2cf8002 to 1047b06 Compare January 22, 2026 14:02
This seems to do the trick.
If between the `MoveDataflowIntoIf` and the second `TaskletFusion` is a call to simplify, it is slow.

DOES NOT WORK: 6.2248911s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch 2 times, most recently from 1c769ea to ebf9ce7 Compare January 23, 2026 07:32
…somehow executed in a wrong way.

In that experiment I wanted to run TF a second time after everything is done.
It was fast, which puzzeled me, because it contradicted by previous findings.
However, I looked at the archived SDFG and at least that thing showed that TF was not run a second time.
Thus I have decided to redo that part, so now let's run TF for a second time and see what happens.

SLOW: 6.137449s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 5812c74 to 4ada9a6 Compare January 23, 2026 07:48
…sklets with zero input degree and the SDFG must not be the top level one, i.e. real constant tasklet, that are not at the top level.

WORKS: 4.68273s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 4ada9a6 to 967c2a0 Compare January 23, 2026 08:22
…ly ignoreing empty memlet (which the previous experiment should have been done), we only consider them.

SLOW: 6.658746s
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch 2 times, most recently from 0fa4c80 to 5dd1a69 Compare January 23, 2026 08:49
@philip-paul-mueller philip-paul-mueller force-pushed the better_in_map_dataflow_optimization_order branch from 19e69b9 to cef303d Compare January 23, 2026 10:32
… that are nested, but in a more liberal way.

FAST: 4.63308429
@philip-paul-mueller philip-paul-mueller changed the title DO NOT MERGE: refactor[dace-next]: New Optimization Scheme in Intra-Map Optimization DO NOT MERGE: refactor[dace-next]: (EXPERIMENT) New Optimization Scheme in Intra-Map Optimization Jan 23, 2026
@philip-paul-mueller philip-paul-mueller changed the title DO NOT MERGE: refactor[dace-next]: (EXPERIMENT) New Optimization Scheme in Intra-Map Optimization DO NOT MERGE: Experiment for TaskletFusion in Intra-Map Optimization Jan 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant