Commit 77fc012

Update blog
1 parent 627d456 commit 77fc012

1 file changed: +1 -1 lines changed

content/blog/2025-11-18-1763464399.md

Lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@ tags:
 
 Successfully compiled the VAE of Stable Diffusion 1.5 using [graph-compiler](https://github.com/cmdr2/graph-compiler).
 
-The compiled model is terribly slow because I haven't written any performance optimizations, and it (conservatively) converts a lot of intermediate tensors to contiguous copies. But we don't need a lot of clever optimizations to get to decent performance.
+The compiled model is terribly slow because I haven't written any performance optimizations, and it (conservatively) converts a lot of intermediate tensors to contiguous copies. But we don't need any clever optimizations to get to decent performance, just basic ones.
 
 It's pretty exciting because I was able to bypass the need to port the model to C++ manually. Instead, I was able to just compile the exported ONNX model and get the same output values as the original PyTorch implementation (given the same input and weights). I could compile to any platform supported by ggml by just changing one flag (e.g. CPU, CUDA, ROCm, Vulkan, Metal etc).
 
