
optimize: improve inference scripts with mmgp and torchao support #13

Open

shangvo wants to merge 2 commits into showlab:main from shangvo:feature/optimize-inference

Conversation

shangvo commented Mar 12, 2025

Optimization for Inference Scripts

Changes

  • Added inference_mmgp.py for memory optimization using mmgp
  • Added inference_torchao.py for model quantization and acceleration
  • Updated requirements with new dependencies:
    • para-attn: for first block caching
    • mmgp: for memory management
    • torchao: for model quantization

Benefits

  • Reduced peak memory usage via mmgp's memory-profile management
  • Faster inference through torchao model quantization
  • Profiles covering a range of hardware configurations
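As a rough illustration of why quantization speeds up inference and cuts memory: weight-only int8 quantization stores each weight in 1 byte instead of 4, at the cost of a small rounding error. The sketch below shows the idea in plain NumPy; it is a hypothetical illustration of the technique, not torchao's actual API (torchao applies this per-layer inside the model).

```python
import numpy as np

def quantize_int8(w):
    # Symmetric per-tensor quantization: map [-max|w|, max|w|] onto [-127, 127].
    scale = np.abs(w).max() / 127.0
    q = np.round(w / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover an approximation of the original float weights.
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_int8(w)
# Rounding error is bounded by half a quantization step (scale / 2).
err = np.abs(dequantize(q, s) - w).max()
```

In exchange for that bounded error, the int8 weights take a quarter of the memory, and int8 matmul kernels run faster on supported hardware.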

Dependencies

Added new requirements in requirements_new.txt:

  • para-attn
  • mmgp
  • torchao

Please make sure to install these new dependencies before running the optimized inference scripts.
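Assuming a standard pip environment, installation would look like this (package names taken from the list above):

```shell
# Install the new dependencies individually…
pip install para-attn mmgp torchao

# …or from the requirements file added in this PR:
pip install -r requirements_new.txt
```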
