## Feature Implement caching to allow for faster inference. ## Reason Faster inference for better user experience (but mainly for me to learn more engineering).