This way expensive networking calls done in state loading (e.g., loading CLIP model) can be parallelized