The code is cleaned, simplified version of: https://github.com/zai-org/SCAIL-Pose
For face and hands, instead of DWPose this uses Vitpose and it's outputs converted into DWpose format for the optional alignment
VitPose detector is available in these nodes: https://github.com/kijai/ComfyUI-WanAnimatePreprocess
NLF model loader is already included in WanVideoWrapper
Reason this is separate repository is the additional requirements of taichi and pyrender