ICLR 2026 paper on state-dependent reward shaping with multi-view videos and ViCLIP for reinforcement learning.
reinforcement-learning robot-learning jax multi-view-learning reward-shaping metaworld vision-language-model viclip iclr-2026 humanoidbench
Updated Mar 6, 2026 - Python