Hi,
Thanks for the nice work. Could you share the code you used to evaluate the Claude Computer Use setting? I saw that it is included in the leaderboard. Thanks!