Skip to content

Question: No parallel path tags during evaluation despite using Parallel-R1 checkpoints #11

@larasneves

Description

@larasneves

Hi, thanks a lot for this work and for open-sourcing the code!!

I’ve been trying to reproduce the results for a course project this semester and ran into an issue during evaluation.

What I did:
•Ran submit_rl_exp_s1 to train with RL using the base model Parallel-R1/Parallel-SFT-Unseen from Hugging Face.
• Evaluated the resulting checkpoint using verl/recipe/r1/run_r1_distill_qwen.sh.
• I also tried evaluating the released checkpoint Parallel-R1/Parallel-R1-Unseen_Step_200.

Issue:
• I noticed that I had to manually modify the evaluation prompt to match the Parallel Thinking prompt described in the report.
• Even after doing so, the evaluation outputs do not contain any parallel path tags, which I find unexpected (using run_r1_distill_qwen.sh )

Questions:
1. Are parallel path tags supposed to appear automatically during evaluation, or is there an additional configuration or prompt formatting step required?
2. Is there a specific evaluation script or flag needed to enable parallel path generation?

Any guidance on how to properly reproduce the parallel reasoning behavior and evaluation would be greatly appreciated.
Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions