Skip to content

PlanR1.py里面强化学习部分的疑问 #11

@123sdadaw

Description

@123sdadaw

作者你好!
model\PlanR1.py,第162行:
ratio = (plan_logps - plan_logps.detach()).exp()
是否应该改为下面的啊:
ratio = (plan_logps - pred_logps.detach()).exp()

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions