Skip to content

Question about fine-tuning #136

@kimwin2

Description

@kimwin2

Firstly, thank you for your great effort to make this project.

When will the fine-tuning code be released? If it's delayed, could you please let me know the learning rate used in the end-to-end training? I've tried to reimplement the fine-tuning using the model output logits, but while the train loss decreases, the evaluation loss keeps increasing.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions