This repository was archived by the owner on Oct 26, 2022. It is now read-only.

Instructions for fine-tuning pre-trained models #139

@y3nk0

Is there any chance we could get a full description of how to fine-tune the pre-trained models (for example, for machine translation)? I've managed to continue training on a much smaller dataset, reusing the pre-trained dictionary, but the results are disappointing: the model ends up "broken" and mostly outputs "unk". Am I missing a step? Should we update the bpecodes as well? Thank you!
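For anyone trying to reproduce or diagnose this: one possible cause of a model that mostly emits "unk" is a mismatch between the BPE segmentation of the new data and the pre-trained dictionary. The sketch below is only an illustration of how that could be checked. It assumes the dictionary is a text file with one `<token> <count>` entry per line and that the corpus is already BPE-segmented and whitespace-tokenized; both formats, and the helper names `load_vocab` and `unk_rate`, are assumptions for this example and not part of the repository.

```python
from collections import Counter


def load_vocab(lines):
    """Collect tokens from dictionary lines of the assumed form '<token> <count>'."""
    vocab = set()
    for line in lines:
        parts = line.split()
        if parts:
            vocab.add(parts[0])
    return vocab


def unk_rate(corpus_lines, vocab):
    """Return (fraction of tokens not in vocab, Counter of the missing tokens)."""
    total = 0
    missing = Counter()
    for line in corpus_lines:
        for tok in line.split():
            total += 1
            if tok not in vocab:
                missing[tok] += 1
    rate = sum(missing.values()) / total if total else 0.0
    return rate, missing


# Toy stand-ins for the real files (hypothetical data).
pretrained_dict = ["the 1000", "trans@@ 50", "lation 40", "model 30"]
# First sentence segmented with different BPE codes, second with matching ones.
new_corpus = ["the transl@@ ation model", "the trans@@ lation model"]

rate, missing = unk_rate(new_corpus, load_vocab(pretrained_dict))
print(f"out-of-vocabulary rate: {rate:.2%}")
print("most common missing tokens:", missing.most_common(5))
```

If the rate on the real fine-tuning data is high, that would suggest the data was segmented with codes other than the ones the pre-trained dictionary was built from. A low rate would point to a different cause, such as the training settings.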
