Continued training of FlauBERT (with --reload_model) -- Question about vocab size

Hello. :)

I would like to use the "--reload_model" option with your train.py command to further train one of your pretrained FlauBERT models.

Upon trying to run train.py with the  "--reload_model" option I got an error message saying that there was a "size mismatch" between the pretrained FlauBERT model and the adapted model I was trying to train.

The error message referred to a "shape torch.Size([67542]) from checkpoint". This was for the flaubert_base_uncased model. I assume that the number 67542 is the vocabulary size of flaubert-base-uncased.

In order to use the "--reload_model" option with your pretrained FlauBERT models, do I need to ensure that the vocabulary of my training data is identical to that of the pretrained model? If so, do you think that I could manage that simply by concatenating the "vocab" file of the pretrained model with my training data?

Thank you in advance for your help!





Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Continued training of FlauBERT (with --reload_model) -- Question about vocab size #40

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Continued training of FlauBERT (with --reload_model) -- Question about vocab size #40

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions