Hi kvablack,
Thanks for your great work! I'm using dlimp to read the OpenX dataset and train a model with the Hugging Face Trainer for multi-GPU acceleration. In multi-GPU training, a serialization error occurs when the dataset is loaded and training starts. The following test, which pickles the dataset directly, reproduces the error:
train_dataset = build_datasets(
    data_args,
)
# TEST: dump using pickle
import pickle
# Use a context manager so the file is closed even if pickling fails
with open("outputs/dataset.obj", "wb") as filehandler:
    pickle.dump(train_dataset, filehandler)
# [error]
pickle.PicklingError: Can't pickle <class 'dlimp.dataset.DLataset'>: it's not the same object as dlimp.dataset.DLataset
I believe the issue lies in the serialization of the dlimp Dataset or tf Dataset. Could you provide any solution or suggestions?
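For context, pickle raises this kind of "not the same object" error when the class of the object being pickled is a different object from the one found by importing its module and looking up its name. The sketch below reproduces that situation without dlimp or TensorFlow. It assumes the cause is a class created at runtime that reuses the name of a module-level class; I have not confirmed that this is what happens inside dlimp. `Record`, `Shadow`, and `pickle_error` are hypothetical names used only for illustration.

```python
import pickle


class Record:
    """Hypothetical module-level class; its instances pickle normally."""

    def __init__(self, value):
        self.value = value


# Build a second class at runtime that reuses the name "Record".
# It is a distinct object from the module-level Record above.
Shadow = type("Record", (Record,), {})


def pickle_error(obj):
    """Return the PicklingError message for obj, or None if pickling succeeds."""
    try:
        pickle.dumps(obj)
    except pickle.PicklingError as exc:
        return str(exc)
    return None


print(pickle_error(Record(1)))  # module-level class: pickling succeeds
print(pickle_error(Shadow(1)))  # runtime-built class: pickling fails
```

If the dataset class is built in a similar way, pickling the dataset object itself would fail regardless of its contents, which may be why the error appears as soon as the Trainer tries to serialize it.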
Many Thanks! 🤗