You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is the implementation of GPT2 paper in PyTorch. This is just the architecture.
It has 162262609 (162M) parameters (not training on my GPU so doesn't matter:) ).
About
This is the implementation of GPT2 architecture in PyTorch.