Conversation

@apoorva5ingh

I created a mini GPT model from scratch using PyTorch, inspired by Karpathy’s educational examples. This project implements all core components of a transformer: multi-head self-attention, feedforward layers, embeddings, and layer normalization. The model is trained on character-level text data and can generate new sequences after training. It includes logic for evaluation, loss tracking, and saving/loading the model. The code is clean and modular, making it perfect for learning how GPT models work internally. This setup is great for experimenting with custom datasets or building lightweight LLMs for small-scale tasks and educational purposes.
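For readers who want a concrete picture of the components listed above, here is a minimal sketch of a Karpathy-style character-level GPT in PyTorch. The class and parameter names (`SelfAttention`, `Block`, `MiniGPT`, `n_embd`, `block_size`, etc.) are illustrative assumptions, not necessarily the names used in this PR's code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfAttention(nn.Module):
    """Multi-head causal self-attention (illustrative names, not this PR's exact code)."""
    def __init__(self, n_embd, n_head, block_size, dropout=0.1):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.qkv = nn.Linear(n_embd, 3 * n_embd)
        self.proj = nn.Linear(n_embd, n_embd)
        self.dropout = nn.Dropout(dropout)
        # causal mask: each position may only attend to itself and earlier positions
        self.register_buffer("mask", torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)
        # reshape to (B, n_head, T, head_dim)
        q = q.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        k = k.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        v = v.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / (k.size(-1) ** 0.5)
        att = att.masked_fill(self.mask[:T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        out = (att @ v).transpose(1, 2).contiguous().view(B, T, C)
        return self.dropout(self.proj(out))

class Block(nn.Module):
    """Transformer block: pre-norm attention + feedforward, each with a residual connection."""
    def __init__(self, n_embd, n_head, block_size):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = SelfAttention(n_embd, n_head, block_size)
        self.ln2 = nn.LayerNorm(n_embd)
        self.ff = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd), nn.GELU(), nn.Linear(4 * n_embd, n_embd)
        )

    def forward(self, x):
        x = x + self.attn(self.ln1(x))
        x = x + self.ff(self.ln2(x))
        return x

class MiniGPT(nn.Module):
    """Character-level GPT: token + position embeddings, a stack of blocks, and an LM head."""
    def __init__(self, vocab_size, n_embd=128, n_head=4, n_layer=4, block_size=64):
        super().__init__()
        self.block_size = block_size
        self.tok_emb = nn.Embedding(vocab_size, n_embd)
        self.pos_emb = nn.Embedding(block_size, n_embd)
        self.blocks = nn.Sequential(*[Block(n_embd, n_head, block_size) for _ in range(n_layer)])
        self.ln_f = nn.LayerNorm(n_embd)
        self.head = nn.Linear(n_embd, vocab_size)

    def forward(self, idx, targets=None):
        B, T = idx.shape
        pos = torch.arange(T, device=idx.device)
        x = self.tok_emb(idx) + self.pos_emb(pos)
        logits = self.head(self.ln_f(self.blocks(x)))
        loss = None
        if targets is not None:
            loss = F.cross_entropy(logits.view(B * T, -1), targets.view(B * T))
        return logits, loss

    @torch.no_grad()
    def generate(self, idx, max_new_tokens):
        # sample one character at a time, feeding the growing sequence back in
        for _ in range(max_new_tokens):
            logits, _ = self(idx[:, -self.block_size:])
            probs = F.softmax(logits[:, -1, :], dim=-1)
            idx = torch.cat([idx, torch.multinomial(probs, 1)], dim=1)
        return idx
```

Training, evaluation, and checkpointing would sit around this model in the usual way: an optimizer such as `torch.optim.AdamW`, a loop that samples `(idx, targets)` batches of character indices and backpropagates the returned loss, and `torch.save(model.state_dict(), path)` / `model.load_state_dict(torch.load(path))` for saving and loading. The exact training loop in this PR may differ.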

@B4xAbhishek

needs improvement
