This project is a clean, educational implementation of the Transformer architecture (as introduced in *Attention Is All You Need*), built entirely in PyTorch.
The implementation follows the classic Transformer design:
- Input Embedding + Positional Encoding (sinusoidal; see the sketch below)
- N × Encoder Layers (one layer is sketched as a module after this list)
  - Multi-head Self-Attention
  - Feed-Forward Network
  - Layer Normalization + Residuals
- N × Decoder Layers
  - Masked Multi-head Self-Attention (see the causal-mask sketch below)
  - Encoder-Decoder Attention
  - Feed-Forward Network
- Final Linear + Softmax Layer
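
As a reference for the positional-encoding item above, here is a minimal sketch of the sinusoidal encoding from the paper, where PE(pos, 2i) = sin(pos / 10000^(2i/d_model)) and PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model)). The function name is illustrative and may differ from the module names in this repo:

```python
import math
import torch

def sinusoidal_positional_encoding(max_len: int, d_model: int) -> torch.Tensor:
    """Sketch of the fixed sinusoidal encoding (assumes an even d_model)."""
    position = torch.arange(max_len).unsqueeze(1)            # (max_len, 1)
    # 1 / 10000^(2i/d_model), computed in log space for stability
    div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))
    pe = torch.zeros(max_len, d_model)
    pe[:, 0::2] = torch.sin(position * div_term)             # even dimensions
    pe[:, 1::2] = torch.cos(position * div_term)             # odd dimensions
    return pe                                                # (max_len, d_model)
```

The returned tensor is added to the token embeddings (which the paper scales by √d_model) before the first encoder layer.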
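The decoder's masked self-attention reduces to scaled dot-product attention with a causal (lower-triangular) mask, so each position can only attend to itself and earlier positions. A minimal sketch with illustrative names, assuming the usual `(batch, heads, seq_len, d_k)` layout:

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    """softmax(QK^T / sqrt(d_k)) V, with optional masking of disallowed positions."""
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5            # (batch, heads, q_len, k_len)
    if mask is not None:
        # Positions where mask is False get -inf, i.e. zero attention weight
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = F.softmax(scores, dim=-1)
    return weights @ v

# Causal mask for the decoder: True on and below the diagonal
seq_len = 5
causal_mask = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))
```

Encoder self-attention and encoder-decoder attention use the same function, just without the causal mask (encoder-decoder attention takes its queries from the decoder and its keys/values from the encoder output).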
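Putting the encoder items together: one encoder layer applies self-attention and the feed-forward network, each wrapped in a residual connection followed by LayerNorm (post-norm, as in the original paper). This sketch uses PyTorch's built-in `nn.MultiheadAttention` for brevity; the repo itself may implement attention from scratch:

```python
import torch.nn as nn

class EncoderLayer(nn.Module):
    """Illustrative encoder block: LayerNorm(x + Sublayer(x)) around each sublayer."""
    def __init__(self, d_model=512, n_heads=8, d_ff=2048, dropout=0.1):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model)
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.dropout = nn.Dropout(dropout)

    def forward(self, x, key_padding_mask=None):
        # Self-attention sublayer with residual + LayerNorm
        attn_out, _ = self.attn(x, x, x, key_padding_mask=key_padding_mask)
        x = self.norm1(x + self.dropout(attn_out))
        # Feed-forward sublayer with residual + LayerNorm
        x = self.norm2(x + self.dropout(self.ffn(x)))
        return x
```

A decoder layer has the same shape, plus the causal mask on its self-attention and an extra encoder-decoder attention sublayer before the feed-forward network.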
- Language: Python
- Framework: PyTorch