🎵 LSTMiniNet: Custom LSTM Melody Generator in Modern C++

LSTMiniNet is a handcrafted LSTM-based melody generator built entirely in modern C++20, powered by a custom matrix library and a reverse-mode automatic differentiation engine. This project demonstrates building deep sequence models like LSTM from scratch, without any external ML/DL libraries.

📌 Features

🚀 Pure C++ LSTM architecture using manually defined computation graphs
🧠 Gradient backpropagation via a custom Reverse AutoDiff Engine
🧮 Matrix operations module with support for activation functions, softmax, and loss functions
🎹 Generates music note sequences using an LSTM trained on melody data
📉 Support for loss functions like MSE, BCE, CCE
🔄 Truncated BPTT and gradient clipping for stability

🧠 Architecture Overview

🧮 Reverse AutoDiff Engine

All operations (addition, multiplication, activation, etc.) are captured dynamically via shared_ptr<Node>
Each Node stores the gradient and defines a backward() lambda to propagate gradients to its parents

shared_ptr<Node> A, B;
shared_ptr<Node> C = A * B; // Gradient tracked automatically

🧩 Matrix Module

Core data structure for computations
Supports broadcasting, elementwise ops, transpose, activation derivatives
Integrated with autodiff via operator overloading

🔁 LSTM Cell Logic

Each LSTM cell performs 4 computations:

Forget Gate: Ft = sigmoid(Wf * xt + Uf * ht-1 + bf)
Input Gate: It = sigmoid(Wi * xt + Ui * ht-1 + bi)
Candidate Memory: Ct~ = tanh(Wc * xt + Uc * ht-1 + bc)
Cell State Update: Ct = Ft ⊙ Ct-1 + It ⊙ Ct~
Output Gate: Ot = sigmoid(Wo * xt + Uo * ht-1 + bo)
Hidden State: Ht = Ot ⊙ tanh(Ct)

📈 Training Details

Dataset: Melody note sequences (Melody.txt)
Input: sequence of notes → Output: next note prediction
Training: Sequence-to-sequence LSTM using CrossEntropyLoss
Optimization: Manual SGD + L2 regularization
Extras: Gradient clipping, weight decay, truncation step

model.Train(dataset, targets, epochs = 4, lr = 0.05, clip = 1.0, L2 = 0.05, truncate = 3);

🎼 Output Sample

Once trained, the model can generate sequences:

vector<int> seed = dataset.back();
Matrix nextMelody = model.predict(seed);

🖼️ Output

This shows a printed sequence of generated notes, predicted one-by-one, simulating a musical continuation.

🧮 GRU Gate Comparison (Not Used in Code)

While the project uses LSTM, here's how GRU differs:

Update Gate: Controls memory retention
Reset Gate: Controls forgetting of past hidden state
Fewer parameters than LSTM (no separate cell state)

GRU Equations:

zₜ = sigmoid(Wz * xₜ + Uz * hₜ₋₁)
rₜ = sigmoid(Wr * xₜ + Ur * hₜ₋₁)
ĥₜ = tanh(W * xₜ + U * (rₜ ⊙ hₜ₋₁))
hₜ = (1 - zₜ) ⊙ hₜ₋₁ + zₜ ⊙ ĥₜ

🖼️ GRU Architecture

🧠 LSTM Architecture

This shows the flow of data through input, forget, and output gates and how memory is updated and passed.

🚨 Known Issues

Memory leak in training loop

💬 Author Note

This project is part of a larger initiative to build deep learning models from scratch in C++, without relying on external frameworks. Stay tuned for more architectures like GRU, Transformer, and full sequence-to-sequence models!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Dataset		Dataset
GRU-RNN.png		GRU-RNN.png
LICENSE		LICENSE
LSTM-RNN.png		LSTM-RNN.png
LSTM_Arch.ixx		LSTM_Arch.ixx
Matrix.ixx		Matrix.ixx
Model.ixx		Model.ixx
Output.png		Output.png
README.md		README.md
RevAutoDiffEngine.ixx		RevAutoDiffEngine.ixx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎵 LSTMiniNet: Custom LSTM Melody Generator in Modern C++

📌 Features

🧠 Architecture Overview

🧮 Reverse AutoDiff Engine

🧩 Matrix Module

🔁 LSTM Cell Logic

📈 Training Details

🎼 Output Sample

🖼️ Output

🧮 GRU Gate Comparison (Not Used in Code)

🖼️ GRU Architecture

🧠 LSTM Architecture

🚨 Known Issues

💬 Author Note

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎵 LSTMiniNet: Custom LSTM Melody Generator in Modern C++

📌 Features

🧠 Architecture Overview

🧮 Reverse AutoDiff Engine

🧩 Matrix Module

🔁 LSTM Cell Logic

📈 Training Details

🎼 Output Sample

🖼️ Output

🧮 GRU Gate Comparison (Not Used in Code)

🖼️ GRU Architecture

🧠 LSTM Architecture

🚨 Known Issues

💬 Author Note

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages