Skip to content

New Networks#71

Merged
rahulkfernandes merged 24 commits intoloss-functionsfrom
rahul/loss-functions/more-models
Mar 2, 2026
Merged

New Networks#71
rahulkfernandes merged 24 commits intoloss-functionsfrom
rahul/loss-functions/more-models

Conversation

@rahulkfernandes
Copy link
Collaborator

@rahulkfernandes rahulkfernandes commented Mar 2, 2026

Description

This pull request: Adds more neural network architectures to the NN model Library, fixes model overfitting by increasing validation set length, hyperparameter tuning, and early stopping. Best performing model + loss as of this pull request is AttentionLSTM + custom_loss_10 (See src/training/loss_functions.py).

What was done

  • Length of Validation and Test splits increased to 2 years.
  • Hyperparameter tuning and early stopping to tackle overfitting.
  • Experimentation with, and implementation of more Neural Networks.

New Networks Added:

  • InvertedAttentionLSTM
  • LSTMTransformer
  • TemporalTransformer
  • TFT
  • DeformTime

What type is this change?

  • Bug fix
  • New feature

Checklist before requesting a review

  • [X ] My code follows the style guidelines of this project
  • I have tested on MAC OS system
  • I have tested on Ubuntu system
  • I have tested on the RIT research cluster and updated the spack packages for compiling
  • My changes generate no new warnings
  • If the PR is related to a new feature, I added the running scripts for the new feature

@rahulkfernandes rahulkfernandes merged commit 59644fe into loss-functions Mar 2, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant