
Conversation

@skinnider
Collaborator

Fresh start trying to merge #275

anushka255 previously approved these changes Jan 6, 2026
@anushka255 anushka255 requested review from anushka255 and removed request for anushka255 January 6, 2026 20:14
@anushka255 anushka255 dismissed their stale review January 6, 2026 20:16

I was just testing something.

Michael A. Skinnider and others added 2 commits January 6, 2026 15:22
@skinnider skinnider requested a review from seungchan-an January 7, 2026 16:06
Collaborator

@seungchan-an seungchan-an left a comment


S4 integration looks solid, the unused models are cleanly commented out, and the loss refactor and NaN early stopping are reasonable. S4 tests are in place and CI is green. Looks good to me.
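For context, a generic sketch of NaN-aware early stopping of the kind referred to above; this is illustrative only, not the PR's actual code, and the function and argument names are placeholders:

```python
import math

def should_stop(current_loss, steps_without_improvement, patience):
    """Generic NaN-aware early stopping check (illustrative, not the PR's code)."""
    if math.isnan(current_loss):
        # A NaN loss will not recover, so abort training immediately.
        return True
    # Otherwise fall back to patience-based early stopping on validation loss.
    return steps_without_improvement >= patience
```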

@skinnider
Collaborator Author

skinnider commented Jan 9, 2026

@GuptaVishu2002 I was going to launch a few small runs of RNN vs. Transformer vs. S4 to check that the changes are non-breaking, but I realized I can't actually select the Transformer or S4 models from config.yaml, and src/clm/commands/sample_molecules_RNN.py does not actually import either of these classes.

Could you take a look at integrating config.yaml -> Snakemake -> sample_molecules_RNN.py so that the user (here, me) can specify Transformer or S4 in the config and run the whole pipeline with one of these models?

We might need to add new parameters to the config; e.g., a "model_type" parameter might be worth considering, since the current model_params all relate to RNNs. A sketch of what the dispatch could look like follows the excerpt below:

# Parameters that define the neural network model and training process.
model_params:
  # Type of Recurrent Neural Network (RNN) to use.
  # Available options are 'LSTM' and 'GRU'
  rnn_type: LSTM
  embedding_size: 128 # Size of the embedding vectors that represent each token in the input sequence.
  hidden_size: 1024 # Size of the hidden state of the RNN.
  n_layers: 3 # Number of stacked RNN layers in the model.
  dropout: 0 # Dropout rate applied to the RNN layer for regularization.
  batch_size: 64 # Number of samples processed before the model's internal parameters are updated.
  learning_rate: 0.001 # Used by the optimizer to update model parameters.
  max_epochs: 999999 # Maximum number of training epochs (complete passes through the training dataset).
  patience: 50000 # Number of steps with no improvement in the validation loss after which early stopping is triggered.

  # An RNN model conditioned on input descriptors (experimentally obtained properties of the input SMILES).
  # Note that rnn_type and other RNN architecture parameters are still applicable in this case.
  conditional:
    # Is the conditional model enabled?
    enabled: false

    # Note: emb and emb_l below cannot both be true at the same time.
    # Concatenate the descriptors directly to the token embeddings at each step in the sequence?
    emb: false
    # Concatenate the descriptors to the token embeddings, but by first passing them through a
    # linear layer to obtain embeddings of dimensionality equal to that of the token embeddings?
    emb_l: true

    # Note: dec and dec_l below cannot both be true at the same time.
    # Concatenate the descriptors directly to the output of the RNN layers
    # (prior to the decoder layer)?
    dec: false
    # Concatenate the descriptors to the output of the RNN layers
    # (prior to the decoder layer), but by first passing them through a
    # linear layer to obtain embeddings of dimensionality equal to that of the token embeddings?
    dec_l: true

    # Instantiate the hidden states based on learned transformations of the descriptors
    # (with a single linear layer), as in Kotsias et al.?
    h: false

See also issue #283.
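As a minimal sketch of how such a model_type dispatch could look inside sample_molecules_RNN.py; the class names, constructor signatures, and config layout here are placeholders, not the actual clm API:

```python
# Placeholder sketch of a model_type dispatch; names and signatures are
# illustrative, not the actual clm code.

def build_model(model_params, model_classes):
    """Instantiate the model class named by model_params['model_type'].

    model_classes maps a model_type string (e.g. 'RNN', 'Transformer', 'S4')
    to a constructor that accepts the remaining model_params as keyword
    arguments; parameters irrelevant to the chosen class would still need
    to be filtered out in the real implementation.
    """
    params = dict(model_params)
    # Default to 'RNN' so existing configs without the new key keep working.
    model_type = params.pop("model_type", "RNN")
    try:
        cls = model_classes[model_type]
    except KeyError:
        raise ValueError(
            f"Unknown model_type {model_type!r}; expected one of {sorted(model_classes)}"
        )
    return cls(**params)

# Hypothetical usage:
#   models = {"RNN": RNN, "Transformer": Transformer, "S4": S4}
#   model = build_model(config["model_params"], models)
```

Snakemake would then only need to forward the model_params section of config.yaml (or a --model_type flag) to sample_molecules_RNN.py.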

@skinnider
Collaborator Author

Update on this: I tested the S4 implementation with the NPS training set. The model is not outperforming the LSTM by any means, but it seems to be doing reasonably well, which rules out any major issues in the implementation. The Transformer, on the other hand, is failing immediately at the train_models step; Vishu will look into this.

Vishu Gupta added 2 commits January 21, 2026 14:24
…r does not maintain recurrent state. Also add torch.cuda.empty_cache() and torch.no_grad() during sampling for GPU memory management
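A generic illustration of the sampling-side memory management described in that commit message; everything other than the torch calls (the function name, model.sample, and its arguments) is a placeholder, not the project's actual sampling code:

```python
import torch

def sample_smiles(model, batch_size, max_len):
    """Generate a batch of samples without tracking gradients (placeholder sketch)."""
    model.eval()
    with torch.no_grad():  # no autograd graph is needed at sampling time
        samples = model.sample(batch_size=batch_size, max_len=max_len)
    if torch.cuda.is_available():
        # Return cached allocator blocks to the driver between batches.
        torch.cuda.empty_cache()
    return samples
```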
