This may be out of scope considering size of datasets and complexity, however, options: RNN, transformer, LSTM