Why this is needed:
For active learning (AL), we need to identify and quantify uncertainty in our current models. This uncertainty guides where we add new training data.
For a given model, or set of models (ensemble approach), we would like to:
- Select structures within the database
- Run an MLIP calculation (usually a static calculation)
Composition Exploration Workflow:
- Identify holes in the composition space within the database
- Create structures with those compositions
- Run MCMC (using an MLIP) to get more realistic short-range order (SRO) in the system
- Run 1 ps of NVT MD with the MLIP at 1000 K, 2000 K, and 3000 K
- Ignore the first 1-2 structures, then sample every other structure for a total of 5 structures (including the final structure)
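The frame selection in the last step could look roughly like this. This is a minimal sketch, assuming the MD trajectory is available as an ordered list of frames; `sample_frame_indices` and its parameters are hypothetical names, not existing forge code, and walking backward from the last frame is just one way to guarantee the final structure is kept.

```python
def sample_frame_indices(n_frames, n_skip=2, stride=2, n_samples=5):
    """Pick frame indices from an MD trajectory (hypothetical helper).

    Skips the first `n_skip` frames, then takes every `stride`-th frame,
    counting backward from the last frame so the final structure is
    always included. Returns at most `n_samples` indices, ascending.
    """
    if n_frames <= n_skip:
        raise ValueError("trajectory is shorter than the number of skipped frames")
    # Walk backward from the final frame down to (but not below) n_skip.
    picked = list(range(n_frames - 1, n_skip - 1, -stride))[:n_samples]
    return sorted(picked)
```

For an 11-frame trajectory, `sample_frame_indices(11)` picks frames 2, 4, 6, 8, and 10. With longer trajectories this rule only samples near the end, so the stride would probably need to scale with the number of saved frames.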
Adversarial Attack Workflow:
- Calculate the force variance on every structure in the dataset using an ensemble of models
- Select the 25-100 structures with the highest uncertainty
- Compute adversarial attacks, running 25 steps or until the variance is maximized
- Save every 3rd to 5th structure
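The scoring and selection steps above can be sketched as follows. This assumes the ensemble forces for one structure are already stacked into an array of shape (n_models, n_atoms, 3). The definition of force variance used here (variance across models, summed over Cartesian components, then averaged over atoms) is an assumption, as are all function names; the past implementation may use a different reduction, such as the maximum over atoms.

```python
import numpy as np


def per_atom_force_variance(forces):
    """Per-atom force variance across an ensemble.

    forces: array-like of shape (n_models, n_atoms, 3).
    Returns an array of shape (n_atoms,).
    """
    forces = np.asarray(forces, dtype=float)
    if forces.ndim != 3 or forces.shape[-1] != 3:
        raise ValueError("expected forces with shape (n_models, n_atoms, 3)")
    # Variance over the model axis, summed over x, y, z.
    return forces.var(axis=0).sum(axis=-1)


def structure_uncertainty(forces):
    """Single score per structure: mean per-atom force variance (assumed)."""
    return float(per_atom_force_variance(forces).mean())


def select_most_uncertain(scores, k):
    """Indices of the k highest scores, most uncertain first."""
    order = np.argsort(np.asarray(scores, dtype=float))[::-1]
    return order[:k].tolist()
```

With scores computed for the whole dataset, `select_most_uncertain(scores, k=100)` would give the starting structures for the adversarial attacks.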
Both of these workflows require loading structures and computing force variance on them. It is probably best to batch this, so that we are not loading tens of thousands of structures at once.
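One simple way to do the batching is a generator over structure IDs, so that only one batch of structures is in memory at a time. This is a sketch; `iter_batches` is a hypothetical helper, and the actual database loading call is left out because it depends on the forge database interface.

```python
from itertools import islice


def iter_batches(items, batch_size):
    """Yield successive lists of up to `batch_size` items from any iterable."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    it = iter(items)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch
```

Each batch of IDs would then be loaded, scored, and released before the next batch is read; only the per-structure scores need to be kept for the final ranking.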
The Adversarial Attack workflow hasn't been implemented yet, but I have a past implementation that I can transfer over. Clustering and composition analysis are implemented, and we have MD (forge/forge/workflows/md.py), but combining the two hasn't been done yet.