Memory leak when running MCMC in parallel

Due to a known memory leak when instantiating subclasses of `Symbol` from SymEngine, one of our upstream dependencies (see https://github.com/symengine/symengine.py/issues/379), running ESPEI with parallelization causes memory to grow in each worker.

Significant memory growth occurs only when running in parallel, because parallel execution uses the `pickle` library to serialize and deserialize symbol objects, and each deserialization creates new objects that can't be freed. When running without parallelization (`mcmc.scheduler: null`), new symbols are not created.
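
To illustrate why serialization matters here, the following is a minimal sketch using only the standard library. `types.SimpleNamespace` is a stand-in for a symbol object and is an assumption for illustration; it is not SymEngine's `Symbol` and does not leak. The point is only that every `pickle` round trip builds a new object, which is the step that triggers the upstream leak for real symbols.

```python
import pickle
from types import SimpleNamespace

# Stand-in for a symbol object (hypothetical; NOT a SymEngine Symbol).
original = SimpleNamespace(name="X")

# Without serialization, passing the symbol around reuses the same object.
same = original

# Each pickle round trip, as happens when work is shipped to a parallel
# worker, builds a brand-new instance with the same contents.
copies = [pickle.loads(pickle.dumps(original)) for _ in range(3)]

# Count distinct live objects: the original plus one per round trip.
distinct_ids = {id(obj) for obj in [original, *copies]}
print(len(distinct_ids))
```

In the serial case no round trip happens, so no new instances are created, which matches the observation that `mcmc.scheduler: null` avoids the growth.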

Until https://github.com/symengine/symengine.py/issues/379 is fixed, some mitigation strategies to avoid running out of memory are:
- Run ESPEI without parallelization by setting `mcmc.scheduler: null`.
- (Under consideration to implement): when parallelization is active, use an option to restart the workers every `N` iterations.
- (Under consideration to implement): remove `Model` objects from the keyword arguments of ESPEI's likelihood functions. `Model` objects contribute many symbol instances in the form of `v.SiteFraction` objects. We should be able to get away with using only `PhaseRecord` objects, but a few places that use `Model.constituents` to infer the sublattice model and internal degrees of freedom would need to be rewritten.
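
The worker-restart idea could be sketched roughly as follows. This is not ESPEI code: `run_in_batches`, `run_batch`, and `restart_workers` are hypothetical names introduced for illustration. The assumption is that the sampler can be advanced a fixed number of iterations at a time and that the scheduler exposes some way to restart its workers, which would release the memory held by leaked symbols.

```python
def run_in_batches(total_iterations, restart_every, run_batch, restart_workers):
    """Advance a sampler in batches, restarting workers between batches.

    run_batch(n) and restart_workers() are hypothetical callbacks supplied
    by the caller; they are not part of any ESPEI API.
    """
    if restart_every < 1:
        raise ValueError("restart_every must be at least 1")
    completed = 0
    while completed < total_iterations:
        # The last batch may be shorter than restart_every.
        n = min(restart_every, total_iterations - completed)
        run_batch(n)
        completed += n
        # No restart is needed after the final batch.
        if completed < total_iterations:
            restart_workers()
    return completed


# Usage with recording stubs in place of a real sampler and scheduler.
batches = []
restarts = []
done = run_in_batches(
    total_iterations=10,
    restart_every=4,
    run_batch=batches.append,
    restart_workers=lambda: restarts.append(len(batches)),
)
print(batches)   # [4, 4, 2]
print(restarts)  # [1, 2]
```

The trade-off in this sketch is that each restart pays worker startup cost again, so `restart_every` would need to be large enough to amortize that cost while still keeping peak memory bounded.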

Memory leak when running MCMC in parallel #230
