PRIAM is an interactive tool for proxy-based climate reconstruction and analysis. It allows users to import palaeoecological proxy data—such as fossil pollen, sediments, or other biological indicators—and apply multiple statistical and machine learning methods to reconstruct past climate conditions. The app provides instant model training, validation, and visualization, offering insights into model performance, feature importance, and uncertainty. Users can explore reconstructed climate variables through dynamic plots, comparative analyses, and summary statistics, making it a comprehensive platform for both research and teaching in palaeoclimatology.
This project implements multiple methods for quantitative palaeoclimate reconstruction from fossil pollen data, including:
- Modern Analogue Technique (MAT)
- Boosted Regression Trees (BRT)
- Weighted Averaging Partial Least Squares (WA-PLS)
- Random Forest (RF)
It also provides a web-based interface using Streamlit.
git clone https://github.com/yourusername/PyLae.git
cd pollen-reconconda env create -f environment.yml
conda activate pylaeoThe main pipeline is in main.py.
It trains models on training climate + pollen data and predicts for fossil pollen samples.
python main.py \
--train_climate ./data/train/AMPD_cl_worldclim2.csv \
--train_pollen ./data/train/AMPD_po.csv \
--test_pollen ./data/test/scrubbed_SAR.csv \
--model RF \
--target TANN \
--output_csv ./out/predictions.csvArguments:
--train_climate: CSV with climate variables (targets).--train_pollen: CSV with modern pollen counts.--test_pollen: CSV with fossil pollen data.--model: Model choice (MAT,BRT,WA-PLS,RF).--target: Target variable to reconstruct (e.g.,TANN).--output_csv: Where to save predictions.
Once predictions are saved, you can plot them with:
python visuals/plot.py \
--predictions_csv ./out/predictions.csv \
--output_file ./out/predictions.png \
--title "Reconstructed TANN"For an interactive interface:
streamlit run app.pyThis will launch a web UI at http://localhost:8501.
Features:
- Upload training and fossil pollen CSVs.
- Choose model + target variable.
- Run predictions interactively.
- View results as a table and time-series plot.
- Download predictions as CSV.
├── app.py # Streamlit web app
├── main.py # CLI pipeline
├── models/ # Model classes (MAT, BRT, WA-PLS, RF)
├── utils/dataloader.py # Data loading + preprocessing
├── visuals/plot.py # Visualization script
├── data/ # Example datasets
├── out/ # Output predictions + plots
├── requirements.txt # Dependencies
└── README.md # This file
- Datasets are aligned automatically (non-overlapping taxa filled with zeros).
- Predictions are saved with column name
Predicted_<target>. - Tested on Linux (Python 3.10).
Dael Sassoon — Palaeoecologist and Marie Curie Research Fellow (GEO3BCN-CSIC, Barcelona). Specialising in tropical palaeoecology and quantitative reconstruction of past climates. Developer of PRIAM, integrating palaeoecological data with statistical and machine learning models for climate reconstruction.
Jordan Sassoon — Computer Scientist. Focused on applied AI, data systems, and scientific visualization. Lead developer of the PRIAM interface, integrating backend modelling pipelines with an interactive analytical framework.
If you use PRIAM in your research, please cite as:
Sassoon, D., & Sassoon, J. (2025). PRIAM: A tool for Palaeoclimatic Reconstruction through Interactive Analysis & Modelling. GitHub repository: https://github.com/jordisassoon/PRIAM