This folder contains all materials for the GeOceanographers team project in MLGEO-2026. Our project uses machine learning to predict ocean surface pCO2 from satellite and buoy observations, drawing on NOAA coastal monitoring sites and open-ocean vessel monitoring campaigns.
TeamProject/
├── README.md ← You are here
├── GeOceanographers Intro Presentation.pptx ← Team introduction slides
└── GeOceanProject/ ← All project code, data, and documentation
    ├── README.md ← Technical guide: setup, notebooks, and run order
    ├── requirements.txt
    ├── notebooks/ ← Analysis notebooks (exploration → prep → training)
    ├── data/ ← Raw, processed, and training data + workflow docs
    ├── plots/ ← Visualization outputs
    └── docs/ ← Data source documentation
| What you need | Where to find it |
|---|---|
| Run the analysis or understand the code | GeOceanProject/README.md — setup, directory layout, notebook run order |
| Understand the data processing pipeline | GeOceanProject/data/DATA_ANALYSIS_WORKFLOW.md — step-by-step walkthrough with diagrams |
| Learn about the NOAA buoy dataset | GeOceanProject/docs/NOAA_BUOY_DATA_README.md — data source details and methodology |
| Regenerate data files from scratch | GeOceanProject/data/DATA_REGENERATION_GUIDE.md |
- Mary, Colin, Ellie, Arya
Part of the GeOceanographers repository for MLGEO-2026.