
NGWPC PI-7 PR#1

Open
dylanlee wants to merge 279 commits into NOAA-OWP:main from NGWPC:main

Conversation

@dylanlee

This PR represents the initial delivery of the code associated with NGWPC's auto-eval-coordinator repository to OWP. The repository contains code for a data pipeline that works together with HashiCorp Nomad and another repo called auto-eval-jobs to perform FIM evaluations.

dylanlee and others added 30 commits May 3, 2025 14:37
merge main into data-handler branch
This updated pipeline is still only capable of using mock data but
brings the pipeline up to date with the new inundate job
This seems to be working but should undergo a bit more testing
new pipeline.py working to the point where it can take in updated mock
data that points to the results of a stac query for the same huc that we
have mock HAND data for.
refactored pipeline.py that retains functionality of being able to work
with mock data
added the final job to the pipeline and also updated the job definitions
to use the full suite of config variables currently in the
autoeval-jobs .env file
job status now being updated and correct nomad job id being written
code in pipeline.py was overwriting job monitor status updates
Created a new branch for refactoring efforts. The commit in
feature/logging right before this commit was able to execute a
successful pipeline run with logging
Previously just the base class was in that file, but I decided it made
more sense to put the child classes there as well
Added garbage collection settings to the server block. These settings increase the frequency with which old dispatched jobs and evaluations are cleaned up and should substitute for nomad_memory_monitor.sh for the PW deployment of autoeval.
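For reference, Nomad's server-side garbage collection is tuned in the agent config's server block. A sketch of the kind of settings the commit describes (the specific values here are assumptions for illustration, not the repo's actual config):

```hcl
server {
  enabled = true

  # Assumed values: tighter than Nomad's defaults so finished
  # dispatched jobs and evaluations are purged sooner.
  job_gc_interval   = "1m"
  job_gc_threshold  = "15m"
  eval_gc_threshold = "15m"
}
```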
Continued building out docs in preparation for pi7 repo delivery
Added job_sizing_guide.md

Finished a draft of interpreting-reports.md

Finished a draft of batch-run-guide-ParallelWorks.md
Merge the main-for-pr branch with PI-7 deliverables into OWP's version of this repo's main branch with OWP-specific files
@dylanlee dylanlee marked this pull request as draft September 30, 2025 15:13
@dylanlee dylanlee marked this pull request as ready for review September 30, 2025 19:17
@DJackson2313

SWCM witness approval; release concurrence.

dylanlee and others added 17 commits October 15, 2025 12:15
Deleted the doc directory because docs already existed before merging
the repo with OWP.

Edited local-nomad/README.md so that the load job command has a dash
in auto-eval-coordinator in the --network argument. This was necessary
because the OWP repo added a dash to autoeval, and docker compose
appends the repo/directory name to the network that is created by the
repo's docker-compose-local.yml
Delete "doc" directory and edit local-nomad/README.md
The argument name batch_root makes more sense as the argument for the
batch root directory for this script
Rename output_root argument in tools/make_master_metrics.py to batch_root
Get rid of conditionals using fsspec path normalization in two more
places
Simplify fsspec file handling further
Simplify local file / S3 referencing to use fsspec.core.url_to_fs.
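The commits above consolidate path handling onto `fsspec.core.url_to_fs`, which is a real fsspec call that resolves a local path or URL to a (filesystem, normalized path) pair. A minimal sketch of the pattern (the helper name `open_any` is illustrative, not from the repo):

```python
import fsspec.core

def open_any(url: str, mode: str = "rb"):
    """Resolve a local path or a URL (file://, s3://, memory://, ...)
    to a filesystem object plus a normalized path, then open it.
    Remote protocols need the matching fsspec backend installed
    (e.g. s3fs for s3://)."""
    fs, path = fsspec.core.url_to_fs(url)
    return fs.open(path, mode)
```

Because `url_to_fs` normalizes the path for whichever filesystem it resolves, the conditionals that special-cased local versus S3 paths can be dropped.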
* Add tests verifying metrics aggregation deduplication

Added unit tests and test data to verify that the current
MetricsAggregator class successfully deduplicates metrics for a unique
"collection_id", "stac_item_id", "scenario" combination only in the case
where an exact or near duplicate is present. When rows with the same
index columns have different metrics, an error is raised indicating a
violation of idempotency due to code or data changes.

Also added a test docker compose file for the unit tests and updated the
README with instructions on running the tests. Since the tests passed I
went ahead and deleted the clean_agg_metrics.py script from tools since
its primary job was getting rid of near duplicates.
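The deduplication behavior described above can be sketched as follows. This is a hypothetical simplification, not the repo's `MetricsAggregator`; the index column names come from the commit message, and metric columns are assumed numeric:

```python
import pandas as pd

INDEX_COLS = ["collection_id", "stac_item_id", "scenario"]

def deduplicate_metrics(df: pd.DataFrame, atol: float = 1e-8) -> pd.DataFrame:
    """Keep one row per index-column combination. If rows sharing a key
    differ beyond a small tolerance, raise: that signals an idempotency
    violation from code or data changes rather than a harmless re-run."""
    kept = []
    for key, group in df.groupby(INDEX_COLS, sort=False):
        metrics = group.drop(columns=INDEX_COLS)
        first = metrics.iloc[0]
        # Near-duplicate check: every row must match the first within atol.
        if not all(((metrics - first).abs() <= atol).all(axis=1)):
            raise ValueError(f"conflicting metrics for key {key}")
        kept.append(group.iloc[[0]])
    return pd.concat(kept, ignore_index=True)
```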

* Update README.md
* Add aoi_stac_item_id and aoi_geom_path arguments

The goal of this commit is to refactor the pipeline,
submit_stac_batch.py, and the pipeline job definitions to not need an
AOI gpkg when --aoi_stac_item_id is provided along with a valid STAC
item ID string. Previously, --aoi_is_item was a boolean flag that
extracted the STAC item ID from the AOI gpkg name; when the flag was
on, the STAC item was queried by item ID instead of by geometry.
--aoi_is_item was changed to --aoi_stac_item_id, which accepts a
string; that string is used to query STAC items by item ID, and the
AOI is then extracted within the pipeline itself from the benchmark
STAC item when aoi_stac_item_id is used.

Pipeline code was changed to extract the geometry from the STAC item
provided in the --aoi_stac_item_id argument. The geometry will be held
in a GeoDataFrame and won't be persisted to disk. The --aoi argument will
become optional and was changed to --aoi_geom_path. So instead of
requiring --aoi we now require one of --aoi_geom_path or
--aoi_stac_item_id. When a --aoi_stac_item_id string is provided then
that string will be used when writing the pipeline outputs instead of
pulling the aoi name from the 'aoi_name' tag provided by the user.

submit_stac_batch.py was also changed. The big change to this script was
that we won't be extracting geometries from the benchmark STAC in this
script.

Nomad job definitions were changed to remove the aoi meta and add
optional --aoi_stac_item_id and --aoi_geom_path as optional meta
parameters. Conditional logic was added using a templated wrapper script
defined in the job definition around the command that invokes the
pipeline in the pipeline Nomad job depending on which meta parameter the
user provides.

* Fix conditional aoi argument passing in Nomad job definitions

The previous commit's changes to the nomad job definitions broke the
pipeline job. This commit's changes to the nomad job definitions
successfully allow for conditional dispatch of the coordinator task
depending on which aoi argument has been submitted to the parameterized
job.

I also fixed a small indentation bug in data_service.py and reformatted
according to the repo's agreed-upon line-length conventions

* Update README

Updated README.md in repo root to reflect the fact that for the test
pipeline to fully work it still currently needs access to the fimc-data
bucket so that the agreement job can access masks.

* Remove injection of pipeline args into environment

Removed the NOMAD_META variables related to the pipeline job from the
env stanza. They aren't necessary for the task to call the pipeline's
main.py

* Remove conditional argument handling from pipeline job def

Removed the complexity of the inline bash script from pipeline job
definition. Now all optional meta parameters that are related to the
call to main.py are fed into the coordinator task. When a parameter
isn't provided, main.py is fed an empty string for that argument and
resolves the argument to None during pipeline initialization
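The empty-string-to-None resolution described above can be sketched with argparse. The argument names come from the commits; the converter function is an illustrative assumption, not the repo's code:

```python
import argparse

def none_if_empty(value: str):
    """Nomad meta parameters the user leaves unset arrive as empty
    strings; treat those as the argument not having been provided."""
    return value or None

parser = argparse.ArgumentParser()
parser.add_argument("--aoi_stac_item_id", type=none_if_empty, default=None)
parser.add_argument("--aoi_geom_path", type=none_if_empty, default=None)
```

An empty string dispatched by the job definition then parses to None, so downstream code only has to check a single sentinel.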

* Add aoi_name tag derivation from aoi arguments

Modified the way the aoi_name tag is handled. The pipeline first looks
to see if it has been provided by a user and if it has that aoi_name tag
gets precedence. If it hasn't been provided then the aoi_name tag is
derived from aoi_stac_item_id or aoi_geom_path (whichever was provided)
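The precedence described above, sketched as a hypothetical helper (the function name and signature are assumptions for illustration):

```python
from pathlib import Path

def derive_aoi_name(user_tag, aoi_stac_item_id, aoi_geom_path):
    """A user-supplied aoi_name tag takes precedence; otherwise fall
    back to the STAC item ID, then to the geometry file's stem."""
    if user_tag:
        return user_tag
    if aoi_stac_item_id:
        return aoi_stac_item_id
    if aoi_geom_path:
        return Path(aoi_geom_path).stem
    raise ValueError("one of aoi_stac_item_id or aoi_geom_path is required")
```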

* Update comment for tags meta parameter
…ents

drawio diagram now has a tab with a version of the diagram dated Jan
1st, 2026 that shows a schematic for how the existing pipeline will be
updated to be able to perform depth evaluations
Re-introduced edit showing that user can submit either an aoi or a
stac-item-id into the diagram
