-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
I am just trawling through the code you wrote @mafdmi (LOL, looking at the commits I wrote some of this and I don't even remember 😆 ) and I will start writing down some questions I have :) With answers to these questions I think we can make a plan for finishing off a version version of the MSGCPP dataset for mlcast-datasets
- Where was the source data downloaded from? The paths currently point to internal DMI paths (https://github.com/mlcast-community/mlcast-dataset-msgcpp/blob/main/msgcpp_to_zarr/source.py#L7). I think you @irenelivia downloaded the source netCDF files from somewhere, right?
- sorry, I realised I am being an idiot, you mentioned this on Adding the MSGCPP dataset mlcast-datasets#13 , the data can be accessed here: https://msgcpp.knmi.nl/data-access.html
- Rather than using
rechunkeras you do here https://github.com/mlcast-community/mlcast-dataset-msgcpp/blob/main/msgcpp_to_zarr/zarr_store.py#L47 @mafdmi, did you try simply removing theencodinginformation? (this is what I realised one can do, and I used it in https://github.com/dmidk/nwp-forecast-zarr-creator/blob/main/zarr_creator/write_zarr.py#L66)- this hasn't been tried yet, so we could try that and maybe that way avoid using rechunker (which sometimes requires quite a bit of memory)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels