Skip to content

Fix distributed HydrostaticFreeSurfaceModel test that seem to break with memory error#4666

Merged
simone-silvestri merged 8 commits intomainfrom
ncc/distributed-hydrostaticmodel-tests
Jul 28, 2025
Merged

Fix distributed HydrostaticFreeSurfaceModel test that seem to break with memory error#4666
simone-silvestri merged 8 commits intomainfrom
ncc/distributed-hydrostaticmodel-tests

Conversation

@navidcy
Copy link
Copy Markdown
Member

@navidcy navidcy commented Jul 23, 2025

@navidcy navidcy requested a review from simone-silvestri July 23, 2025 23:32
@navidcy navidcy added testing 🧪 Tests get priority in case of emergency evacuation distributed 🕸️ Our plan for total cluster domination labels Jul 23, 2025
@navidcy
Copy link
Copy Markdown
Member Author

navidcy commented Jul 23, 2025

Hm... this tho passed: https://buildkite.com/clima/oceananigans-distributed/builds/9100#0198392b-7d42-48c3-994f-3695c482b433

So perhaps the errors are random/stochastic...

@simone-silvestri
Copy link
Copy Markdown
Collaborator

Yeah, I think it is kind of random, it might be connected to the timeout of the yaml. Let me try changing it.

@navidcy navidcy requested a review from glwagner July 25, 2025 21:15
@navidcy
Copy link
Copy Markdown
Member Author

navidcy commented Jul 27, 2025

It's happening more and more; eg #4674, #4672. Doesn't seem so "random"?

@navidcy
Copy link
Copy Markdown
Member Author

navidcy commented Jul 27, 2025

I increased the memory request a bit. They passed the first go. Shall we merge? Does it worth more investigating?

@simone-silvestri
Copy link
Copy Markdown
Collaborator

Let's merge this then see if the error continues appearing

@simone-silvestri simone-silvestri merged commit c299390 into main Jul 28, 2025
69 checks passed
@simone-silvestri simone-silvestri deleted the ncc/distributed-hydrostaticmodel-tests branch July 28, 2025 06:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

distributed 🕸️ Our plan for total cluster domination testing 🧪 Tests get priority in case of emergency evacuation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants