
sync#1

Open
huangzhibo wants to merge 3890 commits into BGI-flexlab:develop from broadinstitute:develop


@huangzhibo
Member

No description provided.

jgainerdewar and others added 29 commits June 14, 2024 16:04
Co-authored-by: Tristan Garwood <tristan.garwood@gmail.com>
Co-authored-by: Adam Nichols <anichols@broadinstitute.org>
Co-authored-by: Janet Gainer-Dewar <jdewar@broadinstitute.org>
Co-authored-by: Adam Nichols <anichols@broadinstitute.org>
Co-authored-by: Tom Wiseman <twiseman@broadinstitute.org>
Co-authored-by: Adam Nichols <aednichols@gmail.com>
…#7457)

After experimenting with GCP Batch for a while, we found that:
1. Batch can automatically retry preemption errors.
2. When Batch retries, there is no signal in the job status events; we need to check the VM logs.
3. Cromwell does not get any details about Batch retries, so the same jobId is kept even if the VM is recreated.
4. When the job status events report that the job failed due to a preemption error, the failure is final: Batch has already exhausted its retries.

This change removes all the code related to handling preemption errors and instead parses the job status events to derive the failure reason.

It also tries to detect the other potential exit codes and map them to better error messages.
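
The idea of deriving a failure reason from the job status events can be illustrated with a minimal Python sketch. This is not the Cromwell implementation (which is written in Scala). The function name `failure_reason`, the assumption that each event description contains the text `exit code <N>`, and the wording of the messages are all hypothetical. The code table reflects the reserved exit codes described in the Batch troubleshooting page referenced below and should be checked against it.

```python
import re
from typing import Optional, Sequence

# Reserved exit codes, as we understand them from the Batch troubleshooting
# docs. Treat this table as an assumption and verify it against the docs.
BATCH_RESERVED_EXIT_CODES = {
    50001: "VM preemption",
    50002: "VM reporting timeout",
    50003: "VM rebooted during execution",
    50004: "VM and task unresponsive",
    50005: "Task exceeded the maximum runtime",
    50006: "VM recreated during execution",
}

# Hypothetical pattern: assumes the event description mentions "exit code <N>".
_EXIT_CODE = re.compile(r"exit code (\d+)", re.IGNORECASE)


def failure_reason(event_descriptions: Sequence[str]) -> Optional[str]:
    """Return a readable failure reason from job status event descriptions.

    Scans from the most recent event backwards and returns a message for
    the first exit code found, or None if no event mentions an exit code.
    """
    for description in reversed(event_descriptions):
        match = _EXIT_CODE.search(description)
        if match:
            code = int(match.group(1))
            # Fall back to a generic message for non-reserved exit codes.
            return BATCH_RESERVED_EXIT_CODES.get(
                code, f"Task failed with exit code {code}"
            )
    return None
```

Because Batch has already exhausted its retries by the time a preemption appears in the status events, a caller of such a function could treat the returned reason as final rather than scheduling its own retry.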

Refs:
- [Batch automated task retries](https://cloud.google.com/batch/docs/automate-task-retries)
- [Batch exit codes](https://cloud.google.com/batch/docs/troubleshooting#reserved-exit-codes)
Co-authored-by: SophiaPerzan-DG <sophia.perzan@deepgenomics.com>
jgainerdewar and others added 30 commits November 7, 2025 11:43
Co-authored-by: Adam Nichols <aednichols@gmail.com>
Co-authored-by: Liz Baldo <lizbaldo@Liz-Baldo-H5267Y2XDX.local>