backup: PerNodeProcessorProgress stalls backup completion #151945

@jeffswenson

Description

I was inspecting a backup job that was stuck at 100% completion and found the following stacks in backup:

[Image: stack traces from the stuck backup job]

This suggests backup is waiting for PerNodeProcessorProgress to complete. The backup processor pulls from a channel whose capacity is numTotalSpans.
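To illustrate why a channel sized to numTotalSpans can delay completion, here is a minimal sketch, not the actual backup code. The names `progressUpdate`, `drainProgress`, `simulate`, and the demo constants are hypothetical, and `time.Sleep` stands in for whatever work each progress update costs. It assumes every span has queued one update and that the consumer handles them one at a time.

```go
package main

import (
	"fmt"
	"time"
)

// Hypothetical demo values, scaled far down from the cluster in this issue.
const demoSpans = 20
const demoPerUpdate = 5 * time.Millisecond

// progressUpdate is a hypothetical stand-in for one per-span progress message.
type progressUpdate struct{ spanID int }

// drainProgress consumes every queued update, spending perUpdate on each
// (assumed cost of handling one update), and returns how many updates it
// handled and how long the drain took.
func drainProgress(ch <-chan progressUpdate, perUpdate time.Duration) (int, time.Duration) {
	start := time.Now()
	handled := 0
	for range ch {
		time.Sleep(perUpdate)
		handled++
	}
	return handled, time.Since(start)
}

// simulate fills a channel whose capacity is numTotalSpans, closes it, and
// then drains it, mimicking a job that cannot finish until the queue is empty.
func simulate(numTotalSpans int, perUpdate time.Duration) (int, time.Duration) {
	ch := make(chan progressUpdate, numTotalSpans)
	for i := 0; i < numTotalSpans; i++ {
		ch <- progressUpdate{spanID: i} // never blocks: capacity == numTotalSpans
	}
	close(ch)
	return drainProgress(ch, perUpdate)
}

func main() {
	handled, elapsed := simulate(demoSpans, demoPerUpdate)
	fmt.Printf("handled %d updates in %v\n", handled, elapsed.Round(time.Millisecond))
}
```

Because the producers never block, the whole backlog can sit in the channel after the data has been written, and the drain time then grows linearly with the number of spans.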

On a large multi-region cluster, we observed that the channel can take up to five hours to drain completely (360k spans at 50ms per update).
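As a sanity check on the five-hour figure, the arithmetic can be sketched as below. The helper `drainTime` and the constant names are hypothetical, and the calculation assumes a full channel drained strictly one update at a time with a constant per-update cost.

```go
package main

import (
	"fmt"
	"time"
)

// Figures reported in this issue.
const observedSpans = 360_000
const observedPerUpdate = 50 * time.Millisecond

// drainTime is a hypothetical helper: the time to work through queuedUpdates
// sequentially when each update costs perUpdate.
func drainTime(queuedUpdates int, perUpdate time.Duration) time.Duration {
	return time.Duration(queuedUpdates) * perUpdate
}

func main() {
	// 360,000 updates * 50ms = 18,000s = 5h
	fmt.Println(drainTime(observedSpans, observedPerUpdate)) // prints "5h0m0s"
}
```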

Jira issue: CRDB-53550

Metadata

Labels

A-disaster-recovery
C-bug: Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
O-support: Would prevent or help troubleshoot a customer escalation - bugs, missing observability/tooling, docs
T-disaster-recovery
branch-release-23.2: Used to mark GA and release blockers, technical advisories, and bugs for 23.2
