feat: Custom backup and restoration by alexarefev · Pull Request #794 · Netcracker/KubeMarine

alexarefev · 2026-03-10T11:39:56Z

Description

Provide an ability to manage periodic ETCD backups
Provide an ability to restore ETCD from periodic backup

Solution

Change the backup procedure to create CronJob that will manage periodic backup

Test Cases

TestCase 1

Make sure the backup is working

Test Configuration:

Hardware: 4CPU/8GB
OS: Ubuntu 22.04
Inventory: AllInOne

Steps:

Run backup with the following options

Results:

Before	After
---	CronJob is running and working correctly

TestCase 2

Make sure the restoration is working

Test Configuration:

Hardware: 4CPU/8GB
OS: Ubuntu 22.04
Inventory: AllInOne

Steps:

Run restoration with the following options

Results:

Before	After
---	Restoration has been finished successfully

Checklist

I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
There is no breaking changes, or migration patch is provided
Integration CI passed
There is no merge conflicts

theboringstuff · 2026-03-16T05:44:54Z

documentation/Maintenance.md

 * make_descriptor
 * pack

+### Periodic ETCD backups


So, to enable periodic backups, we need to run separate backup procedure. Problems here is that backup procedure goal originally (as I understand) is to actually backup data, not to install backup job which will perform backups in future. It seems strange to use backup to perform install. If we need clusters to be installed with disabled etcd fsync, and we recommend that backup job is enabled when fsync is disabled, it would be safer if we install backup job right in install procedure (where we disable fsync).

The provisioner/storageclass need not to be present immediately, it could be installed later, and once installed it can provision the volume for the job

theboringstuff · 2026-03-16T05:58:38Z

documentation/Maintenance.md

+restore_plan:
+  etcd:
+    image: registry.k8s.io/etcd:3.6.6-0
+    snapshot: /opt/local-path-provisioner/pvc-e3b0d6c5-495d-4887-90d9-000d6b3d4d00_kube-system_etcd-backup/etcd-snapshot-20260220_103000.db


What if some other provisioner is used, e.g. some network-attached like NFS. In this case there will be no directory on the host by default, since volume is not mounted. Do we expect users to mount the volume on their own and find the mount point?

Maybe we could use some additional "backup download" job, which we will run only during restore, which will mount backup volume and copy latest backup to some well-known node direcotry. Then kubemarine will take backup from this well-known location on this particular host

theboringstuff · 2026-03-16T06:00:30Z

kubemarine/procedures/restore.py

+    if cluster.procedure_inventory.get('restore_plan', {}).get('etcd', {}).get('snapshot', {}):
+        cluster.log.debug('The particular snapshot will be used')
+        path_to_snap = cluster.procedure_inventory.get('restore_plan', {}).get('etcd', {}).get('snapshot', {})
+        first_control_plane = cluster.nodes['control-plane'].get_first_member()


We probably should not assume that backup will be present on first master. E.g. local path provisioner could create volume on another node. Maybe using "download backup" job (and checking on which node it run) would be better

alexarefev added 13 commits March 10, 2026 14:23

backup

8f98e65

backup/restore

f10f8ba

feat: excluded tasks

f3b1d2e

feature: comment

3c2da80

feature: rework

47744d8

feature: enabling/disabling

c9afdf9

feature: backup docs

246df5d

feature: docs

b5544e9

feat: docs, last snapshot

18b3497

fix

3c016e9

fix

9bc6caf

fix: ident

02000d5

feat: docs

1c8552d

alexarefev added improvement New feature or request python Pull requests that update Python code labels Mar 13, 2026

alexarefev assigned DmitriiRabenok Mar 13, 2026

alexarefev added 4 commits March 13, 2026 13:37

comments; fix

a15fd57

empty string

7ab47d4

comment

8981b8a

docs

d869831

alexarefev requested review from disa1217, nikhil1697, pranavcracker and theboringstuff March 14, 2026 07:13

alexarefev marked this pull request as ready for review March 14, 2026 07:15

theboringstuff requested changes Mar 16, 2026

View reviewed changes

alexarefev added feature and removed improvement New feature or request labels Mar 16, 2026

alexarefev marked this pull request as draft March 16, 2026 13:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Custom backup and restoration#794

feat: Custom backup and restoration#794
alexarefev wants to merge 17 commits intomainfrom
feature/custom_restoration

alexarefev commented Mar 10, 2026 •

edited

Loading

Uh oh!

theboringstuff Mar 16, 2026

Uh oh!

theboringstuff Mar 16, 2026

Uh oh!

theboringstuff Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

alexarefev commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Solution

Test Cases

Checklist

Uh oh!

theboringstuff Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

theboringstuff Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

theboringstuff Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

alexarefev commented Mar 10, 2026 •

edited

Loading