Skip to content

Conversation

@josepselga
Copy link
Contributor

@josepselga josepselga commented Nov 25, 2025

This feature aims to make the OpenNebula FabricManager Appliance stateful concerning GPU partitioning. Upon initialization or recovery, the FabricManager must automatically detect and re-apply the last successfully configured set of GPUs partitions across the NVSwitches.
This ensures workload continuity and eliminates the need for manual intervention to restore the correct topology after a power cycle or crash.

…er last configured partitions (Stateful appliance)

Signed-off-by: josepselga <jselga@opennebula.io>
…er last configured partitions (Stateful appliance) - Hard recovery Mode

Signed-off-by: josepselga <jselga@opennebula.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

OpenNebula FabricManager - Automatically recover last configured partitions (Stateful appliance)

2 participants