-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Open
Labels
Getting StartedUse this label to identify setup, installation, or onboarding issues.Use this label to identify setup, installation, or onboarding issues.NemoClaw CLIUse this label to identify issues with the NemoClaw command-line interface (CLI).Use this label to identify issues with the NemoClaw command-line interface (CLI).enhancement: featureUse this label to identify requests for new capabilities in NemoClaw.Use this label to identify requests for new capabilities in NemoClaw.priority: highImportant issue that should be resolved in the next releaseImportant issue that should be resolved in the next release
Description
Description
After a host reboot, users must manually run multiple commands to reconnect to their sandbox: start Docker, restart the OpenShell gateway, check sandbox state, then connect. This is documented in the troubleshooting guide but could be a single command.
A nemoclaw reconnect command would automate the recovery sequence:
- Verify the container runtime is running
- Start the OpenShell gateway if it's not running (
openshell gateway start --name nemoclaw) - Wait for the gateway to become healthy
- Check sandbox state (
openshell sandbox list) - Start port forward and connect to the default (or named) sandbox
Context
- Issue Please add restart commands to documentation #469 requested documentation for reconnecting after a reboot — addressed by the docs PR
- PR docs(troubleshooting): explain reconnect flow after host reboot #904 (by @WuKongAI-CMU) also attempted to document this but was closed due to PR limit
- The building blocks already exist in the codebase:
sandboxConnect()inbin/nemoclaw.jshandles port forwarding and connectionbin/lib/onboard.jshas gateway start, health check, and CoreDNS fix logicregistry.listSandboxes()provides the default sandbox name
Open questions
- Does
openshell gateway start --name nemoclawwork correctly when the gateway container exists but is stopped? Or does it require a destroy/recreate cycle? - Is the CoreDNS fix needed again after a reboot, or only on first setup?
- Should this be a new
nemoclaw reconnectcommand, or shouldnemoclaw <name> connectbe smarter about detecting and recovering from a stopped gateway?
Suggested implementation
async function reconnect(sandboxName) {
// 1. Check gateway
const gwInfo = runCaptureOpenshell(["gateway", "info", "-g", "nemoclaw"], { ignoreError: true });
if (!gwInfo || !gwInfo.includes("Running")) {
console.log(" Starting OpenShell gateway...");
runOpenshell(["gateway", "start", "--name", "nemoclaw"]);
// wait for health
}
// 2. Find sandbox
const name = sandboxName || registry.listSandboxes().defaultSandbox;
if (!name) {
console.error(" No sandbox registered. Run `nemoclaw onboard` first.");
process.exit(1);
}
// 3. Connect
sandboxConnect(name);
}Related
- Please add restart commands to documentation #469 — original user request for reboot recovery docs
- PR docs(troubleshooting): explain reconnect flow after host reboot #904 — previous docs-only attempt (closed, PR limit)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
Getting StartedUse this label to identify setup, installation, or onboarding issues.Use this label to identify setup, installation, or onboarding issues.NemoClaw CLIUse this label to identify issues with the NemoClaw command-line interface (CLI).Use this label to identify issues with the NemoClaw command-line interface (CLI).enhancement: featureUse this label to identify requests for new capabilities in NemoClaw.Use this label to identify requests for new capabilities in NemoClaw.priority: highImportant issue that should be resolved in the next releaseImportant issue that should be resolved in the next release