release: v1.2.0 - Agent resilience and crash recovery#18
Merged
williamzujkowski merged 1 commit intomainfrom Feb 2, 2026
Merged
release: v1.2.0 - Agent resilience and crash recovery#18williamzujkowski merged 1 commit intomainfrom
williamzujkowski merged 1 commit intomainfrom
Conversation
## Summary - Add agent process resilience phase with memory watchdog, cgroups limits, and session persistence - Increase default RAM to 16GB and swap to 8GB for Claude CLI memory leak protection - Add RESOURCES.md with parallel agent memory planning guide - Clean up vestigial files and fix documentation accuracy ## Changes ### Added - Claude memory watchdog systemd service (warns 8GB, kills 13GB) - `run-claude-limited` cgroups wrapper for hard memory limits - `agent-session` tmux wrapper with session persistence - Enhanced `vm-health-check` with memory trend prediction - `--memory` and `--vcpus` flags for setup_cloud.sh - RESOURCES.md comprehensive resource planning guide ### Fixed - README: swap size 4GB → 8GB (accuracy) - README: updated directory structure - Removed .mcp.json from git (local config) - Removed vestigial guest/vm-health-check.sh (embedded in bootstrap) ### Documentation - Updated CHANGELOG with all new features - Updated CLAUDE.md quick reference - Added agent resilience commands to README Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This release adds comprehensive agent process resilience features to prevent and recover from AI CLI crashes (particularly Claude Code memory leaks).
Key Features
Agent Resilience Phase (New Bootstrap Phase 8)
run-claude-limitedwrapper enforces hard limitsagent-sessiontmux wrappervm-health-checkwith trend analysisResource Planning
--memoryand--vcpusflags forsetup_cloud.shRESOURCES.mdwith parallel deployment guideCleanup
.mcp.jsonfrom git (local MCP config)guest/vm-health-check.sh(now embedded in bootstrap)New Commands (Inside VM)
Test Plan
make lintpasses🤖 Generated with Claude Code