Summary
Currently, Studio provides visual runtime monitoring (Activity Map, Activity Stream, Cockpit overlay), but lacks automated detection and alerting for potential issues during long-running agent sessions.
Current State
✅ Already implemented:
- Real-time status visualization (thinking, tool_running, awaiting, error, idle)
- Per-agent memory usage tracking with color-coded warnings (1GB/2GB thresholds)
- Activity Stream logging all events across sessions
- Cockpit overlay with manual RESTART/STOP controls
❌ Not yet implemented:
- Automated alerts when agents stay in error state too long
- Detection of infinite loops or repeated failed operations
- Dangerous operation warnings (destructive commands, unexpected file deletions)
- Cost/token usage tracking per session
- Configurable thresholds and notification rules
Motivation
When managing multiple agents in long sessions, it's easy to miss one agent silently failing or looping. Automated alerts would catch these issues before they waste time or cause damage.
Raised via Reddit feedback from u/draconisx4.
Proposed Features (prioritized)
- Error state timeout alert — notify when agent is in error for > N minutes
- Loop detection — flag repeated identical tool calls or error patterns
- Dangerous operation warning — intercept
rm -rf, force push, DROP TABLE etc.
- Session cost tracking — token/API cost estimation per agent
- Custom alert rules — user-configurable thresholds and actions
Open Questions
- Should alerts be in-app notifications, system notifications, or both?
- Should dangerous operation detection block the action or just warn?
Summary
Currently, Studio provides visual runtime monitoring (Activity Map, Activity Stream, Cockpit overlay), but lacks automated detection and alerting for potential issues during long-running agent sessions.
Current State
✅ Already implemented:
❌ Not yet implemented:
Motivation
When managing multiple agents in long sessions, it's easy to miss one agent silently failing or looping. Automated alerts would catch these issues before they waste time or cause damage.
Raised via Reddit feedback from u/draconisx4.
Proposed Features (prioritized)
rm -rf, force push, DROP TABLE etc.Open Questions