-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
needs-triageNew issue that hasn't been reviewed/prioritized yetNew issue that hasn't been reviewed/prioritized yettaskGeneral work item (implementation, setup, cleanup) – most common labelGeneral work item (implementation, setup, cleanup) – most common label
Milestone
Description
Description
Build the initial Appliance-side Q-Learning agent and a dummy network environment so full episodes can be run locally on the Pi (before real telemetry integration).
Why: Validates agent composition with shared core and allows early loop testing.
Type
- Feature
Focus Area (pick one)
- Appliance Core (Pi edge)
Priority
- Critical
Acceptance Criteria
-
DummyNetworkEnvwith discrete state/action space and basic reward function -
ApplianceQAgentcomposes shared TabularQ + replay buffer + config - Runs 5000+ steps/episodes end-to-end without crashing
- Logs episode stats (total reward, length, avg reward)
- Uses only shared types/config
- Simple test script shows reward trend improvement over episodes
Blocker / Dependencies
- Phase 1 shared core issues
Notes / Links
- Target files: appliance/src/learning/q_learning/agent.py and dummy_env.py
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
needs-triageNew issue that hasn't been reviewed/prioritized yetNew issue that hasn't been reviewed/prioritized yettaskGeneral work item (implementation, setup, cleanup) – most common labelGeneral work item (implementation, setup, cleanup) – most common label
Projects
Status
Manual QA Testing