feat: consolidate vnext workstream (supersedes #12) by digitarald · Pull Request #15 · microsoft/agentrc

digitarald · 2026-02-22T22:47:01Z

Explainer

This PR is the consolidated "vnext" integration from digitarald/primer into upstream main.

The goal is to land one coherent, reviewable change set instead of several fragmented PRs. It combines CLI work, VS Code extension parity updates, policy/readiness improvements, and repository workflow/docs hardening.

Why This Exists

Previous work was split across branches/PRs, which made end-to-end review difficult.
Several features (policy engine, readiness/reporting, extension command flows, docs/workflows) are interdependent.
A single integration PR gives maintainers one place to validate behavior, CI, and release impact.

What Changed (High Level)

CLI surface + command orchestration updates across src/commands/* and src/cli.ts.
Service-layer expansion/refactors in src/services/* (readiness, evaluator, analyzer, policy modules, Git/GitHub/Azure paths).
Extension-side improvements in vscode-extension/src/* for command UX and integration flow.
Test coverage additions/updates in src/services/__tests__/*.
Repo governance and automation updates in .github/* (CI/eval/release workflows, templates, prompts, agent files).
Docs and examples refresh (README.md, CONTRIBUTING.md, examples/*, PLAN.md).

Reviewer Guide

Start here for fastest signal:

.github/workflows/ci.yml and repo config changes
src/services/policy/* and src/services/readiness.ts
src/commands/* integration points
vscode-extension/src/commands/* + vscode-extension/src/services.ts
Tests under src/services/__tests__/*

Risk and Compatibility

Main risk is integration breadth, not a single risky algorithmic change.
Mitigated by broad service tests and extension/CLI type/build checks.
Expected behavior: additive improvements plus refactors; no intentional breaking CLI contract documented.

Validation

npm run lint
npm run format:check (tracked files)
npm run typecheck
npm run test
npm run build
cd vscode-extension && npx tsc --noEmit && node esbuild.mjs

Supersedes

Supersedes fix: resolve all ESLint errors — add node globals, disable base no-unused-vars, fix unused imports, auto-fix import order #12

…ing and CLI options

feat(eval): implement model listing functionality in eval command feat(copilot): create function to extract and list Copilot models from CLI help feat(evaluator): enhance trajectory viewer with phase filtering for tool calls feat(tui): add readiness report feature with detailed output and user interaction

…text - Added a systemMessage field to primer.eval.json to clarify response context. - Updated evalScaffold.ts to include instructions for incorporating the system message in generated cases. - Modified evaluator.ts to use a default system message when none is provided, ensuring responses are relevant to the repository.

…d enhance TUI - Removed the analyze command from the CLI as it is no longer needed. - Updated the Primer evaluation workflow to include timestamped output files and improved error handling. - Enhanced the TUI to manage multiple Copilot models for evaluation and judging, allowing users to cycle through available models. - Improved user prompts and messages throughout the TUI for better clarity and user experience. - Adjusted the structure of the evaluation results display and added readiness report functionality.

- Fix TypeScript syntax errors in tui.tsx and github.ts - Create visual report generator service with beautiful HTML output - Add --visual flag to readiness command for HTML reports - Implement batch-readiness command for multi-repo visual reports - Add BatchReadinessTui for interactive repository selection - Support both GitHub and Azure DevOps repository sources - Include summary cards, pillar performance, and level distribution charts - Update documentation with visual report examples Co-authored-by: pierceboggan <1091304+pierceboggan@users.noreply.github.com>

- Added support for selecting and running evaluations with a new eval-pick status. - Implemented batch processing options for GitHub and Azure DevOps. - Introduced a logging mechanism to track activity and status updates. - Enhanced user interface with spinner animations and improved status indicators. - Refactored code to check for eval configuration on mount and display relevant messages. - Updated command hints for better user guidance during interactions. - Removed unused readiness report functionality and related types.

…rt generation - Implement tests for `ensureDir` and `safeWriteFile` in `fs.test.ts`. - Create comprehensive tests for `runReadinessReport` in `readiness.test.ts`, covering various criteria and pillars. - Add tests for `generateVisualReport` in `visualReport.test.ts`, ensuring correct HTML output and content. - Update `generateCopilotInstructions` to use a new preferred model. - Enhance TUI to support model selection and generation options for copilot instructions and agents. - Introduce `tsup` configuration for building the project.

…eport Add visual AI readiness reports with batch processing

…ove error handling

… a prioritized fix list

- Added support for additional languages (C#, Java, Ruby, PHP) in the analyzeRepo function. - Updated detectPackageManager to recognize new package managers (Maven, Gradle, Bundler, Composer). - Improved handling of pnpm workspace files by skipping comment-only lines and supporting inline arrays. fix: validate Azure DevOps slugs and improve error handling - Introduced validateAdoSlug function to ensure organization, project, and repo names are valid. - Updated API calls in Azure DevOps service to use validated slugs. - Enhanced error handling in checkRepoHasInstructions to throw descriptive errors on request failures. refactor: streamline Copilot CLI path resolution - Cached Copilot CLI path to avoid redundant lookups. - Improved findCopilotCliPath to handle platform-specific paths more effectively. - Added glob pattern matching for VS Code extension paths. chore: update evalScaffold and evaluator services - Refactored generateEvalScaffold to use withCwd for better directory management. - Simplified runEval by using assertCopilotCliReady for CLI path resolution. - Removed redundant EvalCase and EvalConfig type definitions from evaluator. fix: sanitize error messages in git push - Added error handling in pushBranch to sanitize embedded credentials from error messages. feat: enhance instruction generation and PR body creation - Updated generateCopilotInstructions to use withCwd for improved directory handling. - Created utility functions for building PR bodies for configurations and instructions. - Updated BatchTui and BatchTuiAzure to use DEFAULT_MODEL for instruction generation. chore: add utility functions for file system operations - Introduced validateCachePath to prevent path traversal vulnerabilities. - Enhanced safeWriteFile to reject symlinks and ensure safe file writing.

…nalysis

…ve workspace detection - Added support for detecting non-JS monorepos (Cargo, Go, .NET, Gradle, Maven). - Updated RepoApp and RepoAnalysis types to include ecosystem and manifestPath. - Improved workspace type detection to accommodate additional ecosystems. - Refactored app resolution logic to handle non-JS monorepos when JS apps are insufficient. fix(azureDevops): encode memberId in accounts URL - Updated accounts URL construction to properly encode memberId. refactor(copilot): implement caching for CLI path resolution - Added caching mechanism for Copilot CLI path with a TTL of 5 minutes. - Improved path resolution logic to handle different platforms. refactor(evalScaffold): simplify progress callback type - Updated onProgress callback type to use a more concise parameter. refactor(evaluator): define CopilotClient and CopilotSession interfaces - Introduced interfaces for better type safety and clarity in Copilot session management. refactor(instructions): enhance prompt for generating Copilot instructions - Updated prompt to clarify analysis requirements and include additional tech stack files. refactor(readiness): streamline readiness checks with improved status handling - Refactored readiness criteria checks to return structured status and reason. fix(ui): handle errors during repo loading and processing - Added error handling for repo loading and processing in BatchTui and BatchTuiAzure components. chore(tui): improve error logging for repo analysis failures - Enhanced error logging to provide clearer feedback on repo analysis issues. docs(cwd): add warning about process.chdir() side effects - Updated documentation to clarify the implications of using process.chdir(). refactor(fs): export utility functions for file system operations - Made fileExists, safeReadDir, and readJson functions public for broader usage.

… tests refactor(analyzer): normalize package.json and Cargo.toml paths in workspace resolution refactor(copilot): normalize CLI path when found in Copilot CLI path resolution

…prove error reporting

…used-vars, fix unused imports, auto-fix import order

…ing, dead stubs, typo

…oncurrency control

TypeScript's compiler handles undeclared variable checks natively. Disabling ESLint's no-undef prevents false positives for Node.js globals (process, console, setTimeout, fetch, Buffer) per typescript-eslint recommendation.

…ns-command" This reverts commit 8db6aed, reversing changes made to 0384387.

Dogfood improvements: CLI UX, extension polish, safeWriteFile, docs split

…R service, and UX polish

fix(vscode): harden extension commands with overwrite flows, shared PR service, and UX polish

refactor: consolidate PR file handling

…t ordering - isAllowedSystemAlias() now returns true on win32 since lstat already confirmed no symlinks; realpath differences are just 8.3/case normalization - Fix import ordering in 7 files to satisfy import-x/order lint rule

- Replace blanket isAllowedSystemAlias win32 return with ancestor lstat walk to correctly distinguish 8.3 normalization from real symlinks - Fix policy loadPolicy catch to handle cross-realm errors (instanceof Error fails across vitest worker threads)

Rewrite Readiness with Signal Engine

Introduce a new plugin-based policy engine that runs alongside the existing readiness system in shadow mode for safe validation. New modules in src/services/policy/: - types.ts: Signal, Recommendation, Grade, PolicyContext interfaces - compiler.ts: compiles policy configs into executable plugin chains - loader.ts: loads and validates plugin chains from policy sources - engine.ts: executes plugins, computes scores and grades - adapter.ts: bridges engine reports to legacy ReadinessReport format - shadow.ts: shadow mode logging for comparison with legacy path Changes to existing code: - readiness.ts: adds optional 'shadow' mode and 'engine' field on ReadinessReport; refactors isConfigSourced tracking for clarity - utils/fs.ts: clarifies validateCachePath caller responsibilities - README.md: documents new policy/ directory and links plugin guide Includes comprehensive test coverage (7 new test files, all 440 tests pass) and plugin authoring documentation in docs/plugins.md. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: add plugin-based policy engine with shadow mode

…ker state

pierceboggan and others added 30 commits February 3, 2026 22:05

chore: vnext updates

640c7c6

Refactor code structure for improved readability and maintainability

05a98bf

Enhance Copilot integration and evaluation features with new scaffold…

4019d4e

…ing and CLI options

Initial plan

09c7cee

refactor: update key hints for clarity in TUI

2cda58d

feat: update trajectory viewer with new design and metrics display

d1db068

Merge pull request #10 from pierceboggan/claude/create-ai-readiness-r…

fb4d512

…eport Add visual AI readiness reports with batch processing

Enhance PrimerTui: Add eval case generation with Copilot SDK and impr…

e87dd2f

…ove error handling

feat: add multi-model code review prompt and synthesize findings into…

2c8843e

… a prioritized fix list

feat: enhance eval case generation prompts for deeper architectural a…

764d00c

…nalysis

refactor(tests): normalize cache root path usage in validateCachePath…

aff8ff2

… tests refactor(analyzer): normalize package.json and Cargo.toml paths in workspace resolution refactor(copilot): normalize CLI path when found in Copilot CLI path resolution

feat(eval): implement timeout handling for scaffold generation and im…

cd48e73

…prove error reporting

fix: resolve all ESLint errors — add node globals, disable base no-un…

e7d3286

…used-vars, fix unused imports, auto-fix import order

fix: remove dead code, fix import spacing and double blank lines

6f3bcfd

fix: address PR review — fix pkg name, version, PAT leak, error handl…

24b110d

…ing, dead stubs, typo

Add CI workflow

15f7676

ci: harden CI with parallel jobs, format check, matrix testing, and c…

acca7ce

…oncurrency control

ci: harden CI workflow on main for PR checks

b0854e4

ci: re-trigger PR checks

5d52700

update dependencies for eslint and vitest

c1af327

fix: disable no-undef for TypeScript files

954a95d

TypeScript's compiler handles undeclared variable checks natively. Disabling ESLint's no-undef prevents false positives for Node.js globals (process, console, setTimeout, fetch, Buffer) per typescript-eslint recommendation.

Merge readiness

a22a220

Harald Kirschner and others added 15 commits February 16, 2026 18:57

Revert "Merge pull request #17 from digitarald/consolidate-instructio…

e385dad

…ns-command" This reverts commit 8db6aed, reversing changes made to 0384387.

Merge pull request #25 from digitarald/dogfood-improvements

32ef05a

Dogfood improvements: CLI UX, extension polish, safeWriteFile, docs split

fix(vscode): harden extension commands with overwrite flows, shared P…

9e55d36

…R service, and UX polish

Merge pull request #26 from digitarald/vscode-ext-hardening

03c3446

fix(vscode): harden extension commands with overwrite flows, shared PR service, and UX polish

refactor: consolidate PR file handling

2a4c080

Merge pull request #27 from digitarald:digitarald/git-cleanup

74481cb

refactor: consolidate PR file handling

Adds Azure DevOps PR support to the VS Code extension

d4a090b

docs: update PR support details

63c0021

Enhance readiness report output options

b6dd1b7

Add win32 file restore error handling

e90dabe

Merge pull request #28 from digitarald/digitarald/bitter-swallow

5bba153

Rewrite Readiness with Signal Engine

Merge pull request #29 from digitarald/digitarald/policy-engine

067e675

feat: add plugin-based policy engine with shadow mode

digitarald mentioned this pull request Feb 22, 2026

fix: resolve all ESLint errors — add node globals, disable base no-unused-vars, fix unused imports, auto-fix import order #12

Closed

5 tasks

Harald Kirschner added 10 commits February 22, 2026 15:27

Build extension cross-platform on release and update deps

35033a3

Update repository links to pierceboggan

aa06210

fix(vscode-extension): resolve vsce packaging blockers

e44b4ec

chore(lint): fix visual report imports

701ca6d

feat: add maxWidth prop to banners

30645e6

fix(tui): fix Esc key handling, guard idle keys, remove dead modelPic…

55cc9d4

…ker state

chore: remove .vsix from repo, add to gitignore

ee42d27

refactor: improve instruction generation clarity

3443e41

chore: update version to 2.0.0

ad1a098

refactor: simplify shimSdkImportMeta setup

dea45f3

digitarald self-assigned this Feb 24, 2026

digitarald merged commit be2b2b6 into microsoft:main Feb 24, 2026
8 of 9 checks passed

This was referenced Feb 24, 2026

add support for Windows platform in Copilot CLI path resolution #1

Closed

feat: Add Windows support for Copilot CLI #7

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: consolidate vnext workstream (supersedes #12)#15

feat: consolidate vnext workstream (supersedes #12)#15
digitarald merged 106 commits intomicrosoft:mainfrom
digitarald:primer/vnext-refresh

digitarald commented Feb 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

digitarald commented Feb 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Explainer

Why This Exists

What Changed (High Level)

Reviewer Guide

Risk and Compatibility

Validation

Supersedes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

digitarald commented Feb 22, 2026 •

edited

Loading