Skip to content

pitimon/8-habit-ai-dev

Repository files navigation

8 Habits of Effective AI-Assisted Development

License: MIT Claude Code Plugin Skills EU AI Act Habits Version Wiki

📖 Full documentation: Wiki — deep-dive guides per step, FAQ, Troubleshooting, and the 8 Habits Reference.

"ทำเสร็จ ≠ ทำดี" — Shipping code is not the same as shipping good code.

AI coding tools are powerful — but "build me X" without requirements, review, or staging creates fast, fragile code. This plugin adds the discipline AI lacks: 17 skills across a 7-step workflow, grounded in Covey's 8 Habits of effective development.


Table of Contents

Get Started

The Framework

Deep Dives

Reference


The Problem

AI coding tools (Claude Code, Cursor, Copilot, Codex) are powerful — but they amplify whatever process you bring to them. No process? You get fast, fragile code that works in demo but breaks in production.

The 7 most common mistakes:

  1. No requirements — jumping straight to "build me X"
  2. No design — letting AI decide architecture
  3. No task breakdown — one giant prompt for everything
  4. No context — AI doesn't know your codebase
  5. No review — shipping AI output without reading it
  6. No staging — deploying directly to production
  7. No monitoring — "it works on my machine" mindset

This plugin provides a skill for each step — not as a gate, but as a habit.


Design Principle

We follow the "Thin Harness, Fat Skills" pattern — the session hook is bounded (≤300 tokens, enforced in hooks/session-start.sh and documented in CLAUDE.md), and the intelligence lives in on-demand markdown skills loaded when you invoke them. The harness gets out of the way; the skills do the work.

The same principle is documented independently by Garry Tan (President & CEO, Y Combinator) in his 2026 essay "Thin Harness, Fat Skills" and shipped in gbrain. We arrived at it from workflow discipline; he arrived at it from building a brain — same conclusion.


Quick Start

Install:

claude plugin marketplace add pitimon/8-habit-ai-dev
claude plugin install 8-habit-ai-dev@pitimon-8-habit-ai-dev

Use (restart Claude Code, then invoke any skill):

/requirements       # Before you build anything
/review-ai          # Before you commit anything
/cross-verify       # Before you ship anything
/whole-person-check # Assess Body/Mind/Heart/Spirit balance

Verify installation: After restarting, you should see ## 8-Habit AI Dev Active in the session banner with the 7-step workflow reminder.

New to the plugin? Start with /workflow for a guided walkthrough, or see Use Cases to find the right skill for your situation.

Two commands to install. The plugin loads a session reminder and makes 17 skills available.


The 7-Step Workflow

Each step maps to one of Covey's 8 Habits — the habit explains why the step matters.

Step 0          Step 1          Step 2         Step 3
/research  ───→ /requirements ─→ /design  ────→ /breakdown
H5:Understand   H2:End in Mind   H8:Find Voice  H3:First Things

Step 4          Step 5          Step 6         Step 7
/build-brief ─→ /review-ai ───→ /deploy-guide → /monitor-setup
H5:Understand   H4:Win-Win      H1:Proactive   H7:Sharpen Saw

You don't need all steps every time. Start with /requirements before building and /review-ai before committing — those two alone eliminate most Vibe Coding problems.


Skills Reference

Workflow Skills (Steps 0-7)

Skill Step Habit Purpose
/research 0 H5: Seek First to Understand Investigate with depth levels (Quick/Standard/Deep), modes (General/Compare/Audit), and source verification
/requirements 1 H2: Begin with End in Mind Draft PRD — what, why, who, scope, success criteria
/design 2 H8: Find Your Voice Surface architecture decisions for human judgment
/breakdown 3 H3: Put First Things First Decompose into atomic tasks, prioritize by importance
/build-brief 4 H5: Seek First to Understand Problem statement gate + context brief before implementing
/review-ai 5 H4: Think Win-Win 4-level verdict (PASS/CONCERNS/REWORK/FAIL) + dimension balance
/deploy-guide 6 H1: Be Proactive Staging-first deployment with rollback plan
/monitor-setup 7 H7: Sharpen the Saw Set up health checks, alerting, error tracking

Assessment Skills (Use Anytime)

Skill Habit Purpose
/cross-verify H1-H8 17-question checklist + dimension summary (Body/Mind/Heart/Spirit)
/consistency-check H5 + H1 Cross-artifact analyzer — 5 detection passes (Coverage, Drift, Ambiguity, Underspec, Inconsistency) over persisted PRD↔design↔tasks (v2.15.0, ADR-013)
/whole-person-check H8: Find Your Voice 4-dimension assessment (1-5 scale) with AI Blind Spot detection
/security-check H1: Be Proactive Focused OWASP security lens — secrets, injection, auth, deps
/reflect H7: Sharpen the Saw 5-question micro-retrospective (5 min max) with action tracking
/workflow All Guided 7-step walkthrough — invoke or skip each step
/calibrate H8: Find Your Voice Self-assessment (5-7 questions) → writes ~/.claude/habit-profile.md so other skills adapt verbosity to your maturity level
/using-8-habits H5 + H8 Onboarding meta-skill — all 17 skills + decision tree for "which skill next?"
/eu-ai-act-check H1 + H8 (Spirit) Redirect stub — migrated to pitimon/claude-governance v3.1.0+ on 2026-05-02 (ADR-012). Install that plugin for the canonical 9-obligation checklist.
/ai-dev-log H4 + H1 Generate AI-assisted dev log from git history for audit trail

Use Cases: Which Skill When?

Start from your situation, not the skill name.

I want to... Start with Then Habit
Build a new feature from scratch /requirements /design/breakdown H2: Define done first
Review code before committing /review-ai /security-check if needed H4: Never skip review
Understand an unfamiliar codebase /research /build-brief H5: Read before writing
Deploy to production /deploy-guide /monitor-setup H1: Staging first
Assess overall project health /cross-verify /whole-person-check All 8 habits
Fix a production bug /build-brief Reproduce first H5: Understand first
Something feels off about a plan /cross-verify Check dimension scores H1-H8
Learn the full workflow /workflow (guided walkthrough) All

Recommended Paths

Minimum Viable Discipline/requirements before building + /review-ai before committing. Two skills, biggest impact.

Full Workflow/research through /monitor-setup via /workflow. For new features or greenfield projects.

Quality Gate/cross-verify + /whole-person-check. For pre-PR or pre-release assessment.

For the full 15-situation map, see guides/situation-map.md.


The 8 Habits

Based on Stephen Covey's The 7 Habits of Highly Effective People + Habit 8 (The 8th Habit: From Effectiveness to Greatness), adapted for AI-assisted development from real experience building a production system over 910 man-day-equivalents.

Private Victory (Self-Management)

Habit Principle In Practice
H1: Be Proactive Act on what you can control Trace all callers of a bug fix, handle edge cases, update docs during implementation
H2: Begin with End in Mind Define done before starting Write success criteria and test plans before the first prompt
H3: Put First Things First Important over interesting Tests and CI gates prevent future crises — don't skip them for speed

Public Victory (Collaboration)

Habit Principle In Practice
H4: Think Win-Win Every interaction is a deposit Error messages that help, issue closures with rationale, actionable review feedback
H5: Seek First to Understand Read before you write Understand existing code and patterns before proposing changes
H6: Synergize Together > apart Human judgment + AI execution. Parallel agents for independent tasks

Renewal & Significance

Habit Principle In Practice
H7: Sharpen the Saw Invest in capability Monitor production, track tech debt, automate what you learned
H8: Find Your Voice From effective to significant Understand why before implementing. Share patterns. Empower others

The Maturity Model

Dependence → Independence → Interdependence → Significance
Stage Mindset AI Relationship
Dependence "AI writes my code" Blind acceptance, no review
Independence "I use AI as a tool" Selective adoption, human judgment
Interdependence "We build together" Complementary strengths, shared process
Significance "We empower others" Publishing patterns, raising the bar

Cross-Verification

The /cross-verify skill runs 17 questions across all 8 habits, with dimension mapping (Body/Mind/Heart/Spirit) and scoring bands.

Category Questions Dimensions Habits
Private Victory 8 questions Body, Mind H1 (scope), H2 (criteria), H3 (priority)
Public Victory 6 questions Mind, Heart H4 (feedback), H5 (understanding), H6 (synergy)
Renewal 3 questions Body, Spirit H7 (learning), H8 (meaning)

Scoring Bands: 15-17 (proceed) → 12-14 (address gaps) → 8-11 (revisit plan) → <8 (stop and rethink)

Confidence Levels (v1.9.0): For high-stakes reviews, mark each Pass as ✓V (Verified), ✓I (Inferred), or ✓U (Unverified) — inspired by Feynman's honest uncertainty principle.

Domain Packs: Optional question sets for API, Frontend, Infrastructure, AI/ML, and Mobile work.

Full checklist: guides/cross-verification.md


Whole Person Assessment

The /whole-person-check skill evaluates work across Covey's 4 dimensions — the plugin's unique differentiator. No other engineering tool covers all four.

Dimension What It Measures AI Strength
Body (Discipline) CI, tests, monitoring, quality gates Strong — AI excels
Mind (Vision) Architecture, ADRs, roadmap, tech debt Strong — AI excels
Heart (Passion) Craft quality, empathetic errors, UX, DX Weak — needs humans
Spirit (Conscience) Security-first, ethics, compliance, sharing Weak — needs humans

AI-assisted development systematically neglects Heart and Spirit. This assessment makes the gap visible so teams can compensate.

Worked Example: A REST API Feature

After building a user authentication API, /whole-person-check might produce:

Dimension Score Finding
Body 4/5 CI green, 85% coverage, but no load test
Mind 5/5 ADR documented, JWT vs session decision recorded
Heart 2/5 Error messages return raw 500s, no onboarding guide
Spirit 3/5 Input validation present, but no rate limiting or audit log

AI Blind Spot visible: Body and Mind scored high (AI's strength). Heart and Spirit scored low (needs human attention).

Action: Before shipping, add user-friendly error messages (Heart) and rate limiting with audit logging (Spirit). These are the gaps AI won't catch on its own.

Maturity Rubrics

3 levels per dimension (Reactive → Proactive → Significant): guides/whole-person-rubrics.md

Plugin's Own Progression

v1.2.0  ████████████░░░░░░░░  3.0   (honest reassessment after inflated 4.5)
v1.9.0  ██████████████████░░  4.5   (evidence grounding + integrity)
v2.0.0  ██████████████████░░  4.625 (orchestration + meta-system)
v2.1.0  ███████████████████░  4.75  (verification agent + research rigor)
v2.2.0  ████████████████████  5.0   (content validation + fitness functions)

Full self-assessment: SELF-CHECK.md


Agents

The plugin includes two specialized agents — read-only reviewers that analyze without modifying your code.

8-habit-reviewer

Deep cross-verification reviewer. Evaluates plans, implementations, or PRs against all 8 habits.

  • When it runs: Invoked by /cross-verify or manually via the Agent tool
  • What it produces: Score out of 17, dimension summary, failed items with file:line evidence
  • Tools: Read, Glob, Grep (read-only)

research-verifier

Source verification agent (v2.1.0). Validates every citation in a research brief.

  • When it runs: Automatically during /research Deep mode
  • What it produces: Verification report — Verified / Dead / Not Found / Redirected per source
  • Tools: Read, Glob, Grep, WebFetch (read-only)
  • Principle: Feynman standard — "The first principle is that you must not fool yourself"

Both agents use the sonnet model for fast, focused analysis.


Architecture

8-habit-ai-dev/
├── .claude-plugin/
│   ├── plugin.json                 # Plugin metadata (v2.15.0)
│   └── marketplace.json            # Marketplace listing
├── skills/                         # 18 skills (8 workflow + 10 standalone)
│   ├── research/SKILL.md           #   Step 0 → H5 (depth levels + modes)
│   ├── requirements/SKILL.md       #   Step 1 → H2
│   ├── design/SKILL.md             #   Step 2 → H8
│   ├── breakdown/SKILL.md          #   Step 3 → H3 (orchestration classification)
│   ├── build-brief/SKILL.md        #   Step 4 → H5 (context boundaries)
│   ├── review-ai/SKILL.md          #   Step 5 → H4 (4-level verdict)
│   ├── deploy-guide/SKILL.md       #   Step 6 → H1
│   ├── monitor-setup/SKILL.md      #   Step 7 → H7
│   ├── cross-verify/SKILL.md       #   All habits (17Q + dimension summary)
│   ├── consistency-check/SKILL.md  #   H5+H1: cross-artifact analyzer (v2.15.0, ADR-013)
│   ├── whole-person-check/SKILL.md #   H8: Body/Mind/Heart/Spirit
│   ├── security-check/SKILL.md     #   H1: OWASP security lens
│   ├── reflect/SKILL.md            #   H7: micro-retrospective + lesson persistence
│   ├── calibrate/SKILL.md          #   H8: maturity self-assessment → habit-profile.md
│   ├── using-8-habits/SKILL.md     #   H5+H8: onboarding + smart-routing mode (v2.14.0)
│   ├── eu-ai-act-check/SKILL.md    #   H1+H8: EU AI Act 9-obligation checklist
│   ├── ai-dev-log/SKILL.md         #   H4+H1: AI dev log from git history (Art. 11)
│   └── workflow/SKILL.md           #   Guided 7-step walkthrough
├── agents/
│   ├── 8-habit-reviewer.md         # Deep cross-verification agent
│   └── research-verifier.md        # Source verification agent (v2.1.0)
├── hooks/
│   ├── hooks.json                  # SessionStart hook registration
│   └── session-start.sh            # Workflow reminder + progress indicators
├── habits/                         # Reference content (loaded on-demand)
│   ├── h1-be-proactive.md          #   through h8-find-voice.md
│   └── ...                         #   (8 files, one per habit)
├── guides/
│   ├── cross-verification.md       # 17-point checklist detail
│   ├── whole-person-rubrics.md     # 4-dimension maturity rubrics
│   ├── integrity-principles.md    # 12 AI Integrity Commandments
│   ├── quick-reference.md          # 19 prioritized rules (scannable)
│   ├── situation-map.md            # 15 situations → right habit/skill
│   ├── orchestration-patterns.md   # Multi-agent orchestration (v2.0.0)
│   ├── templates/                  # Output templates
│   │   ├── prd-template.md         #   For /requirements
│   │   ├── adr-template.md         #   For /design
│   │   ├── task-list-template.md   #   For /breakdown
│   │   ├── review-report-template.md # For /review-ai
│   │   └── research-brief-template.md # For /research (v2.1.0)
│   └── cross-verify-packs/         # Domain question packs (5 questions each)
│       ├── api.md                  #   API development
│       ├── frontend.md             #   Frontend/UI
│       ├── infra.md                #   Infrastructure
│       ├── ai-ml.md                #   AI/ML systems
│       └── mobile.md               #   Mobile apps
├── tests/
│   ├── validate-structure.sh       # Structure validation (13 checks, pure bash)
│   └── validate-content.sh         # Content validation + fitness functions (v2.2.0)
├── docs/
│   └── adr/                        # Architecture Decision Records
│       ├── ADR-001-orchestration-patterns.md
│       ├── ADR-002-research-modes.md
│       └── ADR-003-content-validation.md
├── rules/
│   └── effective-development.md    # Auto-loaded Claude Code rules
├── CLAUDE.md                       # Plugin development guide
├── CONTRIBUTING.md                 # Skill authoring guide
├── SELF-CHECK.md                   # Meta cross-verification
└── README.md                       # This file

Design decisions:

  • Skills are empowering, not restrictive — reminders and tools, not blocking gates
  • Habit content loaded on-demand — skills reference habits/*.md only when invoked, keeping session context lean
  • Session hook under 300 tokens — light reminder with progress indicators, not a wall of text
  • Handoff contracts — each skill declares what it expects from its predecessor and produces for its successor
  • Definition of Done — every skill has 3-5 verifiable checkbox items
  • When to Skip — honest conditions prevent compliance theater (H8: contribution over compliance)
  • Output templates — structured formats for PRD, ADR, task list, review report, research brief
  • Dimension mapping — all 17 cross-verify questions tagged with Body/Mind/Heart/Spirit
  • Zero dependencies — pure markdown + bash. No npm, no pip, no runtime requirements

What's New in v2.15.0

Theme: spec-kit /analyze Inspiration — Cross-Artifact Consistency + Opt-In Spec Persistence (#165)

Inspired by github/spec-kit's /analyze pattern, adapted to our discipline-not-enforcement philosophy. Three workflow skills (/requirements, /design, /breakdown) gain an opt-in --persist <slug> flag that writes their output to docs/specs/<slug>/{prd,design,tasks}.md while preserving conversation SKILL_OUTPUT blocks (back-compat invariant). The new /consistency-check skill reads those files and runs 5 detection passes — Coverage, Drift, Ambiguity, Underspec, Inconsistency — emitting severity-graded findings with file:line citations.

  • Hybrid evaluation — deterministic when artifacts include FR-NNN/Decision-N/Task #N ID markers (recommended), LLM semantic with explicit warning when absent. PRD-EARS-2 back-compat preserved.
  • Read-only analyzerallowed-tools: ["Read", "Glob", "Grep"]. No gating, no enforcement (per plugin boundary doctrine — enforcement still belongs in claude-governance).
  • Self-applied dogfood — this release's own PRD/design/tasks/ADR live in docs/specs/consistency-check/. Run /consistency-check consistency-check to verify.
  • 5 alternatives considered — see ADR-013 for design rationale, flag-style argument convention attestation (/ai-dev-log, /calibrate precedents), and slug validation regex.

Skills count: 17 → 18. Closes #165.

What's New in v2.14.3

Theme: Post-Migration Cleanup + Validator Self-Discipline (#163)

Patch release closing post-v2.14.2 metadata drift and applying the 800-line file-size rule the validator enforces on skills to the validator itself.

  • ADR-012 metadata closureSELF-CHECK.md lines 103-104 reframed (described deleted files as if still present); ADR-012 status header upgraded with Implementation: field naming commit ed65b97 (v2.14.2) and metadata-closure date.
  • .gitignore hardening — created with /deep-project/ and /.claude/ entries to gate against accidental git add . of third-party plugin clones and Claude Code session artifacts. Working copies preserved locally.
  • tests/validate-content.sh trim — 831 → 793 lines via comment consolidation across Check 15, Check 19 sub-checks B/C/D/E/F/G, and F2/F3 sections. Logic untouched, total checks unchanged (10), PASS count preserved (205). Closes the credibility gap where the validator violated the 800-line rule it enforces.

Pattern: validator self-discipline — when a fitness function applies to the rest of the codebase, it applies to the validator too. Same shape as v2.14.1's "validator assertion, not checklist" principle.


What's New in v2.14.2

Theme: EU AI Act Migration Completion (cross-plugin coordination with pitimon/claude-governance)

Plugin-boundary correction. The EU AI Act compliance toolkit migrated to pitimon/claude-governance v3.1.0 (PR #26) per memory observation #233270 (2026-04-07): 8-habit-ai-dev = workflow discipline; claude-governance = compliance enforcement + framework mappings. EU AI Act compliance is a framework mapping, not a workflow step.

  • Removedskills/eu-ai-act-check/reference.md, docs/research/eu-ai-act-obligations.md, guides/eu-ai-act-mapping.md (now canonical in pitimon/claude-governance v3.1.0+)
  • Stubskills/eu-ai-act-check/SKILL.md rewritten as a redirect that names the canonical location, provides install + invocation examples, preserves NOT LEGAL ADVICE disclaimer, and links to ADR-012. Skill name remains valid in the catalog so existing cross-references resolve.
  • ADR-012 — completion-side migration ADR. ADR-005 marked Superseded. Establishes a reusable migration pattern.
  • Validator — Check 15 in tests/validate-content.sh rewritten as stub-mode + negative-restore assertions.
  • Cross-refs reframed — RESOLVER, using-8-habits, design Step 5, ai-dev-log, session-start hook, README skill table.

Pattern: smaller atomic deletion + stub + validator + ADR PR; cosmetic wiki/README badge cleanup deferred to a follow-up doc-only PR (precedent: pitimon/claude-governance PR #25 + #26 — local Markdown formatter rewrites tables on every Edit, producing 140+ lines of unrelated noise).


What's New in v2.14.1

Theme: README "What's New" Drift Guard (#157)

Patch release closing an external QA finding — same bug class as #124 (CHANGELOG pointer-fallback) and #141 (SELF-CHECK.md body drift) recurring on a sibling surface (README.md "What's New" section).

  • README.md backfill## What's New in v2.14.0 block restored (was missing — Check 19A passed silently because grep -q "v${current_version}" matched the badge URL releases/tag/v2.14.0 instead of asserting the section header). Architecture file tree backfilled with 4 missing skills (calibrate, using-8-habits, eu-ai-act-check, ai-dev-log) so it matches the declared 17-skill count. TOC anchor fixed (#whats-new-in-v220#whats-new-in-v2141) — broken since at least v2.3.0.
  • tests/validate-content.sh Check 19 sub-check G — anchored grep ^## What's New in v${current_version} in README.md. Mirrors sub-checks E + F mechanism from #144 (SELF-CHECK.md body freshness). Prevents the badge-URL false-positive class permanently.
  • Check 20 hardeningFind → Fix → Re-Verify literal name pinned + assert exactly 5 numbered steps (1-5) in skills/review-ai/SKILL.md Verification Phase. Closes the v2.14.0 self-disclosed follow-up.

Pattern: validator assertion, not checklist — when QA surfaces the same drift class across multiple releases, the fix is a fitness function. Same shape as v2.11.1 (CHANGELOG drift guard) and v2.13.1 (SELF-CHECK body freshness).

What's New in v2.14.0

Theme: TOH Framework Inspirations (milestone #15)

Three workflow-discipline imports from Toh Framework (an "AI-Orchestration Driven Development" framework for solo SaaS builders), filtered through plugin-boundary rule (3/10 candidates imported, 7 rejected with route-elsewhere reasoning).

  • SKILL_OUTPUT attribution lines (#151, PR #152) — visible [/<skill>] COMPLETE SKILL_OUTPUT:<type> directly above each HTML comment in 4 emitter skills (design, breakdown, requirements, review-ai). Status markers COMPLETE / PARTIAL / FAILED (text-only, no emoji). New validate-structure.sh Check 22 enforces format (BSD-safe via grep -B1). cross-verify parser unaffected. Inspired by Toh's Agent Announcement format.
  • Argument-driven smart-routing for /using-8-habits (#149, PR #154) — /using-8-habits "<intent>" returns ≤3 ranked skills + reasoning + alternatives + a single direct question instead of the full narrative tree. Reads ~/.claude/habit-profile.md for verbosity tier (dependence → independence → interdependence → significance) and recent ~/.claude/lessons/ for context. Activates existing argument-hint frontmatter — no new skill file. Inspired by Toh's /toh Smart Command (reshape: extend rather than wrap).
  • /review-ai Verification Phase (#150, PR #155) — Find → Fix → Re-Verify loop ending in a Verification Table (Finding | Severity | Fix Evidence | Status). Plugin boundary: section header reads "guidance only — NOT a hook"; three independent anchors (header + blockquote + step-5 prose) prevent future enforcement creep. New validate-content.sh Check 20 enforces all three anchors. Inspired by Toh's Test → Fix → Loop adapted as discipline guidance, not automated enforcement.

Companion proposal: pitimon/claude-governance#24 for the 7-file project memory persistence layer (out-of-boundary for this plugin).

What's New in v2.13.1

Theme: SELF-CHECK.md Body Freshness (#141)

  • SELF-CHECK.md body drift fixed (#142) — footer updated from Previous: 2.7.1 to Previous: 2.12.0 and 6 missing per-release rows added (v2.9.0 through v2.13.0). Plugin opens with "H8 Modeling: Follow the process always, no shortcuts when unwatched" — the 6-release silent gap contradicted the stated principle.
  • CONTRIBUTING.md § Version Bumping corrected (#143) — "Version lives in 3 files" → "4 files" and adds SELF-CHECK.md header to the list. The 4-file convention has been enforced by tests/validate-structure.sh since #106, but CONTRIBUTING.md never caught up.
  • tests/validate-content.sh Check 19 sub-checks E + F (#144) — CI invariant against SELF-CHECK.md body drift. Sub-check E asserts the footer references the tag immediately preceding plugin.json.version (derived from git tag -l "v2.*" | sort -V). Sub-check F asserts every v2.x tag has a matching ^- v<x.y.z>: row — no gaps. Prevents recurrence of the #141 drift class.
  • .github/workflows/validate.ymlfetch-tags: true + fetch-depth: 0 added to actions/checkout@v4 so CI can read the tag list for sub-checks E + F.

What's New in v2.13.0

Theme: Cross-Agent Discoverability (#137 + #135 + #136)

  • skills/RESOLVER.md — flat phrase-to-skill dispatcher so new users and non-Claude agents can find the right skill without knowing slash-command names. All 17 skills in 3 sections (Workflow / Assessment / Meta), ≤3 triggers each. Bidirectional coverage enforced by Check 20 in tests/validate-structure.sh.
  • llms.txt + AGENTS.md at repo root — Codex, Cursor, Windsurf, Aider, Continue, and LLM-based repo fetchers now have canonical entry points. llms.txt follows the llmstxt.org convention (flat doc-map with raw.githubusercontent.com URLs); AGENTS.md is the non-Claude operating protocol. Check 21 enforces both files exist and point to skills/RESOLVER.md + CLAUDE.md.
  • "Design Principle" section — cites Garry Tan's 2026 essay "Thin Harness, Fat Skills" as external validation of the bounded-hook + fat-skills pattern the plugin has always enforced (≤300 token session hook).
  • ADR-010 (Flat Skill Dispatcher) and ADR-011 (Cross-Agent Discoverability) — decision records with 6 options considered each.
  • Pattern extracted: "Bidirectional Validator for Canonical Cross-References" — when a new canonical artifact ships, its validator check should assert both directions (source→target AND target→source). See CHANGELOG.md v2.13.0 or ~/.claude/lessons/2026-04-22-cross-agent-discoverability-batch.md.

What's New in v2.12.0

Theme: Code-Symbol Grep Evidence (#133)

  • /research Evidence Standard — code-symbol verdicts require grep evidence — when an Audit-mode or Findings-table row's verdict matches /remove|dead|unused|transitional|safe to (drop|remove)/i on a code symbol (dep, module, function, exported type, file), the row must cite a grep-check showing consumers across the repo's source directories. Declaration-site citations (e.g. package.json:6) do not establish liveness. Closes a false-positive class where plausible-sounding "brand names differ, must be unrelated" reasoning passed Deep-mode verification with pristine citations (real-world: memforge neo4j-driver audit — neo4j-driver is the canonical Bolt client for Memgraph; would have broken production graph on first rebuild).
  • /research Step 4 clarification — one-line callout after the Deep-mode dispatch makes the verifier's scope explicit inline: citation integrity, not semantic correctness.
  • research-verifier agent scope clarifieddescription: frontmatter rewritten to say "citation-integrity verification agent" and to spell out what is out of scope. New ## Limit of Verification section inside the agent body defines in-scope vs. out-of-scope, and introduces a SEMANTIC-EVIDENCE-MISSING flag the verifier emits (without performing the grep itself) when a code-symbol verdict row lacks liveness evidence. The agent's execution behavior is unchanged — this is a documentation change preventing authors from over-trusting a passing Deep-mode gate.

What's New in v2.11.1

Theme: CHANGELOG Drift Guard (#124, #131)

  • tests/validate-content.sh Check 19 strengthened — 3 new FAIL-severity assertions close the pointer-fallback loophole that let v2.9.0 and v2.11.0 ship with stale changelogs despite 491/491 validators passing. Now asserts: root CHANGELOG.md contains ^## v<version> entry, wiki Changelog.md contains ^## v<version> entry (no pointer-to-CHANGELOG.md fallback), wiki badge latest-v<version>-blue match.
  • Backfill v2.9.0 + v2.11.0 in root CHANGELOG.md (was jumping v2.10.0 → v2.8.0) and v2.11.0 in wiki Changelog.md + badge bump.
  • Lesson persisted (~/.claude/lessons/2026-04-17-v2.11.0-changelog-drift-recurrence.md) as H7 capability pattern: when a QA surfaces the same drift class across 2+ releases, the fix is a validator assertion, not a checklist.

What's New in v2.11.0

Theme: Design Skill Pipeline Completion + Wiki Professional Redesign (#128, #127)

  • /design structured output block (#128) — adds SKILL_OUTPUT:design block, closing the only gap in the /requirements/design/breakdown/review-ai cross-skill handoff chain. /cross-verify can now auto-check design decisions (Q4, Q14, Q16).
  • /design tech stack + Whole Person — decision list now includes language/runtime and framework selection. H8 Checkpoint expanded with Body/Mind/Heart/Spirit dimensions.
  • /design scope validation — new step 1b consumes SKILL_OUTPUT:requirements to verify design doesn't expand beyond agreed scope.
  • /design decision heuristic — H3-based guidance: split decisions affecting >3 layers, group decisions sharing trade-offs.
  • /research tech stack questions — Step 1 research questions now include technology evaluation and ecosystem trade-offs.
  • Wiki professional redesign (#127) — 20 pages upgraded: hero Home page, new Architecture and Maturity-Model pages, Skills-Reference expanded to 17 skills with quick-select matrix, Workflow-Overview with Mermaid diagram, all Step pages with > [!IMPORTANT] checkpoints.

What's New in v2.10.0

Theme: Progressive-Disclosure SKILL.md Split (#125, ADR-009) — refactor the 3 largest skills into SKILL.md + reference.md + examples.md triads, creating headroom for future features without breaking the F3 word-budget fitness function.

  • using-8-habits split — SKILL 1990w → 1108w; reference.md (17-skill inventory + cross-plugin composition tables); examples.md (password-reset onboarding walkthrough).
  • eu-ai-act-check split — SKILL 1989w → 908w; reference.md (full 9-obligation checklist with 60 tier-tagged items and paragraph references).
  • calibrate split — SKILL 1774w → 1161w; reference.md (scoring rubric + profile-write procedure); examples.md (4 sample profiles, one per maturity level).
  • ADR-009 codifies the convention — inline Load ${CLAUDE_PLUGIN_ROOT}/skills/<name>/<file>.md directives enforce lazy loading. Check 8 (structure validator) already hard-fails on broken sibling references — no new existence check needed.
  • F6 sibling word-budget soft limit — new Check 9b warns (not fails) when reference.md or examples.md exceeds 5000 words, preventing unbounded growth without blocking legitimate reference material.
  • Content validator triad-awareness — Checks 15 and 18 now search the triad as a unit, so content moved to sibling files still satisfies anti-drift and tier-count assertions.

What's New in v2.9.0

Theme: Claude Code Architecture Insights — production patterns from Anthropic's Claude Code internals (reverse-engineered by Alejandro Balderas) adapted into plugin workflow guidance.

  • /build-brief context compression awareness (#114) — new step 6 "Context survival" guides users to front-load critical info, keep briefs under ~4,000 tokens, and order stable content before volatile content. Based on Claude Code's 4-layer context compression pipeline (Ch5) and prompt cache stability pattern (Ch17).
  • /design sticky latch principle (#116) — new step 5 "Identify sticky decisions" with rework-level classification table. Decisions marked STICKY (>50% rework to reverse) require a new /design cycle to change, preventing costly mid-session pivots. Inspired by Claude Code's sticky boolean latches for prompt cache stability.
  • /reflect lesson consolidation (#113) — new Step 7 "Consolidation check" nudges when lesson files exceed 10. New /reflect consolidate argument runs a 4-phase cycle (Orient → Gather → Consolidate → Prune) inspired by Claude Code's auto-dream memory consolidation. Human approval gate before any deletions.
  • /breakdown fork agent pattern (#115) — new step 5 "Token-efficient parallel design" with prompt prefix sharing guidance. Parallel tasks sharing context achieve ~90% input token savings via cache hits. Most valuable for 3+ parallel tasks.

What's New in v2.7.1

Theme: Review Discipline Refinement — small post-milestone patch on top of v2.7.0, adding two review-time disciplines to /review-ai after a cost/benefit audit against addyosmani/agent-skills (MIT). Only the one genuine gap was imported; five other candidates were evaluated and rejected as duplicative or out-of-scope.

  • /review-ai Performance axis (#110, PR #111) — fourth review category alongside Security/Quality/Completeness. Flags N+1 queries, unbounded loops, missing pagination, unindexed queries, and memory leaks. Same file:line evidence standard as other axes.
  • /review-ai review-tests-first directive — new Process step 2 directs the reviewer to open the new or changed test files before judging the implementation. Tests declare intent; reading them first gives the specification to review against.
  • Rejections preserved in PRguides/anti-rationalization.md, guides/red-flags.md, guides/google-engineering-principles.md, /cross-verify Q18, and a cross-plugin hard-gate spec were all evaluated and rejected. Rationale archived in the PR #111 body so future "research hype" passes don't re-litigate them.

What's New in v2.7.0

Theme: Reader Adoption — closes the /calibrate feature loop (v2.6.0 wrote the profile, v2.7.0 makes the other 16 skills read it).

  • Hook-based verbosity adaptation (#96) — hooks/session-start.sh reads ~/.claude/habit-profile.md at session start and emits a level-specific directive into context. Skills auto-adapt from Dependence (full guidance) through Significance (minimal prompts) with zero changes to individual skill files — F3 word budget preserved
  • guides/verbosity-adaptation.md — canonical per-level adaptation rules with worked examples across 5 skill archetypes
  • tests/test-verbosity-hook.sh — 12-assertion regression coverage for all 8 hook branches plus HABIT_QUIET opt-out and ≤300-token budget check
  • preferences.verbosity-override in habit-profile schema — verbose promotes any level to Dependence, concise demotes to Significance
  • 4 validators in CI (up from 3) — total 482 assertions across validate-structure, test-skill-graph, validate-content, test-verbosity-hook
  • Milestone v2.7.0 CLOSED — Hermes-inspired feature loop (v2.6.0 + v2.7.0) complete; plugin at a local maximum given pure-markdown constraints

What's New in v2.6.0 / v2.6.1

Theme: Hermes-Inspired Improvements — user-modeling + learning-loop patterns filtered through plugin-boundary discipline.

  • /calibrate skill + habit-profile schema v1 (#90, ADR-008) — 5-7 question self-assessment that writes ~/.claude/habit-profile.md for other skills to adapt to. Dominant-level scoring across Dependence → Independence → Interdependence → Significance
  • guides/habit-profile-schema.md — public schema contract (YAML + markdown, schema-version: 1) defining the writer-reader API future skills code against
  • Persistent reflection artifacts (#88) — /reflect persists lessons to ~/.claude/lessons/YYYY-MM-DD-<slug>.md; /research and /build-brief search these before starting work. Closes the loop: reflect → persist → retrieve → apply
  • /reflect Q6 Skill Effectiveness signal (#92) — new 6th retro question naming "most useful" and "least useful/confusing" skill this session. Feeds maintainer-curated SKILL-EFFECTIVENESS.md trend tracker — H7 applied to the plugin itself
  • guides/habit-nudges.md + ADR-007 — nudge specification (hook implementation delegated to claude-governance per plugin boundary) + agentskills.io NO-GO decision record
  • Skill count: 16 → 17 (/calibrate added)

Earlier Versions

  • v2.5.0 (2026-04-09) — Testing & Discoverability: test-skill-graph.sh DAG validator, pre-commit.sh.example template, bidirectional wiki ↔ skills linking
  • v2.4.1 (2026-04-09) — Honest Correction: removed /brainstorm after finding superpowers:brainstorming is a better implementation; added HABIT_QUIET=1 opt-out; introduced "Core 5" tier (ADR-006)
  • v2.4.0 (2026-04-09) — Workflow Completions: /brainstorm (later removed), EARS-notation in /requirements, /using-8-habits onboarding meta-skill
  • v2.3.0 (2026-04-09) — EU AI Act Compliance Toolkit (flagship): /eu-ai-act-check 9-obligation tiered checklist, /ai-dev-log AI transparency, /design Article 14 human-oversight checkpoint, docs/research/eu-ai-act-obligations.md primary-source research, ADR-005, Plugin Boundary section
  • v2.2.0 (2026-04-07) — Body Dimension Level-Up: content validation (validate-content.sh), 3 architecture fitness functions, ADR-003
  • v2.1.0 (2026-04-07) — Multi-Agent Research (Feynman-inspired): research depth levels, research modes, research-verifier agent, ADR-002
  • v2.0.0 (2026-04-07) — Orchestration-Aware Development (ULW-inspired): task classification in /breakdown, context boundaries in /build-brief, ADR-001
  • v1.9.0 — Feynman-inspired: /research skill, evidence grounding, 12 AI Integrity Commandments, confidence levels, lazy parallelism gate

Full release history: CHANGELOG.md (v2.3.0+) · docs/wiki/Changelog.md (v2.2.0 and earlier) · GitHub Releases


Not a Checklist

Checklists create compliance theater — people tick boxes without understanding why.

These are principles that change how you think:

  • You don't "apply H5" — you develop the instinct to read before writing
  • You don't "check H3" — you naturally prioritize tests over gold-plating
  • You don't "follow H8" — you genuinely ask "does this help someone?"

The cross-verification exists for planning reviews, not as a gate for every commit.


Origin

This framework was developed while building MemForge — a production AI memory system with 15 services, 154K+ observations, and a 3-node Docker Swarm cluster. Over 910 man-day-equivalents of AI-assisted development, these habits emerged from real mistakes:

  • H1: A deploy bypassed staging and went straight to production (now there's a mandatory staging-first rule)
  • H4: Code reviews were skipped "just this once" — 2 CRITICAL and 3 HIGH issues shipped (now review-before-commit is enforced)
  • H7: Monitoring was the weakest step across 3 projects — a systematic blind spot we only caught through cross-project analysis
  • H5: A database password mismatch crashed production because nobody validated the .env file before deploying

Every habit in this plugin exists because skipping it caused real damage.


FAQ

Q: Do I need to use all 17 skills for every task? No. Start with /requirements before building and /review-ai before committing. Those two alone eliminate most Vibe Coding problems. Add more skills as they feel natural. See Use Cases.

Q: What is "Vibe Coding"? Building software by feel — jumping straight to "build me X" without requirements, design, or review. AI tools amplify this tendency because they make coding feel effortless. This plugin provides structure without removing speed.

Q: How is this different from a linter or CI tool? Linters check syntax. CI checks tests. This plugin checks process — did you define success criteria? Did you review before committing? Did you consider security? It operates at the planning and judgment layer, not the code layer.

Q: What does "ทำเสร็จ ≠ ทำดี" mean? Thai: "Done is not done well." Completing a task (ทำเสร็จ) is not the same as completing it with quality (ทำดี). This principle is the plugin's core identity — speed without discipline creates debt.

Q: Can I use this without Claude Code? The skills require Claude Code's plugin system. However, the underlying principles work with any AI tool. See Alternative Setup to load just the rules file, or read the habits/ files as standalone guides.

Q: What are the "Whole Person dimensions"? Covey's model: Body (discipline/quality), Mind (vision/architecture), Heart (passion/craft/empathy), Spirit (conscience/ethics/security). AI excels at Body and Mind but systematically neglects Heart and Spirit. See Whole Person Assessment.

Q: How do the agents work? Agents are read-only reviewers that run inside Claude Code. They analyze your code and produce reports but never modify files. See Agents.

Q: Is this plugin opinionated? Yes, deliberately. The opinions come from 910 man-day-equivalents of production AI-assisted development. Every rule exists because skipping it caused real damage. See Origin.


Glossary

Term Definition
Vibe Coding Building software by feel without structured process. The anti-pattern this plugin addresses.
Handoff Contract What a skill expects from its predecessor and produces for its successor. Ensures workflow continuity.
AI Blind Spot The systematic tendency of AI tools to excel at Body/Mind dimensions while neglecting Heart/Spirit.
Whole Person Model Covey's 4-dimension framework: Body (discipline), Mind (vision), Heart (passion), Spirit (conscience).
Cross-Verification 17-question checklist covering all 8 habits with dimension mapping and scoring bands.
Confidence Level Marking a cross-verify answer as Verified (V), Inferred (I), or Unverified (U). From Feynman principle.
Domain Pack Optional 5-question set for specific domains (API, Frontend, Infra, AI/ML, Mobile). Scored separately.
Definition of Done 3-5 verifiable checkbox items that define when a skill's output is complete.
When to Skip Honest conditions under which a skill is genuinely unnecessary. Prevents compliance theater.
Orchestration Multi-agent task dispatch patterns: sequential, parallel-safe, parallel-worktree (v2.0.0).
Fitness Function Automated metric tracking architecture health. Breach fails the CI build (v2.2.0).
Maturity Model Progression: Dependence → Independence → Interdependence → Significance.
Private Victory Habits 1-3: self-management (proactive, end-in-mind, first-things-first).
Public Victory Habits 4-6: collaboration (win-win, understand-first, synergize).
ทำเสร็จ ≠ ทำดี Thai: "Done is not done well." The plugin's core philosophy.

Alternative Setup (Without Plugin)

If you prefer not to install the plugin, you can use the rules file directly:

# Install rules only (no skills, no hooks)
mkdir -p ~/.claude/rules
curl -sL https://raw.githubusercontent.com/pitimon/8-habit-ai-dev/main/rules/effective-development.md \
  -o ~/.claude/rules/effective-development.md

This auto-loads the 8-Habit principles into every Claude Code session without the skills or hooks.


Contributing

Found a habit that worked (or broke) in your AI-assisted development? PRs welcome.

See CONTRIBUTING.md for the full guide — skill authoring conventions, blank templates, and quality checklist.

Quick options:

  • Add a new skill — follow the template in CONTRIBUTING.md
  • Add real examples to habits/*.md files
  • Add domain question packs in guides/cross-verify-packs/
  • Report issues at GitHub Issues

License

MIT


Version: 2.15.0 | Last updated: 2026-05-03

Packages

 
 
 

Contributors

Languages