Open Source Framework

AI-Scrum

A Scrum-inspired framework for human-AI collaboration,
optimized for GitHub Copilot CLI.

What This Is

🤖

AI Agent Team

A lead agent manages the sprint cycle — planning, execution, retros. It dispatches specialized sub-agents for code, tests, security, docs, and reviews. You're the stakeholder — you set direction, not details.

🔄

Real Scrum, Not Theater

Sprint Planning, Daily Huddles, Sprint Review, Retrospectives — all adapted for human-AI dynamics. Each ceremony has a purpose. Nothing exists because "that's how Scrum works."

📋

Template Repository

Fork it, customize the placeholders, and you have a fully configured AI development process. Agents, skills, process docs, sprint templates — everything included.

Two Modes

Start guided to learn the framework. Graduate to autonomous when you trust the process.

🎯

Guided

You're at the keyboard. You trigger each ceremony, review AI decisions, approve scope. Interactive sessions — the AI works while you watch.

🤖

Autonomous

Start a sprint and walk away. The AI runs for hours — planning, coding, testing, reviewing. You come back to finished work and a push notification.

The Problem

AI coding assistants are powerful. But without structure, they create as many problems as they solve.

😵

No Memory

Every session starts from zero. The AI doesn't know what you did yesterday, what decisions were made, or what failed last sprint. Context is lost every time.

🏃

Feature Chasing

The AI happily starts new features while old ones are half-done. No discipline, no focus, no "finish what you started." Your backlog grows while nothing ships.

"It Should Work"

The AI says "done" without running tests. It claims success without evidence. You merge confidently — and discover the bug in production.

🤷

No Process

Ad-hoc prompting produces ad-hoc results. Without a defined workflow, there's no way to improve. Every session is a fresh experiment.

AI-Scrum solves this.

Sprint logs preserve memory. Quality gates enforce evidence. The backlog enforces focus. Retros improve the process itself. Every sprint is better than the last.

How It Compares

Ad-hoc Prompting Spec-Driven Development AI-Scrum
Approach Chat until it works Write full spec upfront, AI implements Iterative sprints with feedback loops
Planning None Heavy upfront — spec, tests, architecture Just-in-time — ICE scoring, sprint-sized
Memory None — context lost each session Spec is the memory Sprint logs, issue comments, velocity data
Quality Hope for the best Test-first from spec Quality gates, DoD, CI before merge
Adapts to change Instantly — no plan to break Slowly — spec rewrite needed Via backlog — changes are welcomed, not feared
Process improves ❌ No ❌ Spec improves, process doesn't ✅ Every retro improves tools + workflow
Best when Quick prototypes, one-offs Well-understood problems, clear specs Ongoing projects, evolving requirements

Spec-driven works great for one-shot tasks with clear requirements. AI-Scrum is for projects that live and evolve — where you need sustained quality, not just initial delivery.

Built for GitHub Copilot CLI

Every file in the framework maps to a Copilot CLI feature. No custom tooling needed.

.github/copilot-instructions.md
Auto-loaded by Copilot CLI on every session. Project rules, safety constraints, agent dispatch — the AI reads this before doing anything.
.github/agents/*.agent.md
Custom agents invokable via @agent-name. Each agent has a role, specialized instructions, and domain expertise. 11 agents included.
.github/skills/*/SKILL.md
Skills auto-loaded by Copilot CLI when relevant. Sprint ceremonies, orchestration pipelines, code review — 20 skills ready to use.
Plan mode built-in
Copilot CLI's native planning mode (Shift+Tab). Creates structured implementation plans before touching code. The framework integrates with it.
Built-in agents built-in
The framework dispatches to explore, task, code-review, and general-purpose agents — Copilot CLI's native agents with full toolsets.
SQL database built-in
Per-session SQLite for todo tracking, batch operations, and sprint state. The framework uses it for structured execution without external tools.

The Operating Model

Two roles. Clear boundaries. No ambiguity about who decides what.

👤

You (Stakeholder)

  • Set strategic direction
  • Review sprint deliverables
  • Accept or reject at sprint review
  • Provide domain context when asked
  • Create issues for new ideas
Escalation
Boundary
🤖

AI Agent Team (PO + SM + Devs)

  • Lead agent: triage, plan, execute, retro
  • Sub-agents: code, tests, docs, security, review
  • Built-in agents: search, commands, code review
  • Continuous process improvement
  • Escalate only on MUST criteria

The AI-Scrum Manifesto

Inspired by the Agile Manifesto (2001) — adapted for human-AI collaboration.

Autonomous execution
over
constant approval
Verified evidence
over
claimed completion
Sprint discipline
over
feature chasing
Continuous improvement
over
static workflows

That is, while there is value in the items below, we value the items above more.

12 Principles Behind the Manifesto

01

The best architecture emerges from small, tested diffs — not grand rewrites.

02

Quality gates are non-negotiable. Every feature gets tests. Every PR gets CI. Every claim gets verification.

03

The human brings judgment; the agent brings throughput. Play to each side's strengths.

04

Documentation is not overhead — it's memory. The agent has no memory between sessions.

05

Escalation is a feature, not a failure. MUST-escalate criteria protect the project from autonomous overreach.

06

Process improvements compound. Each retro makes the next sprint better. Sprint 10 is radically smoother than Sprint 1.

07

Evidence before assertions, always. Never say "done" without proof. Trust, but verify — on both sides.

08

The backlog is sacred. Ideas don't get lost — they get queued. If it's not an issue, it doesn't exist.

09

Velocity is descriptive, not prescriptive. Track it to understand capacity. Don't use it to pressure.

10

The agent is not a junior developer — it's a different kind of collaborator. Design the process for what it actually is.

11

Welcome scope changes — route them through the backlog, not mid-sprint pivots.

12

Simplicity — maximize the work not done. Every line of code is a liability; every automation is an asset.

The Sprint Cycle

Five ceremonies. Each has a slash command. Human-triggered or fully autonomous — your choice.

1

Refinement

Stakeholder drops ideas. AI researches, decomposes into concrete issues with acceptance criteria.

/refine
2

Sprint Planning

AI triages backlog, scores issues (ICE), selects scope, assigns labels and milestones.

/sprint-planning
3

Sprint Execution

AI implements issues with quality gates. Huddle after each issue. Tests required for every feature.

/sprint-start
4

Sprint Review

Demo deliverables, metrics, velocity tracking. Stakeholder accepts or requests changes.

/sprint-review
5

Sprint Retro

What went well, what didn't. Process improvements. The process that improves itself.

/sprint-retro

What's Included

🤖 11 Custom Agents

code-developer test-engineer documentation-agent security-reviewer research-agent architect release-agent challenger ci-fixer copilot-customization-builder + your own

⚡ 20 Skills

sprint-planning sprint-start sprint-review sprint-retro orchestrate-feature orchestrate-bugfix code-review create-pr tdd-workflow architecture-review direction-gate issue-triage web-research writing-plans + 5 more

📜 Process Constitution

Stakeholder Model with escalation rules
ICE scoring for prioritization
Definition of Done checklist
Label-based status tracking
Milestone-based sprint grouping
Sprint documentation templates

🛡️ Quality Gates

Verification before completion
Challenger agent for adversarial review
Direction gate for strategic pivots
CI gate before merge
YAGNI + Boy Scout principles
Evidence-based assertions only

Getting Started

0

Prerequisites

You need an active GitHub Copilot subscription (Pro, Pro+, Business, or Enterprise) and Copilot CLI installed.

# Install Copilot CLI
brew install copilot-cli          # macOS / Linux
winget install GitHub.Copilot     # Windows

# Enable experimental mode (required for Autopilot)
copilot --experimental

Autopilot mode lets the agent continue working until a task is complete. Activate it with Shift+Tab to cycle to Autopilot mode inside a session. This is especially important for the autonomous variant.

1

Use the Template

# Guided (interactive, you steer)
gh repo create my-project \
  --template trsdn/copilot-scrum-guided \
  --public --clone

# Autonomous (multi-hour, unattended)
gh repo create my-project \
  --template trsdn/copilot-scrum-autonomous \
  --public --clone
2

Replace Placeholders

Search for {{PROJECT_NAME}}, {{PROJECT_DESCRIPTION}}, and {{OWNER}} across all files and replace with your values.

3

Create Labels

gh label create "status:planned" --color "0E8A16"
gh label create "status:in-progress" --color "FBCA04"
gh label create "status:validation" --color "1D76DB"
gh label create "priority:high" --color "B60205"
gh label create "priority:medium" --color "F9D0C4"
gh label create "priority:low" --color "C5DEF5"
gh label create "type:idea" --color "D4C5F9"
4

Drop Ideas & Start

# Create an idea:
gh issue create --title "idea: Your feature" \
  --label "type:idea" --body "Brief description"

# Refine ideas into concrete issues:
/refine

# Plan and run your first sprint:
/sprint-planning

The AI refines your ideas into concrete issues, then plans and executes sprints.

The Process Constitution

Five principles, ranked by priority. When they conflict, higher beats lower.

0

Stakeholder Authority

The human decides what and why. The AI decides how. Stakeholder-created issues must not be deprioritized, descoped, or closed without approval. ICE scoring is advisory — human intent overrides it.

1

Protect Focus

Complete what you start. New ideas go to the backlog, not into the current sprint. Never abandon work-in-progress to chase shiny objects.

2

Quality Gates

Every change is tested, reviewed, and verified before merge. No exceptions. CI must be green. Evidence must exist.

3

Small, Testable Diffs

One feature per PR. ~150 lines ideal, 300 max. Config changes over code changes. Each PR is independently shippable.

4

Continuous Improvement

Every retro evaluates the process itself — agents, skills, workflows. Friction gets automated. Failures get root-caused.

Escalation Model

Clear rules for when the AI decides alone and when it must ask.

MUST Escalate

Stop and wait
  • Strategic direction changes
  • ADR creation or modification
  • Constitution amendments
  • New dependencies
  • Production deployments
  • Spending or resource decisions
  • Sprint scope drift (>2 unplanned issues)
  • Modifying stakeholder intent (descoping, deprioritizing, closing stakeholder issues)
⚠️

SHOULD Escalate

Notify, continue if no response
  • Sprint scope > 8 issues
  • Deprioritizing high-priority items
  • Significant refactoring (>500 LOC)
  • Closing >5 stale issues at once

Autonomous

No escalation needed
  • Sprint planning + execution
  • Code, tests, documentation
  • Bug fixes with regression tests
  • Config changes, label hygiene
  • Agent/skill improvements
  • Issue creation for new work

Drift Control

Autonomous execution needs guardrails. Individual commits can be valid while the project silently drifts off course.

🔒

Sprint Scope Lock

Only planned issues may be executed. Discovered work goes to backlog — never into the current sprint. If >2 unplanned issues are created, the AI must escalate.

📋

Drift Check at Every Huddle

After each issue: Was this planned? Are files related to sprint scope? Is the goal still achievable? If any check fails — stop and escalate.

📊

Sprint Boundary Review

At sprint review: holistic git diff --stat across all changes. Flag files that don't relate to any sprint issue. Report planned vs unplanned work.

Definition of Done

Every issue must meet ALL applicable criteria before closing. No shortcuts.

Acceptance criteria defined on the issue before implementation starts
Code implemented, lint clean, types clean
Unit tests written — min 3 per feature: happy path, edge case, parameter effect
Tests verify actual behavior changes, not just "runs without error"
If bugfix: regression test that FAILS before fix, PASSES after
PR created, code-reviewed, squash-merged
CI green before merge — wait 3-5 min, verify with gh run list
Issue closed with summary comment, status labels removed
Worktree cleaned up

Issue-Driven Development

GitHub Issues are the only task system. No external trackers, no internal todo lists.

Backlog
Open issue, no status label
Planned
status:planned
In Progress
status:in-progress
Validation
status:validation
Done
Issue closed

ICE Prioritization

Score = Impact × Confidence ÷ Effort (each 1-3)

≥ 4 priority:high
2-3 priority:medium
< 2 priority:low

Sprint Grouping

Milestones replace board columns. Full history preserved.

gh issue edit 42 --milestone "Sprint 5"
gh issue list --milestone "Sprint 5"
gh issue list --milestone "Sprint 5" --state closed

Documentation as Memory

The AI has no memory between sessions. These artifacts are how continuity survives.

📋

Sprint Log

Huddle decisions, learnings, blockers — created at sprint start, updated after each issue.

docs/sprints/sprint-N-log.md
📊

Velocity Data

Sprint-over-sprint performance tracking. Created at retro, used at planning.

docs/sprints/velocity.md
💬

Issue Comments

Traceable audit trail per issue. Huddle results posted as comments.

GitHub Issue #N
🏛️

ADRs

Architectural Decision Records. Immutable unless stakeholder approves change.

docs/architecture/ADR.md
The Huddle Rule: Results must be documented in two places — comment on the issue (audit trail) and sprint log (retro context).

Choose Your Style

Start guided, graduate to autonomous. Same process, same constitution — different level of human presence.

🎯

Guided

Interactive. Step by step.

You trigger each ceremony, review results, approve scope. Sessions last as long as you're engaged.

Best for: Learning AI-Scrum, team projects, hands-on control
Use Guided Template →
🤖

Autonomous

Runs for hours. Unsupervised.

Start a sprint and walk away. The AI agent team runs for hours — you come back to finished work and a notification on your phone.

Best for: Solo devs, side projects, overnight runs
Use Autonomous Template →