Skip to content

Superpowers

Superpowers is a methodology and skill library for making AI coding agents behave more like disciplined software engineers. It emphasizes brainstorming, design approval, detailed planning, test-driven development, subagent execution, code review, worktrees, and branch finishing.

If Spec Kit is a spec compiler, AWS AI-DLC is a governance cockpit, and GSD is a delivery factory, Superpowers is an engineering discipline layer.

mermaid
flowchart TB
    A[Brainstorm] --> B[Design]
    B --> C[Plan]
    C --> D[Worktree]
    D --> E[TDD red-green-refactor]
    E --> F[Code review]
    F --> G[Finish branch]

Mental model

Superpowers does not try to own the entire project lifecycle. It gives the agent situational skills:

SituationSkill mindset
The request is vagueBrainstorm before committing to a solution
A change is non-trivialProduce and approve a design
Implementation is complexWrite a detailed plan
Behavior changesUse test-driven development
Work can be parallelizedUse subagent-driven development
Code is completeRequest code review
Branch work is doneFinish the development branch cleanly

Workflow

mermaid
flowchart TB
    U[User request] --> B{Clear enough?}
    B -->|No| BR[Brainstorming]
    B -->|Yes| D[Design]
    BR --> D
    D --> A{Human approves design?}
    A -->|No| BR
    A -->|Yes| P[Detailed plan]
    P --> T[TDD implementation]
    T --> R[Code review]
    R --> F{Issues found?}
    F -->|Yes| T
    F -->|No| S[Finish branch]

TDD as a core strength

Superpowers strongly emphasizes red-green-refactor:

mermaid
flowchart LR
    R[RED: write failing test] --> C{Fails for the right reason?}
    C -->|No| R
    C -->|Yes| G[GREEN: minimal code]
    G --> P{All tests pass?}
    P -->|No| G
    P -->|Yes| Ref[REFACTOR]
    Ref --> P

This matters because AI agents often sound confident without evidence. TDD shifts confidence from language to executable proof.

Strengths

  1. It prevents "code first, think later." The agent is pushed to brainstorm, design, and plan before editing many files.
  2. TDD reduces hallucinated correctness. Behavior is proven through failing and passing tests.
  3. Review is explicit. A separate review step catches issues the implementer missed.
  4. It is composable. You can use one skill for a small bug or several skills for a larger change.
  5. It is tool-agnostic in spirit. The method can be adapted across multiple coding agents.

Weaknesses

WeaknessPractical consequence
Not a full spec management systemUse Spec Kit if organization-wide SDD artifacts are needed
Not a governance frameworkUse AWS AI-DLC if audit, NFRs, and approval matrix matter
TDD has upfront costTeams not used to TDD may feel slowed down
Depends on correct skill useThe agent may need reminders or rules to activate the right skill
Not a deep project memory systemGSD is stronger for long-running, multi-phase memory

Best use cases

Use caseWhy Superpowers fits
Bug fix requiring regression testsTDD is the natural workflow
Feature in an existing codebaseDesign, plan, test, and review without full ceremony
RefactorTests and review reduce regression risk
Team wants higher AI qualitySkills directly shape agent behavior
Agent tends to guessBrainstorming and design approval force clarification

Example: forgot password

With Superpowers:

StepOutput
BrainstormClarify UX, security, token, email, and abuse cases
DesignAPI, model, flow, and edge cases
PlanFiles, tests, and implementation order
TDDFailing tests for expiration, invalid token, rate limit, no enumeration
ImplementMinimal code to pass tests
ReviewSecurity and quality review
FinishClean branch summary and verification

The core benefit is disciplined execution, not lifecycle governance.

Expert implementation guide

Use Superpowers when you want to change the agent's working habits: ask before coding, design before changing architecture, test before implementation, review before declaring done.

Step 1: Install only in the agents you actually use

Superpowers supports several agent harnesses and plugin systems. Install it where your team works daily, not everywhere at once.

Environment stylePractical guidance
Claude-style plugin marketplaceInstall the Superpowers plugin and verify skills are listed
Cursor-style agentAdd the plugin/rules and test with a small design task
CLI agentFollow the agent-specific install path, then ask it to list available skills
Mixed teamStandardize on one primary harness before scaling

After installation, run a tiny task and verify the agent uses the expected skill instead of jumping into code.

Step 2: Make skill activation explicit

For the first few weeks, do not rely on the agent to infer everything. Use direct prompts:

text
Use brainstorming first. Do not edit files until we approve the design.
text
Use test-driven development. Start by writing the failing test for the behavior.
text
Request a code review after implementation and fix all review findings before finalizing.

This trains the collaboration pattern for both humans and agents.

Step 3: Use the full workflow for non-trivial changes

mermaid
flowchart LR
    A[Brainstorm] --> B[Design approval]
    B --> C[Worktree]
    C --> D[Detailed plan]
    D --> E[TDD implementation]
    E --> F[Code review]
    F --> G[Finish branch]

For small changes, use a subset. For high-risk changes, use the full chain.

Step 4: Apply TDD correctly

Correct TDD:

  1. Describe the next behavior.
  2. Write a failing test.
  3. Confirm it fails for the right reason.
  4. Write minimal implementation.
  5. Run the test.
  6. Refactor only after green.
  7. Repeat for the next behavior.

Incorrect TDD:

  • Write implementation first, then tests.
  • Write tests that mirror implementation details.
  • Accept a failing test without checking the failure reason.
  • Skip refactor because "AI code looks fine".

Step 5: Use worktrees for risk isolation

Use worktrees when:

  • The change may touch many files.
  • You want a clean baseline.
  • Multiple agents may work in parallel.
  • You need to abandon an experiment safely.

Before implementation:

CheckReason
Clean git statusAvoid mixing unrelated work
Baseline tests passKnow whether failures are new
Branch name reflects taskEasier review and cleanup
Setup commands documentedSubagents can reproduce environment

Step 6: Treat review as a separate phase

Review should check:

Review layerQuestion
Spec complianceDid we build what was agreed?
Test qualityDo tests prove behavior, not implementation details?
Code qualityIs the change simple and maintainable?
ArchitectureDid boundaries remain intact?
RiskDid auth, data, migration, or operations risks change?

The strongest pattern is two-stage review: first compliance with plan/spec, then code quality.

Optimization playbook

For bug fixes

Use the shortest high-quality loop:

mermaid
flowchart LR
    A[Reproduce bug] --> B[Failing regression test]
    B --> C[Minimal fix]
    C --> D[Run tests]
    D --> E[Review diff]

For feature work

Use brainstorming, design, plan, TDD, review. If the feature is large, combine with Spec Kit for stronger artifact management.

For refactoring

Require:

  • Baseline tests pass before refactor.
  • Characterization tests for risky behavior.
  • Small commits or reviewable diff chunks.
  • No behavior change unless explicitly approved.

For teams

Create a team rule:

Any AI-generated behavior change must either start with a failing test or include a documented reason why TDD is not appropriate.

Definition of done for Superpowers

A task is done when:

  1. Design was clarified for non-trivial work.
  2. Plan was approved or the change was small enough to skip formal planning.
  3. Tests were written before or alongside implementation.
  4. All relevant tests pass.
  5. Code review was requested.
  6. Review findings were fixed or explicitly rejected with rationale.
  7. Branch/worktree is clean and ready to merge.

Built as a static bilingual AI engineering stack guide.