Skip to content

GSD / Get Shit Done

GSD is a context-engineering and multi-agent execution framework focused on shipping velocity under the practical limits of AI context windows.

The original repository provided by many references now points to the maintained continuation under open-gsd/gsd-core. When evaluating it today, use the maintained repository and architecture docs.

If Spec Kit is a spec compiler and AWS AI-DLC is a governance cockpit, GSD is a delivery factory for agents.

mermaid
flowchart TB
    U[User goal] --> P[.planning project memory]
    P --> R[Research agents]
    P --> PL[Planning agents]
    PL --> T[Phase tasks]
    T --> E1[Executor agent 1]
    T --> E2[Executor agent 2]
    T --> E3[Executor agent N]
    E1 --> V[Verification]
    E2 --> V
    E3 --> V
    V --> S[Ship]
    S --> P

Mental model

GSD assumes a long chat is not a reliable project brain. Instead:

  • Project state should live in files.
  • Main context should stay light.
  • Research, planning, execution, and verification should use separate agents.
  • Independent tasks can run in parallel.
  • Shipping should update project memory and produce clear commits or PRs.

Artifact model

GSD typically uses .planning/ as durable project memory.

ArtifactRole
.planning/PROJECT.mdProject purpose and context
.planning/REQUIREMENTS.mdRequirements
.planning/ROADMAP.mdMilestones and phases
.planning/STATE.mdCurrent status
.planning/config.jsonMode, model profile, parallelism, quality agents
Phase artifactsDiscussion, plan, tasks, and verification for a phase

Workflow

mermaid
sequenceDiagram
    participant U as User
    participant M as Main Agent
    participant S as Subagents
    participant P as .planning
    participant R as Repo

    U->>M: Start project
    M->>P: Create project memory
    U->>M: Discuss phase
    M->>S: Research and analysis
    S->>P: Record findings
    U->>M: Plan phase
    M->>P: Create phase tasks
    U->>M: Execute phase
    M->>S: Dispatch executor agents
    S->>R: Implement tasks
    S->>P: Update state
    U->>M: Verify work
    M->>S: Run verification agents
    S->>R: Test and review
    U->>M: Ship
    M->>R: Commit, PR, or release artifact

Strengths

  1. It directly attacks context rot. State is externalized to .planning/, and subagents get fresh context.
  2. It supports long-running projects. Work can continue across many sessions without relying on chat memory.
  3. It enables parallel execution. Independent tasks can be split across executor agents.
  4. It separates roles. Researcher, planner, executor, checker, and verifier are different mental roles.
  5. It is optimized for shipping. The workflow does not stop at documentation; it moves toward verified output.

Weaknesses

WeaknessPractical consequence
Automation-heavyToo many subagents can create broad changes that are hard to review
Not governance-firstIt needs extra process for compliance, security, and formal approval
Command surface is largeUsers must learn the workflow and configuration
Tooling/version riskTrack the maintained repo and package security
Source-of-truth conflicts.planning/ can conflict with specs, ADRs, or aidlc-docs/

Best use cases

Use caseWhy GSD fits
Solo founder building an app over days or weeks.planning/ preserves memory and roadmap
Startup team optimizing shipping speedPhase execution and parallel agents improve throughput
Many independent tasksSubagents can execute in parallel
Migration split into phasesRoadmap and state files track progress
Long-running AI projectLess dependence on one long conversation

When to avoid it as primary

Prefer another primary workflow when:

  • Compliance and audit are the main concern: use AWS AI-DLC.
  • Specification correctness is the main concern: use Spec Kit.
  • The task is small and quality discipline is the main need: use Superpowers.
  • The team cannot review parallel changes safely.

Example: forgot password

GSD would structure the work as a phase:

Phase stepOutput
DiscussFlow, security constraints, email provider, success criteria
PlanBackend, frontend, email, tests, and verification tasks
ExecuteSubagents implement model/API, UI, email template, tests
VerifySeparate review for security, tests, and UX
ShipCommit/PR and update .planning/STATE.md

The value is not the spec alone. The value is orchestration and context stability.

Expert implementation guide

Use GSD when your delivery problem is not just "what should we build?" but "how do we keep AI productive over many sessions and tasks without context collapse?"

Step 1: Decide whether GSD should be primary

GSD should be primary when:

  • The project has multiple phases.
  • The work spans several days or weeks.
  • Many tasks are independent.
  • You want subagents to research, plan, execute, and verify.
  • You can review generated changes safely.

Do not make GSD primary if compliance approval, not execution throughput, is the central problem.

Step 2: Map existing code first

For brownfield projects, run codebase mapping before creating a new GSD project context. The maintained GSD README recommends this pattern for returning or existing codebases.

text
/gsd-map-codebase
/gsd-new-project

Mapping should capture:

AreaWhat to extract
StackFrameworks, languages, package managers
ArchitectureModules, boundaries, data flow
ConventionsNaming, testing, linting, PR style
Risk areasAuth, billing, migrations, shared components
Test commandsFast tests, full tests, e2e tests

Step 3: Create project memory

/gsd-new-project should create or update .planning/ artifacts:

  • Project goal.
  • Requirements.
  • Roadmap.
  • Current state.
  • Configuration.

Treat .planning/ as operational memory, not marketing documentation. It should be concise enough for agents to reload and accurate enough for humans to trust.

Step 4: Run the six-command loop

mermaid
flowchart LR
    A["/gsd-new-project"] --> B["/gsd-discuss-phase N"]
    B --> C["/gsd-plan-phase N"]
    C --> D["/gsd-execute-phase N"]
    D --> E["/gsd-verify-work N"]
    E --> F["/gsd-ship N"]

The loop works best when each phase has a single delivery outcome. Avoid phases like "improve the app"; prefer "add password reset email request and token verification flow".

Step 5: Tune .planning/config.json

The maintained GSD docs describe several important dials. Use them intentionally.

DialConservative settingAggressive settingWhen to use
modeinteractiveyoloUse interactive for production; yolo only for low-risk prototypes
Model profilequalitybudgetUse quality for design/risk; budget for repetitive tasks
Research agentOnOffTurn on for unfamiliar domains/code
Plan checkerOnOffTurn on before parallel execution
VerifierOnOffKeep on unless tests are trivial
ParallelizationOffOnTurn on only for independent tasks

Step 6: Parallelize only with hard boundaries

Good parallel tasks:

  • Backend endpoint and frontend copy updates.
  • Independent test suites.
  • Documentation and isolated UI component.
  • Migration script and read-only analysis.

Bad parallel tasks:

  • Two agents editing the same component.
  • Database schema and API contract without a shared plan.
  • Auth changes across shared middleware.
  • Cross-cutting refactor with unclear ownership.

Use this rule:

If two tasks may edit the same file or depend on the same design decision, do not run them in parallel.

Step 7: Verification is not optional

/gsd-verify-work should include:

  1. Automated test commands.
  2. Manual acceptance checklist.
  3. Diff review.
  4. Security/regression review for risky areas.
  5. Update to .planning/STATE.md.

Skipping verification turns GSD into an acceleration tool for technical debt.

Optimization playbook

For solo builders

Use GSD to maintain momentum:

  • Keep phases short.
  • Use interactive mode until trust improves.
  • Use parallelism for docs/tests/UI polish, not core architecture.
  • Ship after each verified phase.

For teams

Add team controls:

  • Require PR review before /gsd-ship.
  • Keep .planning/ROADMAP.md aligned with issue tracker.
  • Limit phase size to reviewable diffs.
  • Record test evidence in PR descriptions.

For enterprise contexts

GSD can still help, but add governance:

  • AI-DLC or internal review for high-risk phases.
  • Security review for auth/data/infra changes.
  • Architecture approval before parallel work.
  • CI gating before merge.

Definition of done for GSD

A phase is done when:

  1. Phase decisions are captured.
  2. Plan has been checked.
  3. Parallel tasks did not conflict.
  4. Verification passed.
  5. Manual acceptance checklist is complete.
  6. .planning/STATE.md is updated.
  7. PR/commit scope is understandable.
  8. Next phase is clear.

Built as a static bilingual AI engineering stack guide.