GSD / Get Shit Done

GSD is a context-engineering and multi-agent execution framework focused on shipping velocity under the practical limits of AI context windows.

The original repository provided by many references now points to the maintained continuation under open-gsd/gsd-core. When evaluating it today, use the maintained repository and architecture docs.

If Spec Kit is a spec compiler and AWS AI-DLC is a governance cockpit, GSD is a delivery factory for agents.

mermaid

flowchart TB
    U[User goal] --> P[.planning project memory]
    P --> R[Research agents]
    P --> PL[Planning agents]
    PL --> T[Phase tasks]
    T --> E1[Executor agent 1]
    T --> E2[Executor agent 2]
    T --> E3[Executor agent N]
    E1 --> V[Verification]
    E2 --> V
    E3 --> V
    V --> S[Ship]
    S --> P

Mental model

GSD assumes a long chat is not a reliable project brain. Instead:

Project state should live in files.
Main context should stay light.
Research, planning, execution, and verification should use separate agents.
Independent tasks can run in parallel.
Shipping should update project memory and produce clear commits or PRs.

Artifact model

GSD typically uses .planning/ as durable project memory.

Artifact	Role
`.planning/PROJECT.md`	Project purpose and context
`.planning/REQUIREMENTS.md`	Requirements
`.planning/ROADMAP.md`	Milestones and phases
`.planning/STATE.md`	Current status
`.planning/config.json`	Mode, model profile, parallelism, quality agents
Phase artifacts	Discussion, plan, tasks, and verification for a phase

Workflow

mermaid

sequenceDiagram
    participant U as User
    participant M as Main Agent
    participant S as Subagents
    participant P as .planning
    participant R as Repo

    U->>M: Start project
    M->>P: Create project memory
    U->>M: Discuss phase
    M->>S: Research and analysis
    S->>P: Record findings
    U->>M: Plan phase
    M->>P: Create phase tasks
    U->>M: Execute phase
    M->>S: Dispatch executor agents
    S->>R: Implement tasks
    S->>P: Update state
    U->>M: Verify work
    M->>S: Run verification agents
    S->>R: Test and review
    U->>M: Ship
    M->>R: Commit, PR, or release artifact

Strengths

It directly attacks context rot. State is externalized to .planning/, and subagents get fresh context.
It supports long-running projects. Work can continue across many sessions without relying on chat memory.
It enables parallel execution. Independent tasks can be split across executor agents.
It separates roles. Researcher, planner, executor, checker, and verifier are different mental roles.
It is optimized for shipping. The workflow does not stop at documentation; it moves toward verified output.

Weaknesses

Weakness	Practical consequence
Automation-heavy	Too many subagents can create broad changes that are hard to review
Not governance-first	It needs extra process for compliance, security, and formal approval
Command surface is large	Users must learn the workflow and configuration
Tooling/version risk	Track the maintained repo and package security
Source-of-truth conflicts	`.planning/` can conflict with specs, ADRs, or `aidlc-docs/`

Best use cases

Use case	Why GSD fits
Solo founder building an app over days or weeks	`.planning/` preserves memory and roadmap
Startup team optimizing shipping speed	Phase execution and parallel agents improve throughput
Many independent tasks	Subagents can execute in parallel
Migration split into phases	Roadmap and state files track progress
Long-running AI project	Less dependence on one long conversation

When to avoid it as primary

Prefer another primary workflow when:

Compliance and audit are the main concern: use AWS AI-DLC.
Specification correctness is the main concern: use Spec Kit.
The task is small and quality discipline is the main need: use Superpowers.
The team cannot review parallel changes safely.

Example: forgot password

GSD would structure the work as a phase:

Phase step	Output
Discuss	Flow, security constraints, email provider, success criteria
Plan	Backend, frontend, email, tests, and verification tasks
Execute	Subagents implement model/API, UI, email template, tests
Verify	Separate review for security, tests, and UX
Ship	Commit/PR and update `.planning/STATE.md`

The value is not the spec alone. The value is orchestration and context stability.

Expert implementation guide

Use GSD when your delivery problem is not just "what should we build?" but "how do we keep AI productive over many sessions and tasks without context collapse?"

Step 1: Decide whether GSD should be primary

GSD should be primary when:

The project has multiple phases.
The work spans several days or weeks.
Many tasks are independent.
You want subagents to research, plan, execute, and verify.
You can review generated changes safely.

Do not make GSD primary if compliance approval, not execution throughput, is the central problem.

Step 2: Map existing code first

For brownfield projects, run codebase mapping before creating a new GSD project context. The maintained GSD README recommends this pattern for returning or existing codebases.

text

/gsd-map-codebase
/gsd-new-project

Mapping should capture:

Area	What to extract
Stack	Frameworks, languages, package managers
Architecture	Modules, boundaries, data flow
Conventions	Naming, testing, linting, PR style
Risk areas	Auth, billing, migrations, shared components
Test commands	Fast tests, full tests, e2e tests

Step 3: Create project memory

/gsd-new-project should create or update .planning/ artifacts:

Project goal.
Requirements.
Roadmap.
Current state.
Configuration.

Treat .planning/ as operational memory, not marketing documentation. It should be concise enough for agents to reload and accurate enough for humans to trust.

Step 4: Run the six-command loop

mermaid

flowchart LR
    A["/gsd-new-project"] --> B["/gsd-discuss-phase N"]
    B --> C["/gsd-plan-phase N"]
    C --> D["/gsd-execute-phase N"]
    D --> E["/gsd-verify-work N"]
    E --> F["/gsd-ship N"]

The loop works best when each phase has a single delivery outcome. Avoid phases like "improve the app"; prefer "add password reset email request and token verification flow".

Step 5: Tune `.planning/config.json`

The maintained GSD docs describe several important dials. Use them intentionally.

Dial	Conservative setting	Aggressive setting	When to use
`mode`	`interactive`	`yolo`	Use interactive for production; yolo only for low-risk prototypes
Model profile	`quality`	`budget`	Use quality for design/risk; budget for repetitive tasks
Research agent	On	Off	Turn on for unfamiliar domains/code
Plan checker	On	Off	Turn on before parallel execution
Verifier	On	Off	Keep on unless tests are trivial
Parallelization	Off	On	Turn on only for independent tasks

Step 6: Parallelize only with hard boundaries

Good parallel tasks:

Backend endpoint and frontend copy updates.
Independent test suites.
Documentation and isolated UI component.
Migration script and read-only analysis.

Bad parallel tasks:

Two agents editing the same component.
Database schema and API contract without a shared plan.
Auth changes across shared middleware.
Cross-cutting refactor with unclear ownership.

Use this rule:

If two tasks may edit the same file or depend on the same design decision, do not run them in parallel.

Step 7: Verification is not optional

/gsd-verify-work should include:

Automated test commands.
Manual acceptance checklist.
Diff review.
Security/regression review for risky areas.
Update to .planning/STATE.md.

Skipping verification turns GSD into an acceleration tool for technical debt.

Optimization playbook

For solo builders

Use GSD to maintain momentum:

Keep phases short.
Use interactive mode until trust improves.
Use parallelism for docs/tests/UI polish, not core architecture.
Ship after each verified phase.

For teams

Add team controls:

Require PR review before /gsd-ship.
Keep .planning/ROADMAP.md aligned with issue tracker.
Limit phase size to reviewable diffs.
Record test evidence in PR descriptions.

For enterprise contexts

GSD can still help, but add governance:

AI-DLC or internal review for high-risk phases.
Security review for auth/data/infra changes.
Architecture approval before parallel work.
CI gating before merge.

Definition of done for GSD

A phase is done when:

Phase decisions are captured.
Plan has been checked.
Parallel tasks did not conflict.
Verification passed.
Manual acceptance checklist is complete.
.planning/STATE.md is updated.
PR/commit scope is understandable.
Next phase is clear.

GSD / Get Shit Done ​

Mental model ​

Artifact model ​

Workflow ​

Strengths ​

Weaknesses ​

Best use cases ​

When to avoid it as primary ​

Example: forgot password ​

Expert implementation guide ​

Step 1: Decide whether GSD should be primary ​

Step 2: Map existing code first ​

Step 3: Create project memory ​

Step 4: Run the six-command loop ​

Step 5: Tune .planning/config.json ​

Step 6: Parallelize only with hard boundaries ​

Step 7: Verification is not optional ​

Optimization playbook ​

For solo builders ​

For teams ​

For enterprise contexts ​

Definition of done for GSD ​

GSD / Get Shit Done

Mental model

Artifact model

Workflow

Strengths

Weaknesses

Best use cases

When to avoid it as primary

Example: forgot password

Expert implementation guide

Step 1: Decide whether GSD should be primary

Step 2: Map existing code first

Step 3: Create project memory

Step 4: Run the six-command loop

Step 5: Tune `.planning/config.json`

Step 6: Parallelize only with hard boundaries

Step 7: Verification is not optional

Optimization playbook

For solo builders

For teams

For enterprise contexts

Definition of done for GSD