Skip to content

One-Page Cheat Sheet

Use this page when you need the fastest possible answer to: which tool belongs to which layer, and what should it produce?

The 30-second model

mermaid
flowchart TB
    A[Model API or local LLM] --> B[Agent app framework]
    B --> C[Agent harness/runtime]
    C --> D[Workflow/methodology]
    D --> E[Artifacts and source of truth]
    E --> F[Repo, CI, evals, audit]

    B --> B1[LangChain / LangGraph]
    C --> C1[Codex CLI / Claude Code / Hermes]
    D --> D1[Spec Kit / OpenSpec / AI-DLC / GSD / Superpowers]

The key idea:

Most frameworks share verbs like plan, implement, and review. They differ by what they govern.

Choose by problem

Your real problemBest starting pointWhy
The agent guesses requirementsGitHub Spec KitMake intent, spec, plan, and tasks explicit before coding
You want lightweight SDDOpenSpecKeep change proposals and delta specs without heavy process
You need enterprise traceabilityAWS AI-DLC WorkflowsAdd approvals, risk records, NFRs, and audit trail
Work spans many sessionsGSDPreserve context, phase plans, and handoffs
Agent coding lacks disciplineSuperpowersEnforce design, TDD, review, and finish habits
You are building a RAG or tool-calling appLangChainCompose models, prompts, tools, retrievers, and chains
You are building a stateful agent serviceLangGraphModel state, nodes, edges, checkpoints, and human-in-the-loop
You need a customizable agent runtimeHermesOwn the harness, tools, memory, skills, and model routing
You need a protocol for toolsMCPStandardize tool exposure and integration boundaries

Same words, different ownership

WordWorkflow framework meaningHarness/runtime meaningApp framework meaning
PlanDelivery plan, spec, tasks, approval pathAgent execution plan for tool useGraph, chain, node, or state transition design
ImplementCode changes against requirementsTool calls, file edits, terminal actionsRuntime logic inside an AI app
ReviewHuman/code/spec reviewAgent output verificationEval, trace, or behavior inspection
MemoryProject context or long-lived handoffAgent memory/session contextApplication memory, state, or retrieval
GovernanceDelivery gates and accountabilityTool permissions and sandboxingRuntime guardrails and eval thresholds

Selection matrix

ContextPrimary workflowSupporting layersMinimum artifacts
Small product featureSpec Kit or OpenSpecSuperpowersspec/change proposal, tasks, tests
Startup MVPOpenSpecSuperpowers, LangChain if AI appchange proposal, test checklist, done criteria
Enterprise modernizationAWS AI-DLCSpec Kit, Superpowersrisk record, NFRs, approval log, migration plan
RAG productOpenSpecLangChain, evals, observabilitydata contract, eval set, prompt contract
Long-running agent serviceAI-DLC or OpenSpecLangGraph, security/governancestate schema, tool policy, eval gates
Internal agent platformAI-DLCHermes, MCP, LangGraph, observabilitytool registry, memory policy, audit trail
Multi-agent delivery projectGSDSuperpowers, Spec Kit/OpenSpecphase plan, context packet, handoff notes

Do not compare these directly

Wrong comparisonBetter framing
LangGraph vs AI-DLCLangGraph builds runtime behavior; AI-DLC governs delivery
Hermes vs Spec KitHermes runs agents; Spec Kit structures specs
MCP vs LangChainMCP exposes tools; LangChain builds app logic that may use tools
Superpowers vs Codex CLISuperpowers is engineering method; Codex CLI is an agent harness
OpenSpec vs LangGraphOpenSpec governs changes; LangGraph implements stateful agent behavior

Minimal stack by maturity

mermaid
flowchart LR
    A[Prototype] --> B[OpenSpec + Superpowers]
    B --> C[Production AI app]
    C --> D[OpenSpec + LangChain + evals]
    C --> E[LangGraph + tool policy]
    E --> F[Enterprise agent system]
    F --> G[AI-DLC + LangGraph + MCP + observability + audit]

Rule of thumb

Use one primary workflow as the source of truth. Add app frameworks, harnesses, tools, and eval layers around it, but do not let every layer invent its own plan.

Built as a static bilingual AI engineering stack guide.