Skip to content

Decision Guide

The wrong question is "Which framework is best?" The useful question is "Which layer of AI engineering is failing in my context?"

First questionGo to
Are you building an AI app?LangChain, LangGraph, Data/RAG, Tools/MCP
Are you improving software delivery with agents?Spec Kit, OpenSpec, AI-DLC, GSD, Superpowers
Are you running/customizing an agent harness?Hermes, Codex vs Claude vs Hermes
Are you preparing for production?Evals & Observability, Security & Governance
Are you choosing models or serving strategy?Model & Serving Layer
mermaid
flowchart TB
    Q["What are you building or choosing?"] --> A0{"AI app / agent backend?"}
    A0 -->|Yes| A1{"Simple chains, RAG, tools?"}
    A1 -->|Yes| LC["LangChain"]
    A1 -->|No, stateful graph| LG["LangGraph"]
    A0 -->|No| R{"Runtime or workflow?"}
    R -->|Runtime/harness| H{"Need open-source custom runtime?"}
    H -->|Yes| HA["Hermes Agent"]
    H -->|No| HC["Codex CLI or Claude Code"]
    R -->|Workflow/methodology| A{"Governance, audit, many stakeholders?"}
    A -->|Yes| AIDLC["AWS AI-DLC Workflows"]
    A -->|No| B{"Are requirements/specs vague?"}
    B -->|Yes| SK["GitHub Spec Kit"]
    B -->|No| O{"Want lightweight iterative change specs?"}
    O -->|Yes| OS["OpenSpec"]
    O -->|No| C{"Long project, many phases, context breaks?"}
    C -->|Yes| GSD["GSD"]
    C -->|No| D{"Need TDD, review, agent discipline?"}
    D -->|Yes| SP["Superpowers"]
    D -->|No| MIN["Minimal workflow: clarify -> implement -> test -> review"]

Choose by need

I need...Choose
Full AI engineering stack contextAI Engineering Stack Map
Model routing, local LLMs, or serving strategyModel & Serving Layer
RAG data pipeline and retrieval qualityData, RAG & Retrieval
Safe tool use, MCP, or tool gatewaysTools, MCP & Gateways
Evals, tracing, and production feedbackEvals & Observability
Security, governance, and risk tiersSecurity & Governance
Build AI app, RAG, or tool-calling agentLangChain
Build stateful long-running agent backendLangGraph
A polished coding agent CLICodex CLI or Claude Code
An open-source/customizable agent runtimeHermes Agent
A way to describe features so AI builds the right thingSpec Kit
A lightweight spec layer for iterative brownfield changesOpenSpec
A lifecycle with approval and auditAWS AI-DLC Workflows
A system for many phases across many sessionsGSD
A skill layer that stops the agent from coding recklesslySuperpowers
MVP speed with some structureGSD + lightweight Spec Kit
Important product feature with acceptance criteriaSpec Kit + Superpowers
Enterprise modernizationAWS AI-DLC primary
Safer refactoringSuperpowers + tests
Compliance or security-sensitive deliveryAWS AI-DLC + explicit security gates

Choose by work size

Work sizeRecommended workflow
5-30 minutesSuperpowers light or manual prompt with tests
Half day to 2 daysSpec Kit or Superpowers
1-5 day iterative brownfield changeOpenSpec
1-3 weeksSpec Kit + GSD, or AWS AI-DLC if risk is high
1-3 monthsAWS AI-DLC or GSD with added governance
Enterprise programAWS AI-DLC primary; others as supporting layers

Choose by codebase

CodebaseBest fit
Greenfield product appSpec Kit for clarity; GSD for speed
Brownfield monolithAWS AI-DLC for modernization; Superpowers for targeted refactors
Brownfield feature change with low-medium riskOpenSpec
API/librarySpec Kit because contract clarity matters
Internal toolGSD or Spec Kit
Regulated systemAWS AI-DLC
Open source projectSpec Kit or Superpowers for PR reviewability

Red flags

Red flagWhat to avoid
You want to install all frameworks at onceYou will create multiple sources of truth
You cannot review generated docsDo not use a heavy governance flow
Your CI is weakDo not allow broad automated execution
The feature is security-sensitiveDo not use a purely speed-optimized workflow
The task is tinyDo not create full lifecycle artifacts
You need formal auditDo not rely on OpenSpec alone
You only need a workflow processDo not add Hermes unless runtime customization matters

Built as a static bilingual AI engineering stack guide.