Bản đồ AI Engineering Stack

Trang này là bản đồ tư duy cho toàn bộ guide. Ý chính rất đơn giản: nhiều công cụ AI trông giống nhau vì đều có planning, execution, review và iteration. Chúng khác nhau vì hoạt động ở các layer khác nhau trong AI engineering stack.

Các framework có vẻ giống nhau vì đều có plan, implement, review, iterate. Chúng khác nhau vì mỗi framework sở hữu một layer khác nhau trong stack.

Nếu so sánh sai layer, mọi thứ sẽ mơ hồ. LangGraph, Hermes, AWS AI-DLC, Spec Kit và MCP đều có thể xuất hiện trong một hệ thống agentic, nhưng chúng không giải quyết cùng một vấn đề.

Stack tổng thể

mermaid

flowchart TB
    A[Business intent và product risk] --> B[Workflow và methodology]
    B --> C[Agent harness hoặc coding runtime]
    C --> D[Agent app framework]
    D --> E[Tools và protocols]
    D --> F[Data, RAG, retrieval]
    E --> G[Model/provider/serving]
    F --> G
    H[Evals và observability] -. cross-cutting .-> B
    H -. cross-cutting .-> C
    H -. cross-cutting .-> D
    I[Security và governance] -. cross-cutting .-> B
    I -. cross-cutting .-> C
    I -. cross-cutting .-> D
    I -. cross-cutting .-> E

Bản đồ các layer

Layer	Ví dụ	Câu hỏi chính	Output
Workflow/methodology	Spec Kit, OpenSpec, AWS AI-DLC, GSD, Superpowers	Human và agent nên delivery software như thế nào?	Specs, plans, approvals, tests, reviews
Agent harness/runtime	Codex CLI, Claude Code, Hermes	Agent chạy với tools, memory, filesystem và subagents như thế nào?	Tool calls, code changes, terminal actions
Agent app framework	LangChain, LangGraph, LlamaIndex, Semantic Kernel	Làm sao build AI application hoặc long-running agent service?	Chains, graphs, agents, state machines
Tool/protocol layer	MCP, OpenAPI tools, function calling, tool gateways	Model được phép làm gì một cách an toàn?	Tool schemas, policies, audit events
Data/RAG layer	Embeddings, vector DBs, retrievers, rerankers	AI dùng tri thức nào?	Indexed content, retrieved context, citations
Model/serving layer	OpenAI, Anthropic, local LLMs, vLLM, Ollama, LiteLLM	Model nào chạy inference và chạy như thế nào?	Responses, tool calls, token/cost/latency profile
Evals/observability	LangSmith, Langfuse, Phoenix, OpenTelemetry	Làm sao biết hệ thống vẫn chạy đúng?	Traces, eval scores, regression results
Security/governance	RBAC, sandboxing, audit logs, approval gates	Điều gì được phép, được review và ai chịu trách nhiệm?	Policies, logs, approvals, risk records

Vì sao dễ bị rối

Các layer khác nhau dùng lại cùng một nhóm động từ:

Động từ	Trong workflow frameworks	Trong app frameworks	Trong harnesses
Plan	Plan feature hoặc delivery unit	Plan execution path của agent	Plan terminal/file actions
Implement	Sinh code từ specs/tasks	Chạy node/chain/tool path	Sửa file và chạy command
Review	Review spec, design, code, evidence	Evaluate outputs và traces	Kiểm tra diff, tests, logs
Iterate	Đổi artifacts và implementation	Cải thiện prompts, tools, graphs	Retry task với context tốt hơn

Động từ giống nhau. Quyền sở hữu khác nhau.

Vì vậy, chỉ nhìn thấy plan -> implement -> review là chưa đủ để phân loại framework. Cần hỏi framework đó sở hữu artifact nào, kiểm soát rủi ro nào và ai chịu trách nhiệm cuối cùng.

Cách đọc guide này

Dùng trang này để xác định layer bạn đang giải quyết.
Đọc deep-dive pages để hiểu từng framework.
Dùng comparison pages để chọn workflow mặc định.
Dùng reference architectures để kết hợp các layer an toàn.

Quy tắc chọn nhanh nhất

Vấn đề	Bắt đầu ở đây
Requirement còn mơ hồ	Spec Kit hoặc OpenSpec
Enterprise delivery cần audit và approval	AWS AI-DLC
Project dài bị mất context	GSD
Agent code thiếu kỷ luật engineering	Superpowers
Cần open/custom agent harness	Hermes
Build RAG hoặc tool-calling AI app	LangChain
Build stateful long-running agent app	LangGraph
Cần production assurance	Evals, observability, security, governance

Ví dụ thực tế

Với một production support agent, các layer có thể như sau:

mermaid

flowchart TB
    A[AWS AI-DLC] --> B[Delivery governance và approval]
    C[OpenSpec] --> D[Change proposal cho một agent capability]
    E[LangGraph] --> F[Stateful support-agent runtime]
    G[MCP/tool gateway] --> H[CRM, ticketing, knowledge-base tools]
    I[RAG layer] --> J[Policy và product documentation]
    K[Model router] --> L[Hosted và local models]
    M[LangSmith/Langfuse/Phoenix] --> N[Traces và evals]
    O[Security controls] --> P[Tool approval, audit, memory retention]

Không có một framework duy nhất sở hữu toàn bộ stack đó. Một team trưởng thành sẽ kết hợp ít layer nhất có thể, nhưng ranh giới trách nhiệm phải rõ.

Bản đồ AI Engineering Stack ​

Stack tổng thể ​

Bản đồ các layer ​

Vì sao dễ bị rối ​

Cách đọc guide này ​

Quy tắc chọn nhanh nhất ​

Ví dụ thực tế ​

References ​