AI Solution Architecture

Docs

View source

Learn AI Solution Architecture

This course turns the repository-by-repository architecture notes into one coherent knowledge system. It is for senior developers, solution architects, staff engineers, and technical leads who need to design AI systems that survive production constraints.

The key idea: a real AI solution is not "an LLM plus an app." It is a layered system where agent orchestration, model runtime, retrieval data, adaptation strategy, evaluation, observability, tools, security, and operations all affect each other.

What You Will Be Able To Do

After working through this course, you should be able to:

The Knowledge Model

flowchart TB Product[Product and user workflow] --> App[Application / agent architecture] App --> Runtime[Inference runtime] App --> Retrieval[RAG and vector data] App --> Tools[Tools, MCP, platform UI] Runtime --> Model[Model artifact and tokenizer] Training[Training and adaptation] --> Model Retrieval --> Knowledge[Documents, chunks, metadata, embeddings] App --> Eval[Evaluation and feedback] Runtime --> Eval Retrieval --> Eval Eval --> Ops[LLMOps and production operations] Tools --> Ops Ops --> Governance[Security and governance]

Architecture work happens at the boundaries:

Course Structure

PagePurpose
CurriculumTwelve conceptual lessons that build a complete mental model.
ProjectsSix hands-on architecture projects, ending in a capstone.
Repository atlasA comparison map for all 17 repositories and their roles.
GlossaryShared vocabulary for architecture reviews and design discussions.

Source Deep Dives

The underlying reference notes live in repo-architecture-docs. Each repository has an English and Vietnamese architecture document with source tree maps, diagrams, extension points, security risks, operational guidance, failure modes, production readiness checklists, and glossary entries.

Use this course layer when you need the end-to-end system view. Use the source deep dives when you need implementation-level repository detail.

Domain Map

DomainDeep-Dive DocsArchitecture Responsibility
Agent applicationsGroup 01Planning, tool use, workflow control, memory, human escalation, multi-agent coordination
Inference servingGroup 02Loading, scheduling, batching, quantization, local/distributed serving, token streaming
Training and adaptationGroup 03Adapter strategy, optimizer state, distributed scaling, checkpoint governance
RAG and vector dataGroup 04Embeddings, indexing, metadata, tenancy, durability, hybrid retrieval
LLMOps and evaluationGroup 05Tracing, evaluation, experiment tracking, feedback, lineage, model/prompt governance
Tooling and platformGroup 06MCP servers, tool gateways, self-hosted chat UI, admin controls, provider routing

How To Study

  1. Start with the curriculum and read the lesson summaries in order.
  2. For each lesson, open the matching repository deep dives and inspect the diagrams.
  3. Capture decisions in a design log: selected layer, alternatives rejected, failure modes, validation plan.
  4. Run the matching project from projects.
  5. Return to the repository atlas whenever you need to compare tools.
  6. Finish with the capstone production readiness review.

The Central Pattern

AI SOLUTION ARCHITECTURE
========================
User workflow
  -> AI application boundary
  -> agent / workflow / retrieval decisions
  -> model runtime and data plane
  -> evaluation and feedback loop
  -> operations and governance

The model supplies capability.
The architecture supplies reliability.
The evaluation loop supplies evidence.
The governance layer supplies control.

What This Course Is Not

This is not a prompt engineering checklist, a benchmark leaderboard, or a catalog of every AI library. It is an architecture course built from real repository structures. The expected output is better design judgment: knowing which layer owns which problem, which trade-off matters, and which production failure should be rehearsed before launch.