An open, community-built learning hub for engineers who design, build, evaluate, and ship real agentic AI systems โ not just prompts.
| 9 Learning paths (all substantial) |
59 Hands-on labs (notebooks) |
8 Capstone projects |
16 Math foundations (with code) |
| 12 Architecture patterns |
9 Frameworks compared |
65+ Glossary entries |
71+ Concept pages |
Stack covered: LangGraph 1.0 ยท LangChain 1.0 ยท MCP 2025-11-25 ยท A2A v1.0 ยท CrewAI ยท AutoGen ยท Pydantic AI ยท OpenAI Agents SDK ยท LlamaIndex ยท Haystack ยท LangSmith ยท Google ADK
Three problems this repository fixes.
Tutorials go stale fast. A LangGraph blog post from a year ago is mostly wrong today. Every tool page here carries a verified-as-of date and a link to the official changelog.
Concepts and tools get tangled. Most courses teach "LangGraph" when they mean "agent state machines." This repo keeps them apart โ concepts and math live in their own folders and don't change when the framework does.
The math is usually skipped or overdone. Agentic AI rests on real math โ autoregressive generation, embeddings, MDPs, policies, retrieval theory โ but most engineers don't need a textbook. The math pages here include the equations that change how you debug a system, with executable Python next to each. Nothing else.
The result: a practical, technically deep, community-maintained reference that separates stable fundamentals (still right in five years) from fast-moving tooling (LangGraph, MCP, A2A, vector DBs, eval platforms). Updates to fast-moving tools don't break the curriculum. Six months from now, the concept pages still work.
- ๐ ๏ธ Software engineers moving from chatbots to real agentic systems with tools, memory, and orchestration.
- ๐งช ML/AI practitioners who know the modeling side and want a structured map of agent patterns, protocols, and production concerns.
- ๐ Advanced learners who already understand transformers, embeddings, and Python at a working level and want to go from notebook prototypes to evaluated, observable, deployable agents.
If you've never written Python or used an LLM API, this isn't the right starting point. Try a foundational LLM course first, then come back.
Working through the labs and projects, you'll end up with code worth keeping:
- ๐ค A ReAct-style agent from scratch, no framework, so you understand the loop before you abstract it away.
- ๐ RAG systems including agentic RAG where retrieval is a tool, not a fixed pipeline.
- ๐ฅ Multi-agent topologies โ supervisor, hierarchical, swarm โ and a clear sense of when each one is wrong for the job.
- ๐ MCP servers and clients wired to real data sources.
- ๐ค A2A agents that discover and delegate to each other.
- ๐ Evaluated agents with traces, golden datasets, and judge-based scorers.
- ๐ข Production-shaped systems with cost engineering, latency budgets, streaming, human-in-the-loop, and safety guardrails.
Get a working agent running locally in a few minutes.
# 1. Clone
git clone https://github.com/MHHamdan/Agentic-AI-Engineer.git
cd Agentic-AI-Engineer
# 2. Set up the environment (uv recommended; pip works too)
uv sync # or: python -m venv .venv && pip install -r requirements.txt
# 3. Add your API keys
cp .env.example .env # then edit .env
# 4. Run the first lab
uv run jupyter lab labs/01-first-agent-from-scratch/lab.ipynbFull setup details โ Docker, local-model fallbacks via Ollama, troubleshooting โ live in setup/.
New to the repo? Open these in order.
- ๐บ๏ธ
docs/start-here.mdโ 5-minute tour of the repo and how to use it. - ๐ค
labs/01-first-agent-from-scratch/โ build a working agent before reading any theory. - ๐
concepts/agents/what-is-an-agent.mdโ the vocabulary the rest of the repo uses. - ๐ฏ
learning-paths/โ pick a curated path based on your goal.
Skip the linear reading order and jump to what you actually need.
| If you want toโฆ | Start here | Status |
|---|---|---|
| ๐ค Build your first real agent | 01 โ Foundations | โ Complete |
| ๐ Build retrieval-augmented agents | 02 โ Agentic RAG | โ v1 + v2 |
| ๐ฅ Orchestrate multiple cooperating agents | 03 โ Multi-Agent Systems | โ v1 + v2 + v3 + frameworks page |
| ๐ Wire agents to tools, data, and other agents | 04 โ Tool Protocols (MCP + A2A) | โ Complete โ all 7 modules |
| ๐ช Get more out of the context window | 05 โ Context Engineering | โ v1 complete โ 6/6 modules |
| ๐ Add tracing, evals, and observability | 06 โ Evaluation & Observability | โ v1 + v2 |
| ๐ข Ship to production safely | 07 โ Production & Safety | โ v1 complete โ 8/8 modules |
| ๐งฎ Understand the math behind it all | 08 โ Mathematical Foundations | โ v1 complete โ 13/13 pages (refreshed with code examples) |
| ๐ Build something portfolio-worthy | 09 โ Capstones | โ v1 complete โ 8 project briefs |
Each path is a curated reading list across the rest of the repo โ concepts, labs, recipes, patterns โ not a duplicate folder of content. Every link in this table resolves to a real, authored README.
agentic-ai-engineer/
โโโ ๐ docs/ Start-here pages, FAQ, community pages
โโโ ๐ฏ learning-paths/ Curated journeys (links into the rest of the repo)
โโโ ๐ concepts/ Short explainers - what something is and when to use it
โโโ ๐งฎ math-foundations/ Engineer-useful math with citations and Python code
โโโ ๐งช labs/ Hands-on guided exercises (notebooks + READMEs)
โโโ ๐งฐ recipes/ Copy-paste solutions to common problems
โโโ ๐ patterns/ Architecture patterns with diagrams and tradeoffs
โโโ ๐ projects/ Build Challenges and Capstone Projects
โโโ ๐ examples/ Minimal reference implementations
โโโ ๐ง tools/ Versioned snapshots of fast-moving frameworks
โโโ ๐ evaluation/ Eval frameworks, datasets, scorers
โโโ ๐ข production/ Deployment, cost, latency, streaming, concurrency
โโโ ๐ก security/ Threats, defenses, red-teaming
โโโ ๐จ diagrams/ Mermaid sources + rendered images
โโโ ๐ references/ Papers, books, talks, community resources
โโโ ๐ค glossary/ A-to-Z terminology
โโโ โ๏ธ setup/ Environment setup
โโโ ๐ assets/ Working artifacts (not user-facing curriculum)
A more detailed walkthrough of every folder lives in docs/how-to-use-this-repo.md.
Nine paths, each curating content across the rest of the repo. They overlap deliberately: the Multi-Agent path reuses concept pages from Foundations, the Production path leans on Evaluation, and so on.
flowchart TD
F[๐ค 01 · Foundations<br/><i>Agent loop, ReAct, tools, memory</i>]
R[๐ 02 · Agentic RAG<br/><i>Retrieval as a tool</i>]
M[๐ฅ 03 · Multi-Agent Systems<br/><i>Supervisor, hierarchical, swarm</i>]
T[๐ 04 · Tool Protocols<br/><i>MCP + A2A</i>]
C[๐ช 05 · Context Engineering<br/><i>Budgets, compression, selection</i>]
E[๐ 06 · Evaluation & Observability<br/><i>Tracing, judge ensembles, drift</i>]
P[๐ข 07 · Production & Safety<br/><i>Cost, latency, guardrails, red-team</i>]
X[๐งฎ 08 · Mathematical Foundations<br/><i>Probability, embeddings, MDPs</i>]
CAP[๐ 09 · Capstones<br/><i>End-to-end portfolio builds</i>]
F --> R
F --> M
F --> T
R --> C
M --> C
T --> C
R --> E
M --> E
C --> E
E --> P
F -.-> X
R -.-> X
M -.-> X
P --> CAP
E --> CAP
classDef beginner fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px,color:#1b5e20
classDef rag fill:#e3f2fd,stroke:#1565c0,stroke-width:2px,color:#0d47a1
classDef multi fill:#e0f7fa,stroke:#00838f,stroke-width:2px,color:#006064
classDef tool fill:#fff8e1,stroke:#f57f17,stroke-width:2px,color:#e65100
classDef context fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#bf360c
classDef eval fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px,color:#4a148c
classDef prod fill:#ffebee,stroke:#c62828,stroke-width:2px,color:#b71c1c
classDef math fill:#e8eaf6,stroke:#283593,stroke-width:2px,color:#1a237e
classDef capstone fill:#fce4ec,stroke:#c2185b,stroke-width:3px,color:#880e4f
class F beginner
class R rag
class M multi
class T tool
class C context
class E eval
class P prod
class X math
class CAP capstone
| # | Path | Focus | Difficulty |
|---|---|---|---|
| 01 | Foundations | Agent loop, ReAct, tools, memory, first frameworks | ๐ข Beginner-friendly |
| 02 | Agentic RAG | Retrieval as a tool, hybrid search, RAG failure modes | ๐ก Intermediate |
| 03 | Multi-Agent Systems | Supervisor, hierarchical, swarm topologies | ๐ก Intermediate |
| 04 | Tool Protocols (MCP + A2A) | Standardized integration with tools and other agents | ๐ก Intermediate |
| 05 | Context Engineering | Token budgets, compression, selection strategies | ๐ก Intermediate |
| 06 | Evaluation & Observability | Tracing, golden datasets, LLM-as-judge, RAG eval | ๐ด Advanced |
| 07 | Production & Safety | Cost, latency, guardrails, deployment, red-teaming | ๐ด Advanced |
| 08 | Mathematical Foundations | LM probability, embeddings, MDPs, policies, eval metrics | ๐ข โ ๐ด |
| 09 | Capstones | End-to-end build challenges that combine everything | ๐ด Advanced |
Five content types, each doing one job well.
| Type | What it answers | Length | Where it lives |
|---|---|---|---|
| ๐ Concept | What is this and when do I use it? | ~10-min read | concepts/ |
| ๐งช Lab | Walk me through building this hands-on. | 30 to 120 min, notebook + README | labs/ |
| ๐งฐ Recipe | I have this specific problem โ what's the fix? | Copy-paste, 5-min read | recipes/ |
| ๐ Pattern | Which architecture should I use, and why? | Diagram + tradeoffs | patterns/ |
| ๐ Project | Let me build something substantial. | Hours to days | projects/ |
Two conventions used in this repo for non-tutorial work: Build Challenges are smaller, time-boxed builds living inside paths or labs, and Projects are larger end-to-end builds in projects/. Neither is called "homework" โ this is a public resource, not a classroom.
The math is here because it makes you a better engineer, not because it's a textbook. All 13 pages are authored, refreshed, and ready to read. Every page follows the same template:
- โจ Why this matters for agentic AI โ the engineering motivation in two or three sentences.
- ๐ The equation โ clean GitHub-rendered LaTeX, every symbol defined immediately below.
- ๐ฃ๏ธ How to read this equation โ a plain-language walkthrough.
- ๐ก Mathematical intuition โ the underlying ideas.
- ๐ง Where this appears in agentic systems โ specific connections to repo content.
- ๐ Code example โ a minimal, executable Python snippet.
โ ๏ธ Common mistakes โ failure modes engineers actually run into.- ๐ Repo cross-references โ direct links into concepts, labs, and patterns.
- ๐งญ Related pages โ what to read next.
- ๐ References โ papers and textbooks with one-sentence relevance notes.
What's covered (all 13 โ ):
| # | Page | Equation anchor |
|---|---|---|
| 01 | Language model probability | |
| 02 | Embeddings and vector similarity | |
| 03 | RAG formulation as marginalization | |
| 04 | Agents as policies | |
| 05 | MDP / POMDP intuition | |
| 06 | The ReAct loop, formalized | $(\tau_t, a_t, \text{stop}t) \sim \pi\theta$ |
| 07 | Tool selection as function selection |
|
| 08 | Planning and search | BFS / A* / MCTS |
| 09 | Memory models | |
| 10 | Multi-agent coordination graphs | |
| 11 | Evaluation metrics | P, R, F1, faithfulness, ECE |
| 12 | Uncertainty and safety |
|
| 13 | Context-window optimization |
A symbol-and-notation cheat sheet is at math-foundations/notation.md โ one source of truth for
Math pages are cross-linked from the concept pages, so you can read either track first. None require more than undergraduate probability and linear algebra. The full landing page with reading approaches is at math-foundations/README.md.
Agentic AI moves fast. The shape of this repo reflects that.
flowchart TD
subgraph Stable["๐ข Stable - months to years"]
direction TB
S1[๐ concepts/]
S2[๐งฎ math-foundations/]
S3[๐ patterns/]
end
subgraph SlowMoving["๐ก Slow-moving - quarters"]
direction TB
M1[๐งฐ recipes/]
M2[๐ projects/]
M3[๐ evaluation/]
end
subgraph FastChanging["๐ด Fast-changing - weeks to months"]
direction TB
F1[๐ง tools/]
F2[๐ examples/]
F3[๐ข production/ deployment specifics]
end
Stable --> SlowMoving
SlowMoving --> FastChanging
classDef stable fill:#c8e6c9,stroke:#1b5e20,stroke-width:2px,color:#1b5e20
classDef slow fill:#fff9c4,stroke:#f57f17,stroke-width:2px,color:#e65100
classDef fast fill:#ffcdd2,stroke:#b71c1c,stroke-width:2px,color:#b71c1c
classDef sgStable fill:#e8f5e9,stroke:#2e7d32,stroke-width:3px,color:#1b5e20
classDef sgSlow fill:#fffde7,stroke:#f9a825,stroke-width:3px,color:#e65100
classDef sgFast fill:#ffebee,stroke:#c62828,stroke-width:3px,color:#b71c1c
class S1,S2,S3 stable
class M1,M2,M3 slow
class F1,F2,F3 fast
class Stable sgStable
class SlowMoving sgSlow
class FastChanging sgFast
| Tier | Update cadence | What lives here |
|---|---|---|
| ๐ข Stable | Years | The ReAct loop, RAG marginalization, supervisor vs swarm, eval metric definitions, safety theory |
| ๐ก Slow-moving | 6 to 12 months | Architecture patterns, agentic-RAG strategies, eval workflows |
| ๐ด Fast-changing | Weeks to months | LangGraph APIs, LangSmith UI, MCP spec revisions, A2A SDKs, vector-DB pricing, model names |
Anything in ๐ด territory carries a verification badge โ see below.
Every page in tools/ and every code snippet that depends on a specific library version carries a header like this:
> ๐ด Tool snapshot โ <tool> <version>, verified <YYYY-MM-DD>
> Source: <official docs / changelog / spec link>
Concrete examples of how this policy is applied (verified at the time of writing):
| Tool / Spec | Status | Source |
|---|---|---|
| ๐ MCP specification | Current stable: 2025-11-25. A release candidate dated 2026-07-28 was announced on May 21, 2026. | modelcontextprotocol.io/specification/2025-11-25, MCP blog |
| ๐ค A2A protocol | v1.0 released; protocol donated to the Linux Foundation in June 2025. | a2a-protocol.org/latest/, announcing-1.0 |
| ๐ LangGraph | 1.0 GA (Oct 2025) โ first stable major release. langgraph.prebuilt is deprecated in favor of langchain.agents. |
LangChain changelog |
| ๐ฆ LangChain | 1.0 GA (Oct 2025) โ create_agent abstraction, middleware system. |
LangChain changelog |
| ๐ LangSmith, Google ADK, CrewAI, AutoGen, vector DBs | Each carries its own verified-as-of date in tools/. |
Linked per page. |
Verified-as-of dates are refreshed during routine maintenance sweeps (tracked in CHANGELOG.md). If you spot stale information, please open an issue with the stale-tool-version label.
๐ Repo v1 phase is structurally complete. All nine paths have substantial content; six are fully complete at v1; three have multiple revisions (v1, v2, v3) plus extended materials.
Substantial-content paths (nine of nine):
- โ Path 01 โ Foundations: complete
- โ Path 02 โ Agentic RAG: v1 + v2 shipped
- โ Path 03 โ Multi-Agent Systems: v1 + v2 (6 patterns) + v3 (3 capstones) + frameworks comparison page (9 frameworks)
- โ Path 04 โ Tool Protocols (MCP + A2A): all 7 modules complete
- โ Path 05 โ Context Engineering: v1 complete, 6 of 6 modules
- โ Path 06 โ Evaluation & Observability: v1 + v2 complete
- โ Path 07 โ Production & Safety: v1 complete, 8 of 8 modules
- โ Path 08 โ Mathematical Foundations: v1 complete, all 13 pages authored and refreshed with Python code examples
- โ Path 09 โ Capstones: v1 complete, all 8 project briefs
Future work is continuous improvement (depth, breadth, examples, recipes, references), not gap-filling.
Supporting infrastructure:
- ๐งช 50 lab notebooks (all pre-executed; outputs visible on GitHub)
- ๐ 12 of 12 top-level architecture patterns authored
- ๐
concepts/(~71 files):agents/,context/,evaluation/,memory/,multi-agent/,rag/,tools/โ multiple subdirs complete - ๐ข
production/(cost engineering, latency, streaming) โ 5 deep pages - ๐ก
security/(prompt injection, tool abuse, data exfiltration, safety policy, red-teaming) โ 5 deep pages - ๐ค
glossary/terms.mdโ 65+ A-to-Z entries cross-linked to canonical sources
Currently scaffold-state โ community contribution welcomed:
- ๐
examples/โ minimal reference implementations (placeholder) - ๐งฐ
recipes/โ copy-paste solutions to specific problems (placeholder) - ๐
references/โ curated reading lists (placeholder; concept-page inline citations cover the gap) - ๐
evaluation/โ eval frameworks/datasets/scorers (the conceptual side is already inconcepts/evaluation/) - ๐
quizzes/โ knowledge checks across the curriculum (31 files; partial coverage)
For each scaffold-state folder, the folder's own README documents what's planned and where related content currently lives. The curriculum is usable now; the supporting folders deepen it over time.
Full release history with verification fingerprints lives in CHANGELOG.md.
Architecture and concept diagrams are written as Mermaid in diagrams/ with rendered SVG/PNG committed alongside the source. Inline Mermaid blocks (like the two above) render natively on GitHub. The full list of diagrams currently in the repo, with descriptions, lives in diagrams/README.md.
This is built to be a community resource, not a one-author site. Useful contributions include:
- ๐งฐ New recipes for problems you've actually hit in production.
- ๐ New patterns or comparison tables.
- ๐ง Updating a
tools/page when a framework ships a breaking change. - ๐ Translating a concept page.
- ๐ Filing issues when something is unclear, wrong, or stale.
- ๐ Adding your project to the community showcase (planned
docs/community/showcase.md).
The contribution workflow, templates for each content type, and the style guide are in CONTRIBUTING.md. Good first issues are labeled good-first-issue.
We follow a Code of Conduct โ please read it before posting.
Curated reading lives in references/, organized by type. The folder currently scaffolds four planned pages:
- ๐
references/papers.mdโ foundational papers (ReAct, RAG, Toolformer, Reflexion, and so on) with citations (planned) - ๐
references/books.mdโ books that have aged well (planned) - ๐ค
references/talks.mdโ conference talks worth your time (planned) - ๐ฅ
references/community.mdโ blogs, repos, and people worth following (planned)
In the interim, every concept and pattern page cites its sources inline. The references/ folder is where those citations will be consolidated into curated reading lists.
External references cited in this README:
- ๐ Model Context Protocol โ modelcontextprotocol.io, blog.modelcontextprotocol.io
- ๐ค Agent2Agent (A2A) Protocol โ a2a-protocol.org
- ๐ฆ LangGraph & LangChain changelog โ changelog.langchain.com
- ๐ LangChain docs โ docs.langchain.com
If you use this material in research or teaching, please cite the repo via the CITATION.cff file. A BibTeX snippet is also provided there.
This repository uses a dual license.
- ๐ป Code (Python, notebooks, scripts, configs) is licensed under Apache License 2.0. You can use it commercially, modify it, and distribute it, with attribution and a patent grant.
- ๐ Educational prose, diagrams, and other written content are licensed under Creative Commons Attribution 4.0 (CC-BY-4.0). You can reuse and adapt them with attribution.
When in doubt, attribute. When attributing, link back to this repo.
๐ฑ Built in the open. Maintained by the community. PRs, issues, and "this is wrong" comments all welcome.
โญ Star the repo ยท ๐ด Fork & adapt ยท ๐ Open an issue