Agentic AI Engineer

An open, community-built learning hub for engineers who design, build, evaluate, and ship real agentic AI systems — not just prompts.

🚀 At a glance

9 Learning paths (all substantial)	59 Hands-on labs (notebooks)	8 Capstone projects	16 Math foundations (with code)
12 Architecture patterns	9 Frameworks compared	65+ Glossary entries	71+ Concept pages

Stack covered: LangGraph 1.0 · LangChain 1.0 · MCP 2025-11-25 · A2A v1.0 · CrewAI · AutoGen · Pydantic AI · OpenAI Agents SDK · LlamaIndex · Haystack · LangSmith · Google ADK

🎯 Why this exists

Three problems this repository fixes.

Tutorials go stale fast. A LangGraph blog post from a year ago often targets APIs that have since changed or been deprecated. Every tool page here carries a verified-as-of date and a link to the official changelog.

Concepts and tools get tangled. Most courses teach "LangGraph" when they mean "agent state machines." This repo keeps them apart — concepts and math live in their own folders and don't change when the framework does.

The math is usually skipped or overdone. Agentic AI rests on real math (autoregressive generation, embeddings, MDPs, policies, retrieval theory), but most engineers don't need a textbook. The math pages here include only the equations that change how you debug a system, each with executable Python next to it.

The result: a practical, technically deep, community-maintained reference that separates stable fundamentals (still right in five years) from fast-moving tooling (LangGraph, MCP, A2A, vector DBs, eval platforms). Updates to fast-moving tools don't break the curriculum. Six months from now, the concept pages still work.

👤 Who this is for

🛠️ Software engineers moving from chatbots to real agentic systems with tools, memory, and orchestration.
🧪 ML/AI practitioners who know the modeling side and want a structured map of agent patterns, protocols, and production concerns.
🎓 Advanced learners who already understand transformers, embeddings, and Python at a working level and want to go from notebook prototypes to evaluated, observable, deployable agents.

If you've never written Python or used an LLM API, this isn't the right starting point. Try a foundational LLM course first, then come back.

🧰 What you'll build

Working through the labs and projects, you'll end up with code worth keeping:

🤖 A ReAct-style agent from scratch, no framework, so you understand the loop before you abstract it away.
🔍 RAG systems including agentic RAG where retrieval is a tool, not a fixed pipeline.
👥 Multi-agent topologies — supervisor, hierarchical, swarm — and a clear sense of when each one is wrong for the job.
🔌 MCP servers and clients wired to real data sources.
🤝 A2A agents that discover and delegate to each other.
📊 Evaluated agents with traces, golden datasets, and judge-based scorers.
🚢 Production-shaped systems with cost engineering, latency budgets, streaming, human-in-the-loop, and safety guardrails.
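
As a rough illustration of the first item in the list above, the loop behind a ReAct-style agent can be sketched in a few lines of plain Python. This is not the lab's code: `scripted_llm` is a hypothetical stand-in for a real model call, the `Action: tool[input]` text format is an assumed convention, and `calculator` is an illustrative tool.

```python
import operator
import re

# Illustrative tool: evaluates "a op b" on integers without using eval().
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}

def calculator(expression: str) -> str:
    a, op, b = expression.split()
    return str(OPS[op](int(a), int(b)))

def scripted_llm(transcript: str) -> str:
    """Hypothetical stand-in for a model call: acts once, then answers."""
    if "Observation:" not in transcript:
        return "Thought: I should compute this.\nAction: calculator[6 * 7]"
    observation = transcript.rsplit("Observation:", 1)[1].strip()
    return f"Thought: I have the result.\nFinal Answer: {observation}"

# Assumed action syntax: "Action: tool_name[tool input]"
ACTION_RE = re.compile(r"Action: (\w+)\[(.*)\]")

def react(question, llm, tools, max_steps=5):
    """Alternate model replies and tool observations until a final answer."""
    transcript = f"Question: {question}"
    for _ in range(max_steps):
        reply = llm(transcript)
        transcript += "\n" + reply
        if "Final Answer:" in reply:
            return reply.split("Final Answer:", 1)[1].strip()
        match = ACTION_RE.search(reply)
        if match is None:
            return "stopped: no action or final answer"
        name, arg = match.groups()
        # Unknown tools become an observation so the model can recover.
        observation = tools[name](arg) if name in tools else f"unknown tool {name}"
        transcript += f"\nObservation: {observation}"
    return "stopped: step budget exhausted"

answer = react("What is 6 * 7?", scripted_llm, {"calculator": calculator})
print(answer)
```

The step budget matters as much as the loop itself: a model that never emits a final answer would otherwise run forever.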

⚡ Quickstart

Get a working agent running locally in a few minutes.

# 1. Clone
git clone https://github.com/MHHamdan/Agentic-AI-Engineer.git
cd Agentic-AI-Engineer

# 2. Set up the environment (uv recommended; pip works too)
uv sync                  # or: python -m venv .venv && source .venv/bin/activate && pip install -r requirements.txt

# 3. Add your API keys
cp .env.example .env     # then edit .env

# 4. Run the first lab
uv run jupyter lab labs/01-first-agent-from-scratch/lab.ipynb

Full setup details — Docker, local-model fallbacks via Ollama, troubleshooting — live in setup/.

🧭 Start here

New to the repo? Open these in order.

🗺️ docs/start-here.md — 5-minute tour of the repo and how to use it.
🤖 labs/01-first-agent-from-scratch/ — build a working agent before reading any theory.
📖 concepts/agents/what-is-an-agent.md — the vocabulary the rest of the repo uses.
🎯 learning-paths/ — pick a curated path based on your goal.

🎯 Choose your path

Skip the linear reading order and jump to what you actually need.

If you want to…	Start here	Status
🤖 Build your first real agent	01 — Foundations	✅ Complete
🔍 Build retrieval-augmented agents	02 — Agentic RAG	✅ v1 + v2
👥 Orchestrate multiple cooperating agents	03 — Multi-Agent Systems	✅ v1 + v2 + v3 + frameworks page
🔌 Wire agents to tools, data, and other agents	04 — Tool Protocols (MCP + A2A)	✅ Complete — all 7 modules
🪟 Get more out of the context window	05 — Context Engineering	✅ v1 complete — 6/6 modules
📊 Add tracing, evals, and observability	06 — Evaluation & Observability	✅ v1 + v2
🚢 Ship to production safely	07 — Production & Safety	✅ v1 complete — 8/8 modules
🧮 Understand the math behind it all	08 — Mathematical Foundations	✅ v1 complete — 13/13 pages (refreshed with code examples)
🏆 Build something portfolio-worthy	09 — Capstones	✅ v1 complete — 8 project briefs

Each path is a curated reading list across the rest of the repo — concepts, labs, recipes, patterns — not a duplicate folder of content. Every link in this table resolves to a real, authored README.

🗂️ Repository structure

Agentic-AI-Engineer/
├── 📚 docs/              Start-here pages, FAQ, community pages
├── 🎯 learning-paths/    Curated journeys (links into the rest of the repo)
├── 📖 concepts/          Short explainers - what something is and when to use it
├── 🧮 math-foundations/  Engineer-useful math with citations and Python code
├── 🧪 labs/              Hands-on guided exercises (notebooks + READMEs)
├── 🧰 recipes/           Copy-paste solutions to common problems
├── 🏛 patterns/          Architecture patterns with diagrams and tradeoffs
├── 🚀 projects/          Build Challenges and Capstone Projects
├── 💎 examples/          Minimal reference implementations
├── 🔧 tools/             Versioned snapshots of fast-moving frameworks
├── 📊 evaluation/        Eval frameworks, datasets, scorers
├── 🚢 production/        Deployment, cost, latency, streaming, concurrency
├── 🛡 security/          Threats, defenses, red-teaming
├── 🎨 diagrams/          Mermaid sources + rendered images
├── 📚 references/        Papers, books, talks, community resources
├── 🔤 glossary/          A-to-Z terminology
├── ⚙️ setup/             Environment setup
└── 📁 assets/            Working artifacts (not user-facing curriculum)

A more detailed walkthrough of every folder lives in docs/how-to-use-this-repo.md.

🛤️ Learning paths

Nine paths, each curating content across the rest of the repo. They overlap deliberately: the Multi-Agent path reuses concept pages from Foundations, the Production path leans on Evaluation, and so on.

flowchart TD
    F[🤖 01 &middot; Foundations<br/><i>Agent loop, ReAct, tools, memory</i>]
    R[🔍 02 &middot; Agentic RAG<br/><i>Retrieval as a tool</i>]
    M[👥 03 &middot; Multi-Agent Systems<br/><i>Supervisor, hierarchical, swarm</i>]
    T[🔌 04 &middot; Tool Protocols<br/><i>MCP + A2A</i>]
    C[🪟 05 &middot; Context Engineering<br/><i>Budgets, compression, selection</i>]
    E[📊 06 &middot; Evaluation &amp; Observability<br/><i>Tracing, judge ensembles, drift</i>]
    P[🚢 07 &middot; Production &amp; Safety<br/><i>Cost, latency, guardrails, red-team</i>]
    X[🧮 08 &middot; Mathematical Foundations<br/><i>Probability, embeddings, MDPs</i>]
    CAP[🏆 09 &middot; Capstones<br/><i>End-to-end portfolio builds</i>]

    F --> R
    F --> M
    F --> T
    R --> C
    M --> C
    T --> C
    R --> E
    M --> E
    C --> E
    E --> P
    F -.-> X
    R -.-> X
    M -.-> X
    P --> CAP
    E --> CAP

    classDef beginner fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px,color:#1b5e20
    classDef rag fill:#e3f2fd,stroke:#1565c0,stroke-width:2px,color:#0d47a1
    classDef multi fill:#e0f7fa,stroke:#00838f,stroke-width:2px,color:#006064
    classDef tool fill:#fff8e1,stroke:#f57f17,stroke-width:2px,color:#e65100
    classDef context fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#bf360c
    classDef eval fill:#f3e5f5,stroke:#6a1b9a,stroke-width:2px,color:#4a148c
    classDef prod fill:#ffebee,stroke:#c62828,stroke-width:2px,color:#b71c1c
    classDef math fill:#e8eaf6,stroke:#283593,stroke-width:2px,color:#1a237e
    classDef capstone fill:#fce4ec,stroke:#c2185b,stroke-width:3px,color:#880e4f

    class F beginner
    class R rag
    class M multi
    class T tool
    class C context
    class E eval
    class P prod
    class X math
    class CAP capstone

#	Path	Focus	Difficulty
01	Foundations	Agent loop, ReAct, tools, memory, first frameworks	🟢 Beginner-friendly
02	Agentic RAG	Retrieval as a tool, hybrid search, RAG failure modes	🟡 Intermediate
03	Multi-Agent Systems	Supervisor, hierarchical, swarm topologies	🟡 Intermediate
04	Tool Protocols (MCP + A2A)	Standardized integration with tools and other agents	🟡 Intermediate
05	Context Engineering	Token budgets, compression, selection strategies	🟡 Intermediate
06	Evaluation & Observability	Tracing, golden datasets, LLM-as-judge, RAG eval	🔴 Advanced
07	Production & Safety	Cost, latency, guardrails, deployment, red-teaming	🔴 Advanced
08	Mathematical Foundations	LM probability, embeddings, MDPs, policies, eval metrics	🟢 → 🔴
09	Capstones	End-to-end build challenges that combine everything	🔴 Advanced
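
If you read the solid arrows in the flowchart above as "read this first" edges, a reading order for any target path can be derived with a topological sort. The sketch below assumes that interpretation (the README itself only says the paths overlap deliberately) and leaves out the dotted edges into Mathematical Foundations, which are treated here as optional side reading. The `reading_order` helper is hypothetical.

```python
from graphlib import TopologicalSorter

# Solid-arrow edges from the flowchart, stored as path -> paths that point to it.
PREREQS = {
    "02 Agentic RAG": {"01 Foundations"},
    "03 Multi-Agent Systems": {"01 Foundations"},
    "04 Tool Protocols": {"01 Foundations"},
    "05 Context Engineering": {
        "02 Agentic RAG", "03 Multi-Agent Systems", "04 Tool Protocols",
    },
    "06 Evaluation & Observability": {
        "02 Agentic RAG", "03 Multi-Agent Systems", "05 Context Engineering",
    },
    "07 Production & Safety": {"06 Evaluation & Observability"},
    "09 Capstones": {"06 Evaluation & Observability", "07 Production & Safety"},
}

def reading_order(goal, prereqs=PREREQS):
    """Return the goal path and everything upstream of it, upstream paths first."""
    needed, stack = set(), [goal]
    while stack:  # walk backwards from the goal to collect all ancestors
        node = stack.pop()
        if node not in needed:
            needed.add(node)
            stack.extend(prereqs.get(node, ()))
    subgraph = {node: prereqs.get(node, set()) & needed for node in needed}
    return list(TopologicalSorter(subgraph).static_order())

order = reading_order("07 Production & Safety")
print(order)
```

Several valid orders exist for the middle paths, so the exact sequence printed may vary; only the relative ordering along each arrow is guaranteed.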

📐 How the content is organized

Five content types, each doing one job well.

Type	What it answers	Length	Where it lives
📖 Concept	What is this and when do I use it?	~10-min read	`concepts/`
🧪 Lab	Walk me through building this hands-on.	30 to 120 min, notebook + README	`labs/`
🧰 Recipe	I have this specific problem — what's the fix?	Copy-paste, 5-min read	`recipes/`
🏛 Pattern	Which architecture should I use, and why?	Diagram + tradeoffs	`patterns/`
🚀 Project	Let me build something substantial.	Hours to days	`projects/`

Two conventions used in this repo for non-tutorial work: Build Challenges are smaller, time-boxed builds living inside paths or labs, and Projects are larger end-to-end builds in projects/. Neither is called "homework" — this is a public resource, not a classroom.

🧮 Mathematical foundations

The math is here because it makes you a better engineer, not because it's a textbook. All 13 pages are authored, refreshed, and ready to read. Every page follows the same template:

✨ Why this matters for agentic AI — the engineering motivation in two or three sentences.
📐 The equation — clean GitHub-rendered LaTeX, every symbol defined immediately below.
🗣️ How to read this equation — a plain-language walkthrough.
💡 Mathematical intuition — the underlying ideas.
🔧 Where this appears in agentic systems — specific connections to repo content.
🐍 Code example — a minimal, executable Python snippet.
⚠️ Common mistakes — failure modes engineers actually run into.
🔗 Repo cross-references — direct links into concepts, labs, and patterns.
🧭 Related pages — what to read next.
📚 References — papers and textbooks with one-sentence relevance notes.

What's covered (all 13 ✅):

#	Page	Equation anchor
01	Language model probability	$p(x_t \mid x_{<t}; \theta)$
02	Embeddings and vector similarity	$\cos(\mathbf{u}, \mathbf{v})$
03	RAG formulation as marginalization	$p(y \mid x) = \sum_z p(y \mid x, z) \, p(z \mid x)$
04	Agents as policies	$\pi_\theta(a_t \mid s_t)$
05	MDP / POMDP intuition	$(\mathcal{S}, \mathcal{A}, P, R, \gamma)$
06	The ReAct loop, formalized	$(\tau_t, a_t, \text{stop}_t) \sim \pi_\theta$
07	Tool selection as function selection	$a_t \sim \pi_\theta$ over $\mathcal{A}$
08	Planning and search	BFS / A* / MCTS
09	Memory models	$M = M_s \cup M_w \cup M_l$
10	Multi-agent coordination graphs	$G = (V, E)$
11	Evaluation metrics	P, R, F1, faithfulness, ECE
12	Uncertainty and safety	$H(p)$, calibration, abstention
13	Context-window optimization	$\max \sum v_i x_i \text{ s.t. } \sum c_i x_i \leq B$
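
To give a feel for how one of these anchors turns into code, here is a small numeric sketch of row 03, the RAG marginalization $p(y \mid x) = \sum_z p(y \mid x, z) \, p(z \mid x)$. The document names and all probabilities are made up for illustration, and `marginal_answer_prob` is a hypothetical helper, not a function from the repo.

```python
# Hypothetical retriever distribution p(z | x) over three candidate documents.
retrieval_probs = {"doc_a": 0.6, "doc_b": 0.3, "doc_c": 0.1}

# Hypothetical generator likelihood p(y | x, z) of one fixed answer y per document.
answer_probs = {"doc_a": 0.9, "doc_b": 0.2, "doc_c": 0.05}

def marginal_answer_prob(p_z, p_y_given_z):
    """Sum p(y | x, z) * p(z | x) over every retrieved document z."""
    if abs(sum(p_z.values()) - 1.0) > 1e-9:
        raise ValueError("retrieval probabilities must sum to 1")
    return sum(p_y_given_z[z] * p_z[z] for z in p_z)

p_y = marginal_answer_prob(retrieval_probs, answer_probs)

# Per-document contributions show which document the answer depends on.
contributions = {z: answer_probs[z] * retrieval_probs[z] for z in retrieval_probs}
top_doc = max(contributions, key=contributions.get)

print(round(p_y, 3), top_doc)
```

With these numbers almost all of the answer's probability comes from `doc_a`, which is the kind of observation that tells you whether a bad answer is a retrieval problem or a generation problem.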

A symbol-and-notation cheat sheet is at math-foundations/notation.md — one source of truth for $\pi$, $s$, $a$, $\theta$, $z$, and friends.

Math pages are cross-linked from the concept pages, so you can read either track first. None require more than undergraduate probability and linear algebra. The full landing page with reading approaches is at math-foundations/README.md.
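
As another example of an equation that maps directly to code, row 13 in the table above, $\max \sum v_i x_i \text{ s.t. } \sum c_i x_i \leq B$, is a 0/1 knapsack problem. The sketch below solves it exactly with dynamic programming for integer token costs. The chunk names, values, and costs are invented for illustration, and `select_chunks` is a hypothetical helper; how you assign a value $v_i$ to a chunk is the hard part and is not addressed here.

```python
def select_chunks(chunks, budget):
    """Pick chunks maximizing total value within a token budget.

    chunks: list of (name, value, token_cost) with integer token_cost.
    Returns (best_total_value, names_of_chosen_chunks).
    """
    # best[b] = (total value, chosen indices) achievable with at most b tokens.
    best = [(0.0, ())] * (budget + 1)
    for i, (_, value, cost) in enumerate(chunks):
        # Iterate budgets downward so each chunk is used at most once (x_i in {0, 1}).
        for b in range(budget, cost - 1, -1):
            candidate = best[b - cost][0] + value
            if candidate > best[b][0]:
                best[b] = (candidate, best[b - cost][1] + (i,))
    total, picked = best[budget]
    return total, [chunks[i][0] for i in picked]

# Illustrative candidates: (name, value v_i, token cost c_i).
candidates = [
    ("system_prompt", 10.0, 300),
    ("tool_schemas", 6.0, 500),
    ("retrieved_doc", 7.0, 600),
    ("chat_history", 4.0, 400),
]

total_value, chosen = select_chunks(candidates, budget=1200)
print(total_value, chosen)
```

The table is O(n·B) in size, which is fine for budgets in the thousands of tokens; for very large budgets a greedy value-per-token heuristic is a common approximation.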

🌳 Stable vs. fast-changing content

Agentic AI moves fast. The shape of this repo reflects that.

flowchart TD
    subgraph Stable["🟢 Stable - months to years"]
        direction TB
        S1[📖 concepts/]
        S2[🧮 math-foundations/]
        S3[🏛 patterns/]
    end

    subgraph SlowMoving["🟡 Slow-moving - quarters"]
        direction TB
        M1[🧰 recipes/]
        M2[🚀 projects/]
        M3[📊 evaluation/]
    end

    subgraph FastChanging["🔴 Fast-changing - weeks to months"]
        direction TB
        F1[🔧 tools/]
        F2[💎 examples/]
        F3[🚢 production/ deployment specifics]
    end

    Stable --> SlowMoving
    SlowMoving --> FastChanging

    classDef stable fill:#c8e6c9,stroke:#1b5e20,stroke-width:2px,color:#1b5e20
    classDef slow fill:#fff9c4,stroke:#f57f17,stroke-width:2px,color:#e65100
    classDef fast fill:#ffcdd2,stroke:#b71c1c,stroke-width:2px,color:#b71c1c
    classDef sgStable fill:#e8f5e9,stroke:#2e7d32,stroke-width:3px,color:#1b5e20
    classDef sgSlow fill:#fffde7,stroke:#f9a825,stroke-width:3px,color:#e65100
    classDef sgFast fill:#ffebee,stroke:#c62828,stroke-width:3px,color:#b71c1c

    class S1,S2,S3 stable
    class M1,M2,M3 slow
    class F1,F2,F3 fast
    class Stable sgStable
    class SlowMoving sgSlow
    class FastChanging sgFast

Tier	Update cadence	What lives here
🟢 Stable	Years	The ReAct loop, RAG marginalization, supervisor vs swarm, eval metric definitions, safety theory
🟡 Slow-moving	6 to 12 months	Architecture patterns, agentic-RAG strategies, eval workflows
🔴 Fast-changing	Weeks to months	LangGraph APIs, LangSmith UI, MCP spec revisions, A2A SDKs, vector-DB pricing, model names

Anything in 🔴 territory carries a verification badge — see below.

✅ Tool-version verification policy

Every page in tools/ and every code snippet that depends on a specific library version carries a header like this:

> 🔴 Tool snapshot — <tool> <version>, verified <YYYY-MM-DD>
> Source: <official docs / changelog / spec link>

Concrete examples of how this policy is applied (verified at the time of writing):

Tool / Spec	Status	Source
🔌 MCP specification	Current stable: 2025-11-25. A release candidate dated 2026-07-28 was announced on May 21, 2026.	modelcontextprotocol.io/specification/2025-11-25, MCP blog
🤝 A2A protocol	v1.0 released; protocol donated to the Linux Foundation in June 2025.	a2a-protocol.org/latest/, announcing-1.0
🔗 LangGraph	1.0 GA (Oct 2025) — first stable major release. `langgraph.prebuilt` is deprecated in favor of `langchain.agents`.	LangChain changelog
🦜 LangChain	1.0 GA (Oct 2025) — `create_agent` abstraction, middleware system.	LangChain changelog
📊 LangSmith, Google ADK, CrewAI, AutoGen, vector DBs	Each carries its own verified-as-of date in `tools/`.	Linked per page.

Verified-as-of dates are refreshed during routine maintenance sweeps (tracked in CHANGELOG.md). If you spot stale information, please open an issue with the stale-tool-version label.
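
A maintenance sweep like this could be partly automated. The sketch below parses the snapshot header format shown above and flags pages whose verification date is older than a threshold. It is an assumption-laden illustration, not the repo's tooling: the 90-day threshold, the function names, and the sample page (tool name, version, date, and URL) are all hypothetical.

```python
import re
from datetime import date

# Matches "Tool snapshot <separator> <tool> <version>, verified <YYYY-MM-DD>".
HEADER_RE = re.compile(
    r"Tool snapshot\s+\S+\s+(?P<tool>.+?),\s+verified\s+(?P<verified>\d{4}-\d{2}-\d{2})"
)

def snapshot_age_days(page_text, today):
    """Return days since the verified date, or None if no header is found."""
    match = HEADER_RE.search(page_text)
    if match is None:
        return None
    return (today - date.fromisoformat(match["verified"])).days

def is_stale(page_text, today, max_age_days=90):
    """Treat a page as stale if the header is missing or too old (assumed policy)."""
    age = snapshot_age_days(page_text, today)
    return age is None or age > max_age_days

# Hypothetical page content; \U0001F534 is the red circle, \u2014 the header's dash.
sample_page = (
    "> \U0001F534 Tool snapshot \u2014 ExampleTool 1.0, verified 2025-01-15\n"
    "> Source: https://example.com/changelog\n"
)

print(snapshot_age_days(sample_page, date(2025, 3, 1)))
print(is_stale(sample_page, date(2025, 6, 1)))
```

Passing `today` explicitly instead of calling `date.today()` inside the function keeps the check deterministic and easy to test.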

📊 Current state

🎉 Repo v1 phase is structurally complete. All nine paths have substantial content; six are fully complete at v1; three have multiple revisions (v1, v2, v3) plus extended materials.

Substantial-content paths (nine of nine):

✅ Path 01 — Foundations: complete
✅ Path 02 — Agentic RAG: v1 + v2 shipped
✅ Path 03 — Multi-Agent Systems: v1 + v2 (6 patterns) + v3 (3 capstones) + frameworks comparison page (9 frameworks)
✅ Path 04 — Tool Protocols (MCP + A2A): all 7 modules complete
✅ Path 05 — Context Engineering: v1 complete, 6 of 6 modules
✅ Path 06 — Evaluation & Observability: v1 + v2 complete
✅ Path 07 — Production & Safety: v1 complete, 8 of 8 modules
✅ Path 08 — Mathematical Foundations: v1 complete, all 13 pages authored and refreshed with Python code examples
✅ Path 09 — Capstones: v1 complete, all 8 project briefs

Future work is continuous improvement (depth, breadth, examples, recipes, references), not gap-filling.

Supporting infrastructure:

🧪 50 lab notebooks (all pre-executed; outputs visible on GitHub)
🏛 12 of 12 top-level architecture patterns authored
📖 concepts/ (~71 files): agents/, context/, evaluation/, memory/, multi-agent/, rag/, tools/ — multiple subdirs complete
🚢 production/ (cost engineering, latency, streaming) — 5 deep pages
🛡 security/ (prompt injection, tool abuse, data exfiltration, safety policy, red-teaming) — 5 deep pages
🔤 glossary/terms.md — 65+ A-to-Z entries cross-linked to canonical sources

Currently scaffold-state — community contribution welcomed:

💎 examples/ — minimal reference implementations (placeholder)
🧰 recipes/ — copy-paste solutions to specific problems (placeholder)
📚 references/ — curated reading lists (placeholder; concept-page inline citations cover the gap)
📊 evaluation/ — eval frameworks/datasets/scorers (the conceptual side is already in concepts/evaluation/)
📝 quizzes/ — knowledge checks across the curriculum (31 files; partial coverage)

For each scaffold-state folder, the folder's own README documents what's planned and where related content currently lives. The curriculum is usable now; the supporting folders deepen it over time.

Full release history with verification fingerprints lives in CHANGELOG.md.

🎨 Diagrams

Architecture and concept diagrams are written as Mermaid in diagrams/ with rendered SVG/PNG committed alongside the source. Inline Mermaid blocks (like the two above) render natively on GitHub. The full list of diagrams currently in the repo, with descriptions, lives in diagrams/README.md.

🤝 Community and contributions

This is built to be a community resource, not a one-author site. Useful contributions include:

🧰 New recipes for problems you've actually hit in production.
🏛 New patterns or comparison tables.
🔧 Updating a tools/ page when a framework ships a breaking change.
🌐 Translating a concept page.
🐛 Filing issues when something is unclear, wrong, or stale.
🏆 Adding your project to the community showcase (planned docs/community/showcase.md).

The contribution workflow, templates for each content type, and the style guide are in CONTRIBUTING.md. Good first issues are labeled good-first-issue.

We follow a Code of Conduct — please read it before posting.

📚 References and further reading

Curated reading lives in references/, organized by type. The folder currently scaffolds four planned pages:

📄 references/papers.md — foundational papers (ReAct, RAG, Toolformer, Reflexion, and so on) with citations (planned)
📖 references/books.md — books that have aged well (planned)
🎤 references/talks.md — conference talks worth your time (planned)
👥 references/community.md — blogs, repos, and people worth following (planned)

In the interim, every concept and pattern page cites its sources inline. The references/ folder is where those citations will be consolidated into curated reading lists.

External references cited in this README:

🔌 Model Context Protocol — modelcontextprotocol.io, blog.modelcontextprotocol.io
🤝 Agent2Agent (A2A) Protocol — a2a-protocol.org
🦜 LangGraph & LangChain changelog — changelog.langchain.com
📘 LangChain docs — docs.langchain.com

📝 Citation

If you use this material in research or teaching, please cite the repo via the CITATION.cff file. A BibTeX snippet is also provided there.

⚖️ License

This repository uses a dual license.

💻 Code (Python, notebooks, scripts, configs) is licensed under Apache License 2.0. You can use it commercially, modify it, and distribute it, with attribution and a patent grant.
📖 Educational prose, diagrams, and other written content are licensed under Creative Commons Attribution 4.0 (CC-BY-4.0). You can reuse and adapt them with attribution.

When in doubt, attribute. When attributing, link back to this repo.

🌱 Built in the open. Maintained by the community. PRs, issues, and "this is wrong" comments all welcome.

⭐ Star the repo · 🍴 Fork & adapt · 🐛 Open an issue
