Agents and evals — the heart of the track. Ten questions, 80% to pass.
When you pass, week 4 ramps to production patterns: typed outputs, retry-on-bad-output extended, multi-step planning chains, and a capstone with real Composio.
| Lesson | Concept |
|---|---|
| 15 | What's an agent? — LLM + tools + loop, terminator = no more tool calls |
| 16 | Agent loop in code — pydantic-AI's Agent runs the loop, result.all_messages() to inspect |
| 17 | Multi-tool agent — 3 tools, model picks per task |
| 18 | Eval suite — (input, expected) pairs, pass-rate threshold |
| 19 | Iteration via evals — change one thing, re-score, attribute the win |
| 20 | Synthesis — agent + 2 tools + 3-case eval, ≥ 2/3 pass |
Week 4: typed outputs, retry-on-bad-output extended, scoring rubrics, multi-step planning chains, agent + Composio, final integration synthesis.
Create a free account to get started. Paid plans unlock all tracks.
Agents and evals — the heart of the track. Ten questions, 80% to pass.
When you pass, week 4 ramps to production patterns: typed outputs, retry-on-bad-output extended, multi-step planning chains, and a capstone with real Composio.
| Lesson | Concept |
|---|---|
| 15 | What's an agent? — LLM + tools + loop, terminator = no more tool calls |
| 16 | Agent loop in code — pydantic-AI's Agent runs the loop, result.all_messages() to inspect |
| 17 | Multi-tool agent — 3 tools, model picks per task |
| 18 | Eval suite — (input, expected) pairs, pass-rate threshold |
| 19 | Iteration via evals — change one thing, re-score, attribute the win |
| 20 | Synthesis — agent + 2 tools + 3-case eval, ≥ 2/3 pass |
Week 4: typed outputs, retry-on-bad-output extended, scoring rubrics, multi-step planning chains, agent + Composio, final integration synthesis.