Search results come back in the engine's ranking. Sometimes that ranking is great; sometimes the top hit is technically on the right keywords but off-topic. How could you have an agent re-score each snippet for real semantic fit?
For each result, ask the agent how well the snippet matches the query, and have it return a rating like high, medium, or low?
Exactly. A Literal classifier with three levels, one call per result, collect the labels in a list:
```python
from typing import Literal

from pydantic_ai import Agent

agent = Agent(model, output_type=Literal["high", "medium", "low"])

scores = []
for r in results:
    prompt = f"Query: {query} | Snippet: {r['snippet']} | Rate fit"
    scores.append(agent.run_sync(prompt).output)

print(scores)
```
You're layering an opinion on top of the engine's mechanical ranking — the agent sees both pieces and judges fit in context.
So the agent sees both the query and the snippet on each call. It is not just classifying sentiment — it is comparing the two?
Exactly the shift. Week 2's classifier looked at one piece of text; today's classifier looks at two and asks "does the second answer the first?" The Literal still constrains the label to three values. The full function:
```python
from typing import Literal

from pydantic_ai import Agent

def score_result_semantic_match(query: str, count: int) -> list[str]:
    """Rate each search result's semantic fit to the query."""
    results = search(query, count=count)
    agent = Agent(model, output_type=Literal["high", "medium", "low"])
    scores = []
    for r in results:
        prompt = f"Query: {query} | Snippet: {r['snippet']} | Rate fit"
        scores.append(agent.run_sync(prompt).output)
    return scores
```
Returning a list of labels is nice — but how do I use it downstream? Ideally I'd know which snippet got which score.
Align by index. Because the loop walks results in order and append preserves order, scores[i] is the label for results[i]. If you wanted them zipped explicitly, a comprehension like list(zip(results, scores)) gives you pairs — or pass the scores into a later function that filters or sorts.
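To make the index pairing concrete, here is a small standalone sketch. The `results` and `scores` values below are hand-written stand-ins for what `search()` and the agent loop would actually produce:

```python
# Stand-in data: in the real flow these come from search() and the agent loop.
results = [
    {"snippet": "Install the package with pip install foo"},
    {"snippet": "A short history of package managers"},
    {"snippet": "Common pip install flags and usage"},
]
scores = ["high", "low", "medium"]  # scores[i] labels results[i]

# Explicit pairing: each tuple is (result, label).
pairs = list(zip(results, scores))

# Downstream filter: keep only the snippets the agent rated "high".
high_hits = [r["snippet"] for r, label in pairs if label == "high"]
print(high_hits)  # ['Install the package with pip install foo']
```

The same `pairs` list also feeds a sort or a threshold check later, so you never lose track of which label belongs to which snippet.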
So the list is the bridge from retrieval to ranking — I can sort, filter, or pick the highest-scoring snippet in the next step.
Exactly. Today's score list is tomorrow's ranking input. Three-value Literal scores are enough to separate signal from noise without overengineering the rubric.
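One way that ranking step could look, assuming a simple precedence map (the `RANK` dict and the sample data are illustrative, not part of the lesson's code):

```python
# Map the three Literal labels to sortable ranks; lower rank sorts first.
RANK = {"high": 0, "medium": 1, "low": 2}

# Stand-in outputs from the scoring step.
results = [
    {"snippet": "Loosely related background"},
    {"snippet": "Directly answers the query"},
    {"snippet": "Keyword match but off-topic"},
]
scores = ["medium", "high", "low"]

# Sort snippet/label pairs so "high" comes first, "low" last.
ranked = sorted(zip(results, scores), key=lambda pair: RANK[pair[1]])
best, best_label = ranked[0]
print(best["snippet"])  # Directly answers the query
```

Because `sorted` is stable, snippets with the same label keep the engine's original order, so the agent's opinion refines the mechanical ranking instead of replacing it.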
TL;DR: loop results through a Literal["high","medium","low"] agent, keeping score and snippet aligned by index.
- `Literal` three-level — high/medium/low is enough signal
- `agent.run_sync(prompt)` — prompt combines query + snippet
- `scores[i]` pairs with `results[i]`

| Input | Role |
|---|---|
| query | the target |
| snippet | the candidate |
Every call compares the two and returns one of three labels — the downstream code picks what to do with them.