Day 18 introduced result.usage().total_tokens — the per-call token count. In a batch, you sum across calls to know the total cost.
The pattern: track running total; print or log per-call; optionally stop early if you hit a budget:
from pydantic_ai import Agent

MAX_TOKENS = 10000
items = ["hello", "this is a longer item to classify", "short"]

agent = Agent("openai:gpt-4o-mini")  # any model string works; build the Agent once, not per call
results = []
total_tokens = 0

for item in items:
    if total_tokens > MAX_TOKENS:
        print(f"hit budget at {total_tokens}; stopping early")
        break
    result = agent.run_sync(f'Classify: "{item}". Reply: positive or negative.')
    cost = result.usage().total_tokens
    total_tokens += cost
    results.append((item, result.output.strip(), cost))
    print(f"  cost={cost} total={total_tokens}")

print(f"\nfinal total: {total_tokens} tokens across {len(results)} items")

What if I want a hard ceiling — never exceed N tokens?
Check before the call. Once a call has been made, the tokens are spent. The pre-check above (if total_tokens > MAX_TOKENS: break) stops before the next call, which is the right granularity.
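If the ceiling is strict, you can go one step further and estimate the upcoming call before making it. A sketch, assuming the rough rule of thumb of ~4 characters per token for English text; estimate_tokens is a hypothetical helper, not part of pydantic_ai, and the estimate ignores output tokens, so leave some headroom:

def estimate_tokens(text: str) -> int:
    return len(text) // 4  # crude heuristic: ~4 characters per token

for item in items:
    prompt = f'Classify: "{item}". Reply: positive or negative.'
    if total_tokens + estimate_tokens(prompt) > MAX_TOKENS:
        print(f"projected to exceed budget at {total_tokens}; stopping")
        break
    result = agent.run_sync(prompt)
    total_tokens += result.usage().total_tokens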
Why not skip the check and just run them all?
For tiny batches (5-10 items), no need. For large batches (100+), or scripts running on a schedule, a runaway prompt that's 10× longer than expected can blow your monthly budget in one run. The cost cap is cheap insurance.
Two questions come up in every batch script: how much am I spending so far, and how do I stop before the budget blows? First, the running total:
total_tokens = 0
for item in items:
    result = ask_call(item)  # ask_call: your own wrapper that returns an Agent run result
    total_tokens += result.usage().total_tokens
print(f"total: {total_tokens}")

The usage() object has:
.input_tokens — what you sent (prompt + system + history)
.output_tokens — what came back (the response)
.total_tokens — sum of input + output

Second, the cap:

MAX_TOKENS = 10000
for item in items:
    if total_tokens > MAX_TOKENS:
        break
    ...

The check is before the next call. Once a call has been made, the tokens are spent — no rolling back.
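Those same fields let you log the input/output split per call, which tells you which side of the bill dominates. A minimal sketch, using the same placeholder ask_call wrapper as above:

for item in items:
    result = ask_call(item)
    u = result.usage()
    print(f"in={u.input_tokens} out={u.output_tokens} total={u.total_tokens}")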
A softer version of the hard cap: warn at 80%, stop at 100%:
warned = False
for item in items:
    if total_tokens > MAX_TOKENS * 0.8 and not warned:
        print(f"warning: 80% of budget consumed ({total_tokens} tokens)")
        warned = True
    if total_tokens > MAX_TOKENS:
        break
    ...

Log the cost of each call to find expensive outliers:
for item in items:
    result = ask_call(item)
    cost = result.usage().total_tokens
    if cost > 500:  # "expensive" threshold; tune to your workload
        print(f"expensive call ({cost} tokens) on item: {item[:50]}")
    total_tokens += cost

If one item out of 100 takes 5× the average, your prompt is probably handling that case poorly. Either fix the prompt or skip those items, as in the sketch below.
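One way to skip them up front, assuming long inputs are the culprit (a sketch; the 2,000-character cutoff is an arbitrary placeholder, not a recommendation):

skipped = 0
for item in items:
    if len(item) > 2000:  # arbitrary length cutoff; tune to your data
        skipped += 1
        continue
    result = ask_call(item)
    total_tokens += result.usage().total_tokens
print(f"skipped {skipped} oversized items")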
Most LLM APIs charge differently for input vs output tokens (output usually 2-5× more expensive per token). For accurate cost forecasting:
def estimate_cost_usd(usage, input_per_million=0.5, output_per_million=1.5):
    # prices are dollars per million tokens
    return (
        (usage.input_tokens / 1e6) * input_per_million
        + (usage.output_tokens / 1e6) * output_per_million
    )

Numbers above are illustrative. Check your provider's pricing for the model you're using.
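Plugged into the batch loop (same placeholder ask_call as above; prices still illustrative):

total_usd = 0.0
for item in items:
    result = ask_call(item)
    total_usd += estimate_cost_usd(result.usage())
print(f"estimated spend: ${total_usd:.4f}")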
Retry-on-bad-output (yesterday) costs up to N× per item. Combined with self-consistency (5× per item) you get up to 5N× per item; with N=3 retries, that's 15×, which isn't unusual for high-stakes tasks. The cost tracker helps you notice when this stacks up.
total_tokens = 0
for item in items:
    label = classify(item, retry_attempts=3)       # up to 3 calls
    label_majority = classify_majority(item, n=5)  # 5 calls
    total_tokens += sum(...)                       # accumulate
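To make the accumulation concrete, one option is to have each helper return its token count alongside the label. A sketch, assuming classify and classify_majority (the placeholder helpers from yesterday) are written to return a (label, tokens_used) pair:

total_tokens = 0
for item in items:
    label, spent = classify(item, retry_attempts=3)
    majority_label, majority_spent = classify_majority(item, n=5)
    total_tokens += spent + majority_spent
print(f"total across retries and voting: {total_tokens}")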