Three independent classifications run sequentially — q1 waits for the network, q2 waits, q3 waits. Total time = sum of latencies. In parallel — all three fire at once. Total time = max of latencies.
from concurrent.futures import ThreadPoolExecutor

from pydantic_ai import Agent  # assumes a pydantic-ai Agent; model is defined earlier

def call(prompt):
    return Agent(model).run_sync(prompt).output

prompts = ["Reply: alpha", "Reply: beta", "Reply: gamma"]
with ThreadPoolExecutor(max_workers=3) as pool:
    answers = list(pool.map(call, prompts))

3 calls, all in flight at once. The wall-clock time is roughly the slowest single call, not the sum.
Why threads, not asyncio?
Both work. ThreadPoolExecutor is simpler — you write blocking code (.run_sync) and it parallelizes for you. asyncio requires async def and await everywhere. For LLM clients (network-bound, I/O-bound), threads are fine and read like normal code.
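For comparison, a minimal asyncio sketch, assuming pydantic-ai's async Agent.run (the async counterpart of the run_sync used above) and the same model variable:

import asyncio

from pydantic_ai import Agent  # assumed: same Agent and model as the threaded version

async def call_async(prompt):
    result = await Agent(model).run(prompt)  # .run is the async counterpart of .run_sync
    return result.output

async def main():
    prompts = ["Reply: alpha", "Reply: beta", "Reply: gamma"]
    # gather fires all three coroutines concurrently, like the thread pool does
    return await asyncio.gather(*(call_async(p) for p in prompts))

answers = asyncio.run(main())

Same wall-clock win, but every function in the call chain has to become async.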
When NOT to parallelize?
When the calls depend on each other. If turn 2 needs turn 1's answer, parallel makes no sense. Multi-turn conversations are sequential. Independent classifications, eval batches, embedding batches — those parallelize cleanly.
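For contrast, here is what a dependent chain looks like (a sketch reusing the call helper defined above; the prompts are made up). Each step consumes the previous step's output, so no two calls can overlap:

# Sequential by necessity: step 2 cannot start until step 1 returns.
draft = call("Draft a one-line tagline for a coffee shop")
critique = call(f"Critique this tagline in one sentence: {draft}")
final = call(f"Rewrite the tagline to address the critique: {critique}")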
from concurrent.futures import ThreadPoolExecutor
import time

def call(prompt):
    return Agent(model).run_sync(prompt).output

prompts = ["q1", "q2", "q3"]
start = time.time()
with ThreadPoolExecutor(max_workers=3) as pool:
    answers = list(pool.map(call, prompts))
elapsed = time.time() - start
print(f"3 calls in {elapsed:.1f}s")  # roughly the slowest single call

LLM calls are I/O-bound — most time is spent waiting for the network. Threads release the GIL during I/O, so multiple threads make progress concurrently. CPU-bound code wouldn't benefit (the GIL lets only one thread execute Python bytecode at a time), but LLM calls aren't CPU-bound.
ThreadPoolExecutor(max_workers=N)
max_workers=N caps the number of concurrent in-flight calls. N parallel calls consume N quota slots, the same total as sequential: parallelism saves latency, not cost. (For lessons we keep N=2-3 to avoid burning quota on demonstration.)
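To see the cap in action, a small sketch (reusing the call helper from above; the prompt strings are made up). With more prompts than workers, pool.map queues the extras, so at most max_workers calls are in flight at once:

from concurrent.futures import ThreadPoolExecutor

# 6 prompts but max_workers=2: at most 2 calls in flight; the other 4 wait in the queue.
prompts = [f"Reply: item {i}" for i in range(6)]
with ThreadPoolExecutor(max_workers=2) as pool:
    answers = list(pool.map(call, prompts))
# Wall-clock time is roughly 3 "waves" (ceil(6 / 2)), each as slow as its slowest call.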
pool.map preserves input ordering — answers[i] corresponds to prompts[i]. If you use pool.submit + as_completed, results come back in completion order — fast calls first. Pick based on whether you want ordered results or first-result-first.
from concurrent.futures import as_completed

with ThreadPoolExecutor(max_workers=3) as pool:
    futures = [pool.submit(call, p) for p in prompts]
    for future in as_completed(futures):
        try:
            answer = future.result()
        except Exception as e:
            print(f"call failed: {e}")

One failed call doesn't crash the others. Each future carries its own exception state.