A pipeline is replay-safe when running it twice over the same input produces the same final state as running it once. The whole pipeline is idempotent end-to-end.
```python
state = {}

def process(events, state):
    """Apply each event to state. Idempotent on event id."""
    for event in events:
        eid = event["id"]
        if eid in state:
            continue  # already applied — skip
        state[eid] = event["value"]

events = [
    {"id": "e1", "value": 10},
    {"id": "e2", "value": 20},
    {"id": "e3", "value": 30},
]

process(events, state)
state_after_run_1 = dict(state)

process(events, state)  # replay
state_after_run_2 = dict(state)

print(state_after_run_1 == state_after_run_2)
```

Expected: `True`. The second run is a no-op because every event ID is already in `state`.
This is just dedup with state instead of a set?
Same observation, slightly higher level. Dedup keeps a set of seen IDs. Replay safety means the final state itself encodes what's been done. They overlap: most replay-safe pipelines use dedup as one of the mechanisms.
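To make the contrast concrete, here is a minimal sketch (reusing the `events` list from the example above): dedup tracks seen IDs in a separate set, while the replay-safe version consults the state itself.

```python
# Dedup alone: a separate `seen` set remembers processed IDs.
seen = set()
totals = []
for event in events:
    if event["id"] in seen:
        continue
    seen.add(event["id"])
    totals.append(event["value"])  # breaks if `seen` is ever lost or out of sync

# Replay safety: the state itself records what has been done.
state = {}
for event in events:
    state.setdefault(event["id"], event["value"])  # first write wins
```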
Why is replay safety important if you have dedup?
Because dedup only catches the obvious duplicate (same `event.id`). Replay safety also covers non-idempotent state updates (counters that double, lists that grow on append) and external side effects (emails that send twice), none of which a seen-ID check alone prevents.
A pipeline that's replay-safe survives every recovery scenario. A pipeline that isn't has a hidden trap waiting.
A pipeline `P` is replay-safe when:

```
P(input)       == final_state
P(P(input))    == final_state   # running twice  = same
P(P(P(input))) == final_state   # running thrice = same
```

In database terms: the operation is idempotent.
Real-world scenarios that re-run a pipeline:

- a worker crashes mid-batch and restarts from its last checkpoint
- a message queue with at-least-once delivery redelivers events
- a backfill replays historical data over existing state
- an operator manually re-runs a job after a bug fix

Non-replay-safe pipelines turn each of these into a debugging nightmare. Replay-safe pipelines turn them into a quiet re-run.
Three common mechanisms (often combined):

1. Dedup by event ID. First-write-wins, recorded by `event_id`. The simplest replay-safe shape.

```python
if event_id in state:
    return  # already applied
state[event_id] = ...
```
2. Key by natural identity. The state itself is keyed by something derivable from the input. Re-running just overwrites the same row with the same value.

```python
# upsert by (date, user_id) — same input always lands in same row
db.upsert("daily_summary", key=(date, user_id), value=count)
```
3. Recompute from input. State is a function of the entire input set. No matter how many times you run, the result is the same.

```python
# state is fully derived from input — no incremental updates
state = {event["id"]: event["value"] for event in events}
```
state["count"] += 1 doubles on replaylist.append(...) duplicatessend_email(...) sends twiceThe fix is always one of: dedup by event ID, key by natural identity, recompute from input.
The trickiest layer. `send_email` is not idempotent — re-running sends a second email. Mitigations:

- pass an idempotency key so the provider dedupes the call (e.g., an Idempotency-Key header)
- record fired effects and guard each one: only send if `(event_id, action) not in sent`
- separate deciding from doing: compute the effects, persist them, fire them elsewhere

The last is what production systems do for high-stakes effects. The pipeline computes what would happen, persists that, and a separate process actually fires the side effects.
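A minimal sketch of the second mitigation, the `(event_id, action)` guard; `send_email` here stands in for any non-idempotent call, and the in-memory `sent` set would be a persisted table in production:

```python
sent = set()  # in production: a persisted record of fired effects

def send_email(user, body):
    print(f"emailing {user}: {body}")  # stand-in for the real side effect

def fire_once(event_id, action, user, body):
    # Guard each effect by (event_id, action): a replay sees the record
    # of the first run and skips the duplicate send.
    if (event_id, action) in sent:
        return
    send_email(user, body)
    sent.add((event_id, action))

fire_once("e1", "welcome_email", "u1", "hello")
fire_once("e1", "welcome_email", "u1", "hello")  # replay: no second email
```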
The canonical test is the one in this lesson: run twice over the same input, compare final state, assert equal. Add this test to any pipeline that processes events — it catches non-idempotent updates immediately.
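As a sketch, that test written against the `process` function from the example above:

```python
def test_replay_safe():
    state = {}
    events = [{"id": "e1", "value": 10}, {"id": "e2", "value": 20}]

    process(events, state)
    first = dict(state)

    process(events, state)  # replay the exact same input
    assert state == first   # fails immediately on += / append-style updates
```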