How much does zuzu.codes cost?

The starter track is free — read all lessons and practice for free. Full access to every track (current and future) is $14.99/month. Cancel anytime.

How long does each track take?

Each track is designed as a 30-day challenge — one lesson per day, about 15 minutes each. Go at your own pace, but the structure is built around daily consistency.

What's the lesson format?

Each lesson is a student-teacher dialogue with code examples, followed by a hands-on code challenge in an in-browser editor. You read, you understand, then you write real code.

Do I need prior coding experience?

Our beginner track starts from absolute zero — no prior experience needed. Advanced tracks build on earlier ones, and the platform tells you exactly where to start.

How is zuzu.codes different from freeCodeCamp or Codecademy?

zuzu.codes uses a structured 30-day track format with dialogue-based teaching, an in-browser code editor, and gamification (XP, streaks, progress tracking). The format builds genuine understanding through daily practice.

Group Respondents by Treatment Condition with a Dict — Python For Researchers

Group Respondents by Treatment Condition with a Dict — Python For Researchers | zuzu.codes

Day 13 · ~12 min●

You have 300 eligible respondents and need to count how many are in each treatment group. In SPSS, that's a Frequencies table or a pivot. What's the Python equivalent?

find_first_outlier from Day 12 shows me how to walk the list. For grouping I'd need... a dict? With group names as keys and lists of respondents as values?

Exactly — a dict of lists. Each treatment group name becomes a key; each value is the list of respondents in that group. The trick is .get(key, []) to initialise a new group on first encounter:

python

respondents = [{"id": "R_001", "treatment_group": "control", "outcome": 4.5}]
groups = {}
for r in respondents:
    g = r["treatment_group"]
    groups[g] = groups.get(g, []) + [r]

Why groups.get(g, []) and not just groups[g]? I thought dict access used brackets.

Bracket access raises KeyError when the key doesn't exist — the first time a new group appears, it hasn't been added yet. .get(g, []) returns an empty list as the default, so adding [r] creates the new entry cleanly. After the loop, groups.keys() lists all groups, groups.values() lists all respondent arrays, groups.items() gives both together:

python

def group_by_treatment(respondents: list) -> dict:
    groups = {}
    for r in respondents:
        g = r["treatment_group"]
        groups[g] = groups.get(g, []) + [r]
    print(f"Groups found: {list(groups.keys())}")
    return groups

So groups["control"] is the list of all control respondents, and len(groups["control"]) is my N for that group. That's the entire pivot table, in one function.

The entire pivot table, plus you can pass each group's list directly into a mean calculation. No screenshot required.

A dict is just a codebook that builds itself from the data. I didn't expect to like this as much as I do.

One gotcha: if the treatment_group field is inconsistently capitalised — "Control" in some rows and "control" in others — they land in separate buckets. Run clean_group_label from Day 4 on r["treatment_group"] before using it as the key.

Dicts as Codebook Entries

A dict maps keys to values — like a codebook maps variable names to their definitions.

Operation	Example	Result
`d[k]`	`groups["control"]`	value or `KeyError`
`d.get(k, default)`	`groups.get("control", [])`	value or `[]`
`d.keys()`	`groups.keys()`	all group names
`d.values()`	`groups.values()`	all respondent lists
`d.items()`	`groups.items()`	(name, list) pairs

Grouping pattern

python

groups = {}
for item in items:
    key = item[field]
    groups[key] = groups.get(key, []) + [item]

This pattern appears in every grouping operation — by treatment group, by wave, by journal, by country.

Problem

Rosa has a list of filtered respondent dicts, each with a `"treatment_group"` field. Write `group_by_treatment(respondents)` that groups the respondents into a dict keyed by treatment group — `{"control": [...], "treatment_a": [...]}` — using `.get()` to handle new groups safely.

Tests

No output yet. Use print() to log values.

Day 13 · ~12 min●

You have 300 eligible respondents and need to count how many are in each treatment group. In SPSS, that's a Frequencies table or a pivot. What's the Python equivalent?

find_first_outlier from Day 12 shows me how to walk the list. For grouping I'd need... a dict? With group names as keys and lists of respondents as values?

Exactly — a dict of lists. Each treatment group name becomes a key; each value is the list of respondents in that group. The trick is .get(key, []) to initialise a new group on first encounter:

python

respondents = [{"id": "R_001", "treatment_group": "control", "outcome": 4.5}]
groups = {}
for r in respondents:
    g = r["treatment_group"]
    groups[g] = groups.get(g, []) + [r]

Why groups.get(g, []) and not just groups[g]? I thought dict access used brackets.

python

def group_by_treatment(respondents: list) -> dict:
    groups = {}
    for r in respondents:
        g = r["treatment_group"]
        groups[g] = groups.get(g, []) + [r]
    print(f"Groups found: {list(groups.keys())}")
    return groups

So groups["control"] is the list of all control respondents, and len(groups["control"]) is my N for that group. That's the entire pivot table, in one function.

The entire pivot table, plus you can pass each group's list directly into a mean calculation. No screenshot required.

A dict is just a codebook that builds itself from the data. I didn't expect to like this as much as I do.

Dicts as Codebook Entries

A dict maps keys to values — like a codebook maps variable names to their definitions.

Operation	Example	Result
`d[k]`	`groups["control"]`	value or `KeyError`
`d.get(k, default)`	`groups.get("control", [])`	value or `[]`
`d.keys()`	`groups.keys()`	all group names
`d.values()`	`groups.values()`	all respondent lists
`d.items()`	`groups.items()`	(name, list) pairs

Grouping pattern

python

groups = {}
for item in items:
    key = item[field]
    groups[key] = groups.get(key, []) + [item]

This pattern appears in every grouping operation — by treatment group, by wave, by journal, by country.