How much does zuzu.codes cost?

The starter track is free — read all lessons and practice for free. Full access to every track (current and future) is $14.99/month. Cancel anytime.

How long does each track take?

Each track is designed as a 30-day challenge — one lesson per day, about 15 minutes each. Go at your own pace, but the structure is built around daily consistency.

What's the lesson format?

Each lesson is a student-teacher dialogue with code examples, followed by a hands-on code challenge in an in-browser editor. You read, you understand, then you write real code.

Do I need prior coding experience?

Our beginner track starts from absolute zero — no prior experience needed. Advanced tracks build on earlier ones, and the platform tells you exactly where to start.

How is zuzu.codes different from freeCodeCamp or Codecademy?

zuzu.codes uses a structured 30-day track format with dialogue-based teaching, an in-browser code editor, and gamification (XP, streaks, progress tracking). The format builds genuine understanding through daily practice.

Clean a Treatment Group Label with Python String Methods — Python For Researchers

Day 4 · ~12 min●

The SurveyMonkey export arrived. The treatment group column has " Control ", "Treatment_A", and "TREATMENT_B" in different rows. What happens when you run your analysis on that?

format_respondent from Day 3 would print them as-is. And if I try to group by treatment, they'd all land in separate buckets because of the different casing and whitespace.

Exactly the problem. Three steps fix it: strip() removes leading and trailing whitespace, lower() makes everything lowercase, and replace(" ", "_") standardises separators. Chained left to right:

python

raw = " Control "
clean = raw.strip().lower().replace(" ", "_")
# clean = "control"

Do all string methods return new strings? Or do they change the original?

All string methods return new strings — Python strings are immutable. raw.strip() never changes raw; it hands back a cleaned copy. That's why chaining works: each method takes the previous method's output as its input. The original raw is untouched.

So I can call clean_group_label inside format_respondent to get a normalised label before formatting?

That's exactly the Week 2 pipeline taking shape. Clean first, format second:

python

def clean_group_label(raw: str) -> str:
    result = raw.strip().lower().replace(" ", "_")
    print(f"Cleaned: {result}")
    return result

Three methods and the entire codebook normalisation problem is solved. That would have taken me a macro in SPSS.

The silent trap: replace(" ", "_") only targets spaces, not tabs or non-breaking spaces. If the export uses unusual whitespace, strip() handles the edges but interior oddities survive. For production pipelines, add a re.sub pass — but strip().lower().replace() covers 99% of SurveyMonkey exports.

String Cleaning Pipeline

Three chained methods, one clean label.

Method	What it does	Example
`.strip()`	removes leading + trailing whitespace	`" control "` → `"control"`
`.lower()`	lowercases all characters	`"Control"` → `"control"`
`.replace(a, b)`	replaces every occurrence of `a` with `b`	`"control group"` → `"control_group"`

Why order matters

lower() before replace() means you catch " Control " and " CONTROL " with the same replace call. Reverse the order and uppercase variants slip through.

Immutability

Strings never change in place. raw.strip() returns a new string — raw is always the original.

Day 4 · ~12 min●

The SurveyMonkey export arrived. The treatment group column has " Control ", "Treatment_A", and "TREATMENT_B" in different rows. What happens when you run your analysis on that?

format_respondent from Day 3 would print them as-is. And if I try to group by treatment, they'd all land in separate buckets because of the different casing and whitespace.

python

raw = " Control "
clean = raw.strip().lower().replace(" ", "_")
# clean = "control"

Do all string methods return new strings? Or do they change the original?

So I can call clean_group_label inside format_respondent to get a normalised label before formatting?

That's exactly the Week 2 pipeline taking shape. Clean first, format second:

python

def clean_group_label(raw: str) -> str:
    result = raw.strip().lower().replace(" ", "_")
    print(f"Cleaned: {result}")
    return result

Three methods and the entire codebook normalisation problem is solved. That would have taken me a macro in SPSS.

String Cleaning Pipeline

Three chained methods, one clean label.

Method	What it does	Example
`.strip()`	removes leading + trailing whitespace	`" control "` → `"control"`
`.lower()`	lowercases all characters	`"Control"` → `"control"`
`.replace(a, b)`	replaces every occurrence of `a` with `b`	`"control group"` → `"control_group"`

Why order matters

lower() before replace() means you catch " Control " and " CONTROL " with the same replace call. Reverse the order and uppercase variants slip through.

Immutability

Strings never change in place. raw.strip() returns a new string — raw is always the original.

String Cleaning Pipeline

Three chained methods, one clean label.

Method	What it does	Example
`.strip()`	removes leading + trailing whitespace	`" control "` → `"control"`
`.lower()`	lowercases all characters	`"Control"` → `"control"`
`.replace(a, b)`	replaces every occurrence of `a` with `b`	`"control group"` → `"control_group"`

Why order matters

lower() before replace() means you catch " Control " and " CONTROL " with the same replace call. Reverse the order and uppercase variants slip through.

Immutability

Strings never change in place. raw.strip() returns a new string — raw is always the original.

String Cleaning Pipeline

Three chained methods, one clean label.

Method	What it does	Example
`.strip()`	removes leading + trailing whitespace	`" control "` → `"control"`
`.lower()`	lowercases all characters	`"Control"` → `"control"`
`.replace(a, b)`	replaces every occurrence of `a` with `b`	`"control group"` → `"control_group"`

Why order matters

lower() before replace() means you catch " Control " and " CONTROL " with the same replace call. Reverse the order and uppercase variants slip through.

Immutability

Strings never change in place. raw.strip() returns a new string — raw is always the original.