Extract Respondent IDs from Raw Text with Regex — Python For Researchers

Day 25 · ~12 min

The archive system exports respondent IDs embedded in a JSON-like text dump — {"id": "R_001", ...} repeated for each row. You need just the IDs. How do you extract them without parsing the full JSON?

top_respondents_by_outcome from Day 24 showed r.get("id"). But if I only have a raw text dump with no proper JSON structure, I can't use json.loads. I'd need to search for the pattern "id": "R_..." directly in the string.

That's exactly what regular expressions do. re.findall with a pattern returns every match as a list. The pattern r'"id":\s*"([^"]+)"' finds every occurrence of "id": "some_value" and captures just the value:

python

import re
raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'
ids = re.findall(r'"id":\s*"([^"]+)"', raw)
# ids = ['R_001', 'R_002']

What does ([^"]+) mean in the pattern? The brackets look like a list but it can't be.

[^"]+ is a character class. The caret ^ inside brackets means "not". So [^"] matches any character except a double-quote. The + means "one or more". The parentheses () capture the match so findall returns just the captured group, not the whole match. Together: "one or more non-quote characters, captured":

python

import re
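def extract_respondent_ids(raw_text: str) -> list:
    # Capture the value of every "id" field in the raw text
    ids = re.findall(r'"id":\s*"([^"]+)"', raw_text)
    # clean_group_label is not defined in this snippet; it is assumed to be a helper from an earlier lesson
    cleaned = [clean_group_label(i) for i in ids]
    print(f"Extracted {len(cleaned)} IDs")
    return cleaned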
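To see what the parentheses change, here is a small sketch that runs the same search with zero, one, and two capture groups. It reuses the sample string from earlier in the lesson; the variable names whole, captured, and pairs are only illustrative.

```python
import re

raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'

# No capture group: findall returns the whole matched text.
whole = re.findall(r'"id":\s*"[^"]+"', raw)

# One capture group: findall returns only what the group matched.
captured = re.findall(r'"id":\s*"([^"]+)"', raw)

# Two capture groups: findall returns one tuple per match.
pairs = re.findall(r'"id":\s*"([^"]+)",\s*"age":\s*(\d+)', raw)

print(whole)     # ['"id": "R_001"', '"id": "R_002"']
print(captured)  # ['R_001', 'R_002']
print(pairs)     # [('R_001', '29'), ('R_002', '22')]
```

Note that captured values are always strings, so the ages in pairs would still need int() before any arithmetic.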

So re.findall scans the entire text and returns all matches at once? No loop needed?

findall handles the loop internally. You declare the pattern, not the iteration.
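For comparison, the loop that findall hides can be written out by hand with re.finditer, which yields one match object per match. This is only a sketch to show the equivalence; ids_loop and ids_findall are illustrative names.

```python
import re

raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'
pattern = r'"id":\s*"([^"]+)"'

# The declarative version: one call, one list.
ids_findall = re.findall(pattern, raw)

# The explicit version: iterate over match objects and pull out group 1.
ids_loop = []
for match in re.finditer(pattern, raw):
    ids_loop.append(match.group(1))

print(ids_findall == ids_loop)  # True
```

The explicit loop is mostly useful when you also need match positions or want to stop early; for a plain list of values, findall is shorter.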

Regex is the part of Python I've been afraid of. But this pattern just reads as "find everything that looks like an id field."

That instinct is right: regex is readable once you learn the handful of metacharacters. Two traps to watch for: greedy versus lazy matching (+ versus +?), and forgetting the r prefix on the pattern string. Without the r, Python processes backslashes as its own escape sequences before the regex engine ever sees them.
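The greedy versus lazy trap is easiest to see on the lesson's own sample string. The sketch below swaps the [^"]+ character class for a dot, which matches quotes too, so the only thing stopping the match is how greedy the quantifier is. The names greedy and lazy are illustrative.

```python
import re

raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'

# Greedy: .+ grabs as much as it can, then backs up to the LAST quote in the text.
greedy = re.findall(r'"id":\s*"(.+)"', raw)

# Lazy: .+? grabs as little as it can, so it stops at the FIRST closing quote.
lazy = re.findall(r'"id":\s*"(.+?)"', raw)

print(len(greedy))  # 1  (a single overlong match that swallows both records)
print(lazy)         # ['R_001', 'R_002']
```

The lesson's [^"]+ pattern sidesteps the problem entirely: because it cannot match a quote, it stops at the closing quote even though + is greedy.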

re.findall and re.search

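re.findall(pattern, text) returns a list of all non-overlapping matches. When the pattern contains one capture group, as it does here, the list holds the captured text rather than the whole match.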

python

import re
ids = re.findall(r'"id":\s*"([^"]+)"', raw_text)

Pattern anatomy

Part	Meaning
`"id":`	literal text "id":
`\s*`	zero or more whitespace characters
`"`	literal opening quote
`([^"]+)`	capture group: one or more non-quote characters
`"`	literal closing quote
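The same anatomy can be written into the pattern itself. With the re.VERBOSE flag, whitespace in the pattern is ignored and # starts a comment, so each part can sit on its own line. This is one possible layout, not the lesson's own code; the name ID_PATTERN is made up for the example.

```python
import re

# re.VERBOSE lets the pattern carry its own annotations.
ID_PATTERN = re.compile(r'''
    "id":      # literal text "id":
    \s*        # zero or more whitespace characters
    "          # opening quote of the value
    ([^"]+)    # capture group: one or more non-quote characters
    "          # closing quote of the value
''', re.VERBOSE)

raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'
ids = ID_PATTERN.findall(raw)
print(ids)  # ['R_001', 'R_002']
```

One caveat with verbose mode: a literal space in the pattern must be escaped or written as \s, because bare spaces are skipped.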

re.search vs re.findall

re.search(pattern, text) returns a match object for the first match, or None if nothing matches. re.findall returns a list of strings, one per match, or an empty list if nothing matches. Use search when you need only the first occurrence; use findall when you need all of them.
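A short side-by-side sketch of the two calls, including the no-match case, which is where they differ most: search gives None, so it needs a check before calling .group(), while findall gives an empty list. The variable names are illustrative.

```python
import re

raw = '{"id": "R_001", "age": 29} {"id": "R_002", "age": 22}'
pattern = r'"id":\s*"([^"]+)"'

# search: a match object for the first hit only.
first = re.search(pattern, raw)
if first is not None:
    print(first.group(0))  # "id": "R_001"   (the whole match)
    print(first.group(1))  # R_001           (just the captured group)

# findall: every hit, as plain strings.
all_ids = re.findall(pattern, raw)
print(all_ids)  # ['R_001', 'R_002']

# When nothing matches, search returns None and findall returns [].
missing = re.search(pattern, 'no ids here')
empty = re.findall(pattern, 'no ids here')
print(missing, empty)  # None []
```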

Raw string prefix

Always use r"pattern" for regex patterns — the r prevents Python from processing backslashes before the regex engine sees them.
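A quick way to see the difference is with \b, which means "word boundary" to the regex engine but "backspace character" to a plain Python string. The sample text and variable names below are made up for the illustration.

```python
import re

text = 'ids: R_001, R_002'

raw_pattern = r'\bR_001\b'   # regex engine receives backslash + b: word boundary
plain_pattern = '\bR_001\b'  # Python turns \b into a backspace character first

print(len(raw_pattern), len(plain_pattern))  # 9 7

print(re.findall(raw_pattern, text))    # ['R_001']
print(re.findall(plain_pattern, text))  # []  (there is no backspace in the text)
```

The plain-string version fails silently: no error, just no matches, which is why the habit of always writing r'...' is worth building.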
