Before you call an LLM programmatically, the mental model. What does it actually do, technically?
Predicts text? Reads a question and writes an answer?
Closer than most first-time guesses. Stripped to one line: an LLM predicts the next token. Given some text in front of it, it asks "what's the most likely word to come next?" — then once that word is chosen, it asks again, and again. The whole "answer" is just thousands of next-token guesses in sequence.
So it's not really thinking?
It's pattern-matching at enormous scale. The model trained on a huge corpus of text — books, code, conversations — and learned the statistical shape of language. When you ask "what's the capital of France?", it produces "Paris" not because it knows facts, but because in training data the pattern "capital of France" → "Paris" appeared often enough to dominate its prediction.
What does this imply for using one?
A few things, each unpacked below: hallucination is baked into the mechanism, facts need independent validation, and the same prompt can produce different outputs each call.
Today's exercise demonstrates the core mechanism. You ask the model for ONE next word — the smallest possible task that exposes next-token prediction. Tomorrow we go deeper on the call shape itself.
A Large Language Model is a neural network trained on a huge text corpus to predict the next token. Given input, it produces tokens one at a time, conditioning each new token on everything that came before.
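To make that loop concrete, here's a toy sketch in Python. The hard-coded bigram table is a stand-in for the network (a real model scores its whole vocabulary against the full context); what carries over is the loop: predict, append, repeat.

```python
# Toy illustration of the generation loop. A hard-coded bigram table
# stands in for the neural network; the loop structure is the point.
BIGRAMS = {
    "the": "capital",
    "capital": "of",
    "of": "France",
    "France": "is",
    "is": "Paris",
    "Paris": "<eos>",
}

def predict_next_token(tokens: list[str]) -> str:
    # A real model scores every token in its vocabulary against the
    # FULL context; this toy only looks at the last token.
    return BIGRAMS.get(tokens[-1], "<eos>")

def generate(prompt: list[str], max_tokens: int = 20) -> list[str]:
    tokens = list(prompt)
    for _ in range(max_tokens):
        nxt = predict_next_token(tokens)
        if nxt == "<eos>":  # stand-in for the model's stop token
            break
        tokens.append(nxt)  # the guess becomes part of the next context
    return tokens

print(" ".join(generate(["the", "capital"])))
# the capital of France is Paris
```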
A token is a chunk of text — usually ~3-4 characters or about 0.75 words for English. "Hello, world!" might tokenize as ["Hello", ",", " world", "!"] — four tokens. Different tokenizers split differently. The cost of an API call is measured in tokens, not words or characters.
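You can inspect real token boundaries yourself. One concrete tokenizer is OpenAI's tiktoken library; the IDs below come from its cl100k_base encoding and will differ under other tokenizers:

```python
# pip install tiktoken -- OpenAI's tokenizer library; other models
# use different tokenizers and split the same text differently.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
ids = enc.encode("Hello, world!")
print(ids)                             # [9906, 11, 1917, 0] -- four tokens
print([enc.decode([i]) for i in ids])  # ['Hello', ',', ' world', '!']
```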
When an LLM produces plausible-sounding but factually wrong output, we call it a hallucination. It's not a bug; it's how the system works. The model produces the most likely sequence of tokens given its training data. If your question's answer wasn't strongly represented in training, the model fills in something that looks right. That something might be wrong.
The defense: never let an LLM be the source of truth on facts that matter. Use it for language tasks, validate its facts via deterministic checks (a calculation, a database lookup, a regex against a known shape).
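A sketch of that pattern, with a date standing in for whatever fact your domain cares about. Note how the shape check alone would pass a hallucinated date; the deterministic parse catches it:

```python
import re
from datetime import date

def looks_like_iso_date(text: str) -> bool:
    # Shape check: YYYY-MM-DD by regex. Necessary, not sufficient.
    return re.fullmatch(r"\d{4}-\d{2}-\d{2}", text) is not None

def is_real_date(text: str) -> bool:
    # Deterministic check: does the calendar actually contain this date?
    try:
        date.fromisoformat(text)
        return True
    except ValueError:
        return False

model_claim = "2024-02-30"               # pretend the model produced this
print(looks_like_iso_date(model_claim))  # True  -- looks right
print(is_real_date(model_claim))         # False -- is wrong
```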
Most APIs sample tokens probabilistically. The same prompt can yield slightly different outputs each call. Plan for it. Code that asserts the response equals an exact string will be flaky; code that checks the response shape ("contains a label from {positive, negative}") will work.
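For example, with an assumed two-label classification task:

```python
ALLOWED = {"positive", "negative"}

def parse_label(response: str) -> str:
    # Normalize, then membership-test against the label set you asked for.
    label = response.strip().strip(".").lower()
    if label not in ALLOWED:
        raise ValueError(f"unexpected model output: {response!r}")
    return label

print(parse_label("Positive."))  # "positive" -- survives sampling noise
# The flaky version: assert response == "Positive"  (exact-match on sampled text)
```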
The practice below sends the model a prompt asking for exactly one word that should follow a given phrase. The whole point of asking for one word: a word is roughly the unit the model produces internally (a token). You're calling the model to do the smallest possible thing: pick the next likely token.
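If you want to peek ahead, here's a minimal sketch of that call using pydantic-ai. The model string and the prompt are assumptions; any model your account can reach will do:

```python
# Sketch only: assumes pydantic-ai is installed and OPENAI_API_KEY is set.
from pydantic_ai import Agent

agent = Agent("openai:gpt-4o-mini")
result = agent.run_sync(
    "Reply with exactly ONE word: the single word most likely to follow "
    '"The quick brown fox jumps over the lazy"'
)
print(result.output)  # most runs: "dog" -- the overwhelmingly likely next token
```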
Tomorrow's lesson formalizes the call shape (Agent, run_sync, .output, quota). Today: feel the mechanism.