3–5 · Data, Classification, Fairness

Training Data

Unit 3 digs into the fuel AI runs on: training data. Across five 30–45 minute lessons, students label examples, sort by features, clean up messy data, and discover that clearer, better-labeled data leads to better, fairer AI.

📘 5 lessons · 30–45 min lessons📄 5 printable activities + certificate

The 5 lessons

  1. 1

    What Is Training Data?

    30–45 min

    Students explain that training data is the labeled information people prepare to teach an AI.

    Vocabulary: training data · input

  2. 2

    Labels Matter

    30–45 min

    Learners match labels to examples and discover how labels make data easier for AI to sort and understand.

    Vocabulary: label · group · sort

  3. 3

    Features and Groups

    30–45 min

    Kids identify the features — color, shape, size — AI uses to group data, and see groups change with the feature.

    Vocabulary: feature · group · sort

  4. 4

    Messy Data

    30–45 min

    Students spot messy, inconsistent data and clean it into clearer groups so the AI's learning isn't confused.

    Vocabulary: input · sort · confused

  5. 5

    Better Data, Better Learning

    30–45 min

    Learners improve a data set with clearer, broader, better-labeled examples and explain the payoff for the AI.

    Vocabulary: improve · training data · label

Get the full 3–5 Teacher Pack

Every lesson includes ready-to-teach plans — warm-ups, mini-lessons, activities, assessments, and printables. No prep required.