Training Data
Unit 3 digs into the fuel AI runs on: training data. Across five 30–45 minute lessons, students label examples, sort by features, clean up messy data, and discover that clearer, better-labeled data leads to better, fairer AI.
The 5 lessons
- 1
What Is Training Data?
30–45 minStudents explain that training data is the labeled information people prepare to teach an AI.
Vocabulary: training data · input
- 2
Labels Matter
30–45 minLearners match labels to examples and discover how labels make data easier for AI to sort and understand.
Vocabulary: label · group · sort
- 3
Features and Groups
30–45 minKids identify the features — color, shape, size — AI uses to group data, and see groups change with the feature.
Vocabulary: feature · group · sort
- 4
Messy Data
30–45 minStudents spot messy, inconsistent data and clean it into clearer groups so the AI's learning isn't confused.
Vocabulary: input · sort · confused
- 5
Better Data, Better Learning
30–45 minLearners improve a data set with clearer, broader, better-labeled examples and explain the payoff for the AI.
Vocabulary: improve · training data · label
Get the full 3–5 Teacher Pack
Every lesson includes ready-to-teach plans — warm-ups, mini-lessons, activities, assessments, and printables. No prep required.