Taxonomy of Data

STAT 20: Introduction to Probability and Statistics

Agenda

  • Announcements
  • Group Activity: Conceptual
  • Worksheet 1: Taxonomy of Data
  • Break
  • Concept Questions: Coding
  • More Practice

Assignments

RQ: Taxonomy of Data (due yesterday Thu 1/22 11:59pm)


Quiz-1 (Monday 1/26)


Lab-1 Getting Started (due Tuesday 1/27 @7am)


Problem Sets (due Tuesday 1/27 @7am)

  • PS0: 2 Google Forms
  • PS1: Understanding the World with Data + Taxonomy of Data

Lab 1: Getting Started

  • Complete qmd file

  • Render qmd to obtain PDF document

  • Download (export) PDF to your computer (or tablet)

  • Submission: upload PDF to Gradescope

Notes Recap

Concept of Data

One or more characteristics observed or measured on a set of objects.

  • Variables: characteristics, features, attributes.

  • Objects: individuals, subjects, items.


Data are typically organized into a data table (or data frame), ideally with one row per individual and one column per variable.
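A minimal sketch of the "one row per individual, one column per variable" layout, in plain Python. The trees, the variable names, and all the values are hypothetical and made up for illustration; they are not from the course materials.

```python
# Hypothetical data table: each dict is one row (one individual tree),
# and each key is one column (one variable).
# tree_id is only an identifier for the row, not a measured characteristic.
trees = [
    {"tree_id": "T1", "height_m": 12.4, "fruit_count": 30, "health": "good", "species": "oak"},
    {"tree_id": "T2", "height_m": 8.1, "fruit_count": 12, "health": "fair", "species": "maple"},
    {"tree_id": "T3", "height_m": 15.7, "fruit_count": 0, "health": "poor", "species": "oak"},
]

# Column names come from the keys of the first row.
variables = list(trees[0].keys())
n_rows = len(trees)      # number of individuals
n_cols = len(variables)  # number of columns

# Print the table: a header line, then one line per individual.
print("\t".join(variables))
for row in trees:
    print("\t".join(str(row[v]) for v in variables))
```

Every row has the same keys, so each column holds the same characteristic for every individual.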

Types of Variables

Variables can be classified in different ways.

The classification (aka taxonomy) adopted in STAT 20 has two major classes, each with two subcategories:

  • Numerical
    • Continuous
    • Discrete
  • Categorical
    • Ordinal
    • Nominal
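The taxonomy above can be sketched as a small data structure. This is only an illustration: the `VariableType` enum and the example variable names are hypothetical, not part of the course materials.

```python
from enum import Enum


class VariableType(Enum):
    """Four variable types, each stored as (major class, subcategory)."""
    CONTINUOUS = ("numerical", "continuous")
    DISCRETE = ("numerical", "discrete")
    ORDINAL = ("categorical", "ordinal")
    NOMINAL = ("categorical", "nominal")

    @property
    def major_class(self):
        return self.value[0]

    @property
    def subcategory(self):
        return self.value[1]


# Hypothetical variables measured on a set of trees, labeled by type.
tree_variables = {
    "height_m": VariableType.CONTINUOUS,  # any value in a range
    "fruit_count": VariableType.DISCRETE,  # whole-number counts
    "health": VariableType.ORDINAL,        # ordered levels: poor < fair < good
    "species": VariableType.NOMINAL,       # unordered labels
}

for name, vtype in tree_variables.items():
    print(f"{name}: {vtype.major_class} / {vtype.subcategory}")
```

Storing the major class alongside the subcategory keeps both levels of the classification attached to each variable.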

Group Activity

  1. As a group, choose a set of individuals and list at least:

    • 3 continuous numerical variables
    • 3 discrete numerical variables
    • 3 ordinal categorical variables
    • 3 nominal categorical variables
  2. ❌ Do not choose:

    • students
    • electronic devices

15:00

Worksheet 1: Taxonomy of Data

https://stat20.berkeley.edu/spring-2026/1-questions-and-data/02-taxonomy-of-data/ws.pdf

20:00

Break

05:00

Practice Problems

www.gastonsanchez.com/stat20/1-questions-and-data/02-taxonomy-of-data/practice.html

These are NOT worksheets (no need to submit to Gradescope)

25:00