1.3 Summary

  • Most statistical learning methods require data in a table format with multiple columns and rows.

  • It’s important to be aware of the difference between a raw data table, and a clean table numerically codified.

  • Unless otherwise specified, in this book we’ll assume that all the variables in a data matrix have numerical variables.

  • For most of this book, we’ll consider a data set to be integrated by a group of individuals or objects on which we observe or measure one or more characteristics called variables.


Make a donation

If you find this resource useful, please consider making a one-time donation in any amount. Your support really matters.