1.3 Summary
Most statistical learning methods require data in a table format with multiple columns and rows.
It’s important to be aware of the difference between a raw data table, and a clean table numerically codified.
Unless otherwise specified, in this book we’ll assume that all the variables in a data matrix have numerical variables.
For most of this book, we’ll consider a data set to be integrated by a group of individuals or objects on which we observe or measure one or more characteristics called variables.
Make a donation
If you find this resource useful, please consider making a one-time donation in any amount. Your support really matters.