Video

Cleaning, Formatting and Grouping Data

Activity: 1

Cleaning Data and Creating Derived Fields

In this activity, you will use the OCR large data set to explore whether people in different parts of the country are more likely to cycle or walk to work. You will meet examples of data that need to be converted to a different data type, and features that can be combined to give derived values.
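As a rough illustration of the two ideas named above, the sketch below converts numbers stored as text into integers and then combines columns into a derived percentage. The rows, column names and helper functions (`to_int`, `clean_row`) are invented for this example and are not taken from the OCR large data set or from the activity notebook, which may well use a different approach or library.

```python
# Made-up rows for illustration only; these are NOT values from the OCR large data set.
# Counts are stored as text, with thousands separators and a "-" for missing data.
raw_rows = [
    {"area": "Area A", "cycle": "1,250", "walk": "3,400", "in_work": "40,000"},
    {"area": "Area B", "cycle": "980", "walk": "-", "in_work": "25,000"},
    {"area": "Area C", "cycle": "2,100", "walk": "1,900", "in_work": "32,000"},
]


def to_int(text):
    """Convert text such as '1,250' to an int; return None if it is not a whole number."""
    cleaned = text.replace(",", "").strip()
    return int(cleaned) if cleaned.isdigit() else None


def clean_row(row):
    """Return a copy of the row with numeric types and derived percentage fields."""
    cycle = to_int(row["cycle"])
    walk = to_int(row["walk"])
    in_work = to_int(row["in_work"])
    cleaned = {"area": row["area"], "cycle": cycle, "walk": walk, "in_work": in_work}

    # Derived fields: each count as a percentage of everyone in work.
    # A missing count or a zero total leaves the percentage undefined (None).
    for name, count in (("cycle_pct", cycle), ("walk_pct", walk)):
        if count is None or not in_work:
            cleaned[name] = None
        else:
            cleaned[name] = 100 * count / in_work
    return cleaned


cleaned_rows = [clean_row(row) for row in raw_rows]
for row in cleaned_rows:
    print(row)
```

Percentages are used here, rather than raw counts, because areas with more workers would otherwise look as though they have more cyclists and walkers simply because of their size.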

Click here for the Google Colab variation

Kaggle coding activity

Activity: 2

Grouping and Filtering Data

In this activity, you will explore whether petrol or diesel cars are heavier and which have higher emissions using the AQA large data set. You will use filtering to select subsets of the data and grouping to compare subsets.
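A minimal sketch of filtering followed by grouping is shown below. The car records, field names and the `summary` structure are hypothetical and chosen only to show the technique. They are not values from the AQA large data set, so the numbers printed say nothing about real petrol or diesel cars, and the activity notebook itself may use different tools.

```python
from collections import defaultdict
from statistics import mean

# Made-up records for illustration only; these are NOT values from the AQA large data set.
cars = [
    {"fuel": "petrol", "mass_kg": 1100, "co2_g_per_km": 120},
    {"fuel": "diesel", "mass_kg": 1500, "co2_g_per_km": 110},
    {"fuel": "petrol", "mass_kg": 1300, "co2_g_per_km": 140},
    {"fuel": "diesel", "mass_kg": 1700, "co2_g_per_km": 130},
    {"fuel": "electric", "mass_kg": 1600, "co2_g_per_km": 0},
]

# Filtering: keep only the subset of rows we want to compare.
selected = [car for car in cars if car["fuel"] in ("petrol", "diesel")]

# Grouping: collect the selected rows under their fuel type.
groups = defaultdict(list)
for car in selected:
    groups[car["fuel"]].append(car)

# Summarise each group so the subsets can be compared side by side.
summary = {
    fuel: {
        "count": len(rows),
        "mean_mass_kg": mean(car["mass_kg"] for car in rows),
        "mean_co2_g_per_km": mean(car["co2_g_per_km"] for car in rows),
    }
    for fuel, rows in groups.items()
}

for fuel, stats in summary.items():
    print(fuel, stats)
```

Filtering first keeps fuel types outside the comparison (here the single electric car) out of the grouped summary.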

Click here for the Google Colab variation

Kaggle coding activity

Video

The Importance of Pre-processing Data

Video

Meet a Data Scientist: Rachel, The Met Office