
Lesson 2: Pre-processing Data

Introduction to Data Science

All the data in this lesson needs to be pre-processed or cleaned before you can analyse it. As well as learning some pre-processing (or ‘data wrangling’) techniques, you will find out how to group and filter data.


In this activity you will explore whether people are more likely to cycle to work or walk to work in different parts of the country, using the OCR large data set. You will meet examples of data that needs to be converted to a different data type, and features that can be combined to give derived values.
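As a rough illustration of the two ideas mentioned above, the sketch below converts counts stored as text into integers and then combines columns into derived percentages. It uses plain Python, which may differ from the tools used in the coding activity. The rows, area names, column names and counts are all made up for this example and are not taken from the OCR large data set.

```python
# Made-up rows: the area names, column names and counts are illustrative only.
# Counts are stored as text with thousands separators, so they need converting.
rows = [
    {"area": "Area A", "in_employment": "12,500", "bicycle": "1,250", "on_foot": "2,500"},
    {"area": "Area B", "in_employment": "8,000", "bicycle": "160", "on_foot": "1,200"},
]


def to_int(text):
    """Convert a count stored as text, such as '12,500', to an integer."""
    return int(text.replace(",", ""))


for row in rows:
    # Convert each text column to a number so that arithmetic works on it.
    for column in ("in_employment", "bicycle", "on_foot"):
        row[column] = to_int(row[column])

    # Derived values: the percentage of workers who cycle or walk.
    # Percentages allow a fair comparison between areas of different sizes.
    row["pct_cycle"] = 100 * row["bicycle"] / row["in_employment"]
    row["pct_walk"] = 100 * row["on_foot"] / row["in_employment"]

for row in rows:
    more_common = "cycling" if row["pct_cycle"] > row["pct_walk"] else "walking"
    print(row["area"], round(row["pct_cycle"], 1), round(row["pct_walk"], 1), more_common)
```

Comparing percentages rather than raw counts matters here because a large area will usually have more cyclists and more walkers than a small one, whatever people's preferences are.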

Go to coding activity

In this activity you will explore whether petrol or diesel cars are heavier, and which have higher emissions, using the AQA large data set. You will use filtering to select subsets of the data and grouping to compare subsets.
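Filtering and grouping can be sketched in a few lines. The example below uses plain Python, which may differ from the tools used in the coding activity. The rows, column names and values are made up for illustration and are not taken from the AQA large data set, so the averages it produces say nothing about real cars.

```python
from collections import defaultdict
from statistics import mean

# Made-up rows: the column names and values are illustrative only.
cars = [
    {"fuel": "petrol", "mass_kg": 1000, "co2_g_per_km": 120},
    {"fuel": "diesel", "mass_kg": 1400, "co2_g_per_km": 110},
    {"fuel": "petrol", "mass_kg": 1200, "co2_g_per_km": 140},
    {"fuel": "diesel", "mass_kg": 1600, "co2_g_per_km": 130},
    {"fuel": "electric", "mass_kg": 1500, "co2_g_per_km": 0},
]

# Filtering: keep only the rows that match a condition.
selected = [car for car in cars if car["fuel"] in ("petrol", "diesel")]

# Grouping: collect the selected rows into one list per fuel type.
groups = defaultdict(list)
for car in selected:
    groups[car["fuel"]].append(car)

# Summarise each group so that the subsets can be compared.
summary = {
    fuel: {
        "mean_mass_kg": mean(car["mass_kg"] for car in members),
        "mean_co2_g_per_km": mean(car["co2_g_per_km"] for car in members),
    }
    for fuel, members in groups.items()
}

for fuel, stats in summary.items():
    print(fuel, stats["mean_mass_kg"], stats["mean_co2_g_per_km"])
```

Filtering first and grouping second keeps rows outside the comparison (here, the made-up electric car) from appearing as an extra group in the summary.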

Go to coding activity