Skip to content

Lesson 3: Cleaning, formatting and grouping data

Introduction to Data Science

In this lesson you will meet some more techniques for pre-processing and cleaning data. These include how to change the data type of a value, such as from text to number, and how group similar items in the data set.

In this activity you will explore whether people more likely to cycle to work or walk to work in different parts of the country using the OCR large data set. You will meet examples of fields that need to be converted to numerical values and fields that can be combined to give derived values.

Go to coding activity
Download printable activity

In this activity you will explore whether petrol or diesel cars are heavier and which have higher emissions using the AQA large data set. You will use filtering to select subsets of the data and grouping to compare subsets.

Go to coding activity
Download printable activity