Lesson Two
Introduction to Data Science
In this lesson you will learn some pre-processing (or ‘data wrangling’) techniques, and how to group and filter data.
Video
Cleaning, Formatting and Grouping Data
Activity: 1
Cleaning Data and Creating Derived Fields
In this activity you will explore whether people are more likely to cycle to work or walk to work in different parts of the country using the OCR large data set. You will meet examples of data that needs to be converted to a different data type, and features that can be combined to give derived values.
Activity: 2
Grouping and Filtering Data
In this activity, you will explore whether petrol or diesel cars are heavier and which have higher emissions using the AQA large data set. You will use filtering to select subsets of the data and grouping to compare subsets.
Video
The Importance of Pre-processing Data
Video
Meet a Data Scientist: Rachel, The Met Office
Resources