Lesson Four

The Data Science Cycle

In this lesson you will use all the skills you have developed so far to analyse one of four data sets. If you are studying A Level maths, choose the activity that uses the large data set from your exam board. Otherwise, choose whichever interests you most.

Video

The Data Science Cycle

Activity

Working with a Large Data Set

Activity 4a: AQA

Investigate differences in the emissions of cars from 2002 and 2016 in the AQA large data set.

Google Colab variation

Kaggle coding activity

Activity 4b: Edexcel

Investigate whether there has been a change in weather between the two years across all the weather stations in the Edexcel large data set.

Google Colab variation

Kaggle coding activity

Activity 4c: MEI

Investigate the differences between high- and low-income economies in the MEI large data set about countries.

Google Colab variation

Kaggle coding activity

Activity 4d: OCR

Investigate whether there was a change in methods of travel between 2001 and 2011 in the OCR large data set.

Google Colab variation

Kaggle coding activity
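
Each of the activities above asks for a comparison between two groups (two years, or two income levels). As a rough illustration of what such a comparison can look like in code, here is a minimal sketch using only Python's standard library. The numbers are made up for illustration and are not taken from any exam board's large data set, and the helper name `summarise` is hypothetical; the actual activities may use different tools and steps.

```python
from statistics import mean, median, stdev


def summarise(values):
    """Return basic summary statistics for a list of numbers."""
    return {
        "n": len(values),
        "mean": mean(values),
        "median": median(values),
        "stdev": stdev(values),  # sample standard deviation
    }


# Made-up measurements for two groups (for example, an earlier and a later year).
# These values are illustrative only, not real data.
group_a = [160, 172, 181, 169, 175]
group_b = [118, 125, 109, 131, 122]

summary_a = summarise(group_a)
summary_b = summarise(group_b)

# A negative difference means group B is lower on average than group A.
difference_in_means = summary_b["mean"] - summary_a["mean"]

print("Group A:", summary_a)
print("Group B:", summary_b)
print("Difference in means (B - A):", round(difference_in_means, 1))
```

A difference in summary statistics on its own does not show that a change is meaningful; looking at the spread and plotting both groups are natural next steps.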

Video

PPDAC and CRISP-DM

Video

Meet a Data Scientist: Tim, Liverpool FC