Lesson 4: The Data Science Cycle
Introduction to Data Science
In this lesson you will use all the skills you have developed so far to analyse one of four data sets. If you are studying A-level maths, choose the activity that uses the large data set from your exam board. Otherwise, choose whichever interests you most.
Activity 4a: AQA
Investigate differences in the emissions of cars from 2002 and 2016 in the AQA large data set
Kaggle coding activity | Colab coding activity
Activity 4b: Edexcel
Investigate whether there has been a change in weather between the two years across all the weather stations in the Edexcel large data set
Kaggle coding activity | Colab coding activity
Activity 4c: MEI
Investigate the differences between high- and low-income economies in the MEI large data set about countries
Kaggle coding activity | Colab coding activity
Activity 4d: OCR
Investigate whether there was a change in methods of travel between 2001 and 2011 in the OCR large data set
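
All four activities come down to comparing a measurement between two groups (two years, or two income levels). As a rough starting point, the sketch below shows one way that comparison might be summarised using only the Python standard library. The function name `summarise_by_group`, the column names `year` and `co2`, and the rows themselves are hypothetical and made up for illustration; they are not taken from any exam board's large data set, and the coding activities may well take a different approach.

```python
from statistics import mean, median, stdev


def summarise_by_group(records, group_key, value_key):
    """Return {group: {"n", "mean", "median", "stdev"}} for one numeric column."""
    groups = {}
    for row in records:
        value = row.get(value_key)
        if value is None:  # skip missing values rather than guessing them
            continue
        groups.setdefault(row[group_key], []).append(float(value))

    summary = {}
    for name, values in groups.items():
        summary[name] = {
            "n": len(values),
            "mean": mean(values),
            "median": median(values),
            # The sample standard deviation needs at least two values.
            "stdev": stdev(values) if len(values) > 1 else 0.0,
        }
    return summary


# Made-up illustrative rows, NOT values from any large data set.
toy_rows = [
    {"year": 2002, "co2": 150.0},
    {"year": 2002, "co2": 170.0},
    {"year": 2002, "co2": None},
    {"year": 2016, "co2": 110.0},
    {"year": 2016, "co2": 130.0},
]

summary = summarise_by_group(toy_rows, "year", "co2")
for year, stats in sorted(summary.items()):
    print(year, stats)
```

Comparing the centre (mean or median) and the spread (standard deviation) of each group is usually a sensible first step before drawing charts or any conclusions about whether the groups really differ.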
- Cross-industry standard process for data mining – Wikipedia
- CRISP-DM – a Standard Methodology to Ensure a Good Outcome – Data Science Central