You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers the essential processes of data cleaning and preparation — the steps required to transform raw data into a form suitable for statistical analysis. In real-world statistics (and in A-Level exam questions), data is rarely perfect. Learning to identify and handle problems systematically is a key skill.
Raw data collected from real-world sources almost always contains imperfections. If these are not addressed before analysis, the results may be misleading, inaccurate, or invalid. Data cleaning is the process of detecting and correcting (or removing) errors, inconsistencies, and gaps in a data set.
In the context of the AQA large data set:
Missing values are one of the most common issues in real data. There are several strategies for dealing with them, each with advantages and disadvantages.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.