This lesson covers how to evaluate the quality and usefulness of fitness tests, as required by the Edexcel GCSE PE specification (1PE0). You must understand the concepts of validity, reliability, practicality and the use of normative data when assessing whether a fitness test is fit for purpose.
Not all fitness tests are equally useful. Before relying on the results of a test, a coach or performer should consider whether the test actually measures what it claims to, whether the results can be trusted, and whether the test is practical to carry out.
Definition: Validity is the degree to which a test measures what it claims to measure.
A valid test accurately reflects the component of fitness being assessed.
| Example | Validity Assessment |
|---|---|
| The Cooper 12-minute run measures cardiovascular endurance | High validity: running for 12 minutes directly tests the ability of the heart and lungs to supply oxygen to the working muscles |
| Using a grip dynamometer to measure overall body strength | Low validity: a grip dynamometer only measures hand and forearm strength, not whole-body strength |
| The sit and reach test measures hamstring and lower-back flexibility | Moderate validity: it only measures flexibility in one area of the body, not overall flexibility |
Exam Tip: If a test only measures one aspect of a broad component, its validity for that overall component is reduced. For example, the sit and reach test is valid for hamstring flexibility but not valid for shoulder flexibility.
Definition: Reliability is the degree to which a test produces consistent, repeatable results under the same conditions.
A reliable test gives similar results when repeated by the same person under the same conditions.
Factors that affect reliability:
| Factor | How It Affects Reliability |
|---|---|
| Standardised procedures | If the test is carried out the same way every time (same equipment, same instructions), reliability is higher |
| Environmental conditions | Temperature, wind, surface and time of day should be consistent |
| Calibrated equipment | Equipment must be checked and standardised (e.g. dynamometer calibrated to zero) |
| Human error | If a partner operates the stopwatch, reaction time in starting/stopping can vary |
| Performer's state | Fatigue, motivation, illness and time since the last meal can all affect results |
Example: If a performer completes the bleep test on Monday and scores Level 9.4, then repeats it on Tuesday under the same conditions and scores Level 9.3, the results are consistent, so the test has high reliability. If the second score had instead dropped to Level 6.2, the results would be inconsistent, which suggests that either the test or the conditions it was carried out under were not reliable.
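The comparison in the example above can be sketched in a few lines of Python. This is only an illustration, not part of the specification: it assumes a score such as "9.4" is read as level 9, shuttle 4, and the function names and the `max_level_gap` threshold of one whole level are hypothetical choices made for this sketch.

```python
from __future__ import annotations


def parse_bleep_score(score: str) -> tuple[int, int]:
    """Split a score such as "9.4" into (level, shuttle)."""
    level, shuttle = score.split(".")
    return int(level), int(shuttle)


def is_consistent(first: str, second: str, max_level_gap: int = 1) -> bool:
    """Return True when two scores differ by at most max_level_gap whole levels.

    max_level_gap is an illustrative threshold chosen for this sketch,
    not an official standard for judging reliability.
    """
    first_level, _ = parse_bleep_score(first)
    second_level, _ = parse_bleep_score(second)
    return abs(first_level - second_level) <= max_level_gap


# Monday vs Tuesday scores from the example above
print(is_consistent("9.4", "9.3"))  # True: both scores are at level 9
print(is_consistent("9.4", "6.2"))  # False: the second score is 3 levels lower
```

A check like this only flags that two results disagree. It cannot say whether the cause is the test itself, the conditions or the performer's state, so the factors in the table above still need to be considered.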
Definition: Practicality is how easy, affordable and feasible a test is to carry out.
| Factor | Questions to Ask |
|---|---|
| Cost | Is the equipment expensive? Can a school afford it? |
| Equipment | Is specialist equipment needed? Is it readily available? |
| Time | How long does the test take? Can it be completed in a single lesson? |
| Space | Is a large area needed? Is it available? |
| Expertise | Does the tester need specialist training to administer the test? |
| Number of participants | Can only one person be tested at a time, or can groups be tested? |