You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Two of the most important concepts in research methods are reliability and validity. These determine the quality of a study and whether its findings can be trusted.
Reliability refers to the consistency of a measurement or finding. A reliable study produces the same results when repeated under the same conditions. If a study is not reliable, its findings cannot be trusted.
| Type | Description | How to Assess |
|---|---|---|
| Test-retest reliability | The same test produces consistent results when given to the same people on different occasions | Give the test twice to the same participants and compare results |
| Inter-rater reliability | Different observers or raters produce consistent results when measuring the same behaviour | Two or more observers independently record behaviour and their records are compared |
Validity refers to whether a study measures what it claims to measure and whether the findings are genuine and meaningful. A study can be reliable but not valid (e.g. a bathroom scale that consistently reads 5kg too heavy — it is reliable but not valid).
| Type | Description | Example |
|---|---|---|
| Internal validity | The extent to which the study measures what it claims to measure — did the IV really cause the change in the DV? | Were confounding variables controlled? |
| External validity | The extent to which findings can be generalised beyond the specific study | |
| — Ecological validity | Can findings be generalised to real-life settings? | Lab experiments may lack ecological validity |
| — Population validity | Can findings be generalised to other groups of people? | Studies using only psychology students may not generalise to the wider population |
| Face validity | Does the measurement appear to measure what it claims? | Does an intelligence test look like it tests intelligence? |
| Threat | Description |
|---|---|
| Demand characteristics | Participants guess the purpose of the study and change their behaviour accordingly |
| Social desirability bias | Participants give answers they think are socially acceptable rather than truthful |
| Observer bias | The researcher's expectations influence how they interpret behaviour |
| Confounding variables | Uncontrolled variables that provide alternative explanations for the results |
| Experimenter effects | The researcher's behaviour unintentionally influences participants |
| Reliable | Not Reliable | |
|---|---|---|
| Valid | Ideal — consistent AND measures what it claims | Impossible — cannot be valid if not reliable |
| Not Valid | Possible — consistently wrong | Poor quality research |
Exam Tip: Remember that a study must be reliable before it can be valid. A measurement that gives different results each time (unreliable) cannot possibly be measuring what it claims to (valid). However, a study can be reliable but not valid — like the broken scale that is consistently wrong.
quadrantChart
title Reliability vs Validity
x-axis Low Reliability --> High Reliability
y-axis Low Validity --> High Validity
quadrant-1 Reliable and Valid (ideal)
quadrant-2 Valid but inconsistent (rare)
quadrant-3 Neither (poor research)
quadrant-4 Reliable but inaccurate
Ideal study: [0.85, 0.85]
Broken scale: [0.85, 0.2]
Scattered darts: [0.2, 0.2]
Biased IQ test: [0.8, 0.25]
Peer review is the process by which research is evaluated by other experts in the same field before it is published in a scientific journal.
Reliability and validity are the twin tests of research quality. A study that is both reliable and valid provides findings that can be trusted and built upon. A study that fails either test provides results that must be treated with caution — and a study that fails both has little value at all.
For this reason, AQA examiners frequently ask students to evaluate studies in terms of reliability and validity. Use the two concepts as a scaffolding for your evaluations: ask whether the methods used would produce consistent results if repeated (reliability) and whether the measurements truly capture what they claim to capture (validity). Identify specific threats (e.g. demand characteristics, observer bias) and explain how the researcher could address them. This structured approach is the signature of top-grade answers.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.