You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Secondary sources are data that already exist and were originally collected or created for purposes other than the researcher's own study. They allow sociologists to analyse large-scale patterns, historical trends, and phenomena that would be impossible or impractical to study first-hand — but they also raise a profound methodological question that runs through this whole lesson: do statistics and documents simply record social reality, or do they construct it? The positivist treats official statistics as hard "social facts"; the interpretivist treats them as the by-product of the decisions, labels, and interests of the people who compile them. This lesson examines the two principal secondary sources — official statistics and documents — evaluating each using the PET framework and the criteria of validity, reliability, and representativeness, and grounds the analysis in landmark studies from Durkheim to the Glasgow University Media Group.
Secondary sources fall under Research Methods within the Theory and Methods component of AQA 7192, assessed on Paper 1 (Education with Theory and Methods) and Paper 3 (Crime and Deviance with Theory and Methods) — including the possible 10-mark "Outline and explain two…" question and, on Paper 3, a 30-mark essay. They are also central to the Methods in Context question (Paper 1, 20 marks): school records, attendance and exclusion data, and Ofsted reports are official documents frequently proposed for studying educational issues, and you must evaluate their fit — for example, whether exclusion statistics validly measure pupil "behaviour" or instead reflect schools' labelling practices.
Official statistics are quantitative data collected and published by government agencies and other official bodies. In the UK key sources include the Office for National Statistics (ONS), the Home Office, the Department for Education, and NHS data services.
Key Definition: Official statistics — numerical data collected and published by government departments and agencies, covering crime, health, education, employment, population, and poverty.
There are two ways the state gathers them. Registration data is generated when people are legally required to register an event (births, deaths, marriages). Survey data is gathered by official surveys (the Census; the Labour Force Survey; the Crime Survey for England and Wales). This distinction matters for evaluation: registration data is near-complete, whereas survey data carries the usual sampling and non-response limitations.
| Source | Data Provided |
|---|---|
| Census | Comprehensive UK population data every 10 years (age, ethnicity, occupation, housing, religion) |
| Police-recorded crime | Offences recorded by the police, published by the Home Office |
| Crime Survey for England and Wales (CSEW) | Large victim survey capturing unreported as well as reported crime |
| Educational statistics | School performance, exam results, exclusions, HE participation |
| Labour Force Survey | Employment, unemployment, economic activity, earnings |
| Vital statistics | Births, deaths, marriages, divorces |
| Health statistics | Illness, hospital admissions, mental-health diagnoses, life expectancy |
| Advantage | Explanation |
|---|---|
| Free and accessible | Mostly published online at no cost |
| Large scale | Cover the whole population or very large samples — potentially highly representative |
| Comparisons | Standardised collection allows comparison over time and between places |
| Already collected | Save the researcher the time and cost of primary fieldwork |
| Identify trends | Longitudinal data reveal change in divorce, crime, and attainment over time |
| Springboard | Highlight patterns warranting deeper primary investigation |
| Disadvantage | Explanation |
|---|---|
| Definitions may not match | Official definitions of 'unemployment', 'poverty' or 'crime' may not fit the sociologist's operationalisation |
| Changing definitions | Counting methods change over time, undermining longitudinal comparison (e.g. the method of counting unemployment has been altered many times) |
| Not for sociology | Compiled to serve government, not sociological questions |
| Incomplete | May omit the variables the researcher needs |
| Political pressure | Definitions can be adjusted to flatter the government's record |
The Positivist View. Positivists treat official statistics as objective social facts — reliable, valid measures usable to find patterns, correlations and causes. The classic example is Durkheim's Suicide (1897): using official suicide statistics across countries and groups, he argued that suicide rates were caused by social facts — levels of social integration and moral regulation — not individual psychology, identifying types such as egoistic and anomic suicide. For positivists this is the model of scientific sociology: hard public data revealing law-like social patterns.
The Interpretivist View. Interpretivists counter that statistics are social constructions — products of the decisions and labels of those who compile them, not transparent reflections of reality. Crime statistics are the prime example:
Key Definition: Dark figure of crime — the gap between the actual amount of crime and the amount recorded in official statistics, comprising crime that is not reported and crime that is reported but not recorded.
The Marxist View. Marxists argue official statistics serve an ideological function, reflecting ruling-class interests: crime figures spotlight working-class "street crime" while obscuring corporate, white-collar and state crime; unemployment figures may be massaged to flatter economic management; and the very categories (e.g. class classifications) embody dominant assumptions about social structure rather than the lived reality of inequality.
Documents are a broad secondary source — any written, visual, or digital material usable as evidence.
| Type | Examples |
|---|---|
| Personal documents | Diaries, letters, autobiographies, photographs, social-media posts |
| Public documents | Government and policy reports, Hansard, newspapers |
| Historical documents | Parish records, workhouse records, census returns, court records |
| Media content | Newspaper articles, television, films, advertisements, websites, social media |
| Official reports | Ofsted inspections, public inquiries (e.g. the Macpherson Report on institutional racism), Royal Commissions |
John Scott (1990) proposed four criteria for judging documentary sources — a framework examiners reward when you apply it explicitly.
| Criterion | Meaning |
|---|---|
| Authenticity | Is the document genuine and complete, not a forgery or altered copy? |
| Credibility | Is it accurate and sincere, or did the author have reason to distort or exaggerate? |
| Representativeness | Is it typical of its kind? Have some documents survived while others were lost or destroyed (survival bias)? |
| Meaning | Can the researcher understand and interpret it correctly, or does it need specialist decoding? |
Key Definition: Content analysis — a method of systematically categorising and counting words, themes, images, or messages in documents or media, converting qualitative material into quantitative data.
The procedure is to (1) sample documents or media; (2) define categories or a coding frame (e.g. types of gender representation); (3) systematically code the content; and (4) analyse the resulting figures for patterns. The Glasgow University Media Group (GUMG) famously used content analysis of television news on industrial disputes, finding that coverage systematically favoured management perspectives over those of workers.
| Advantage | Limitation |
|---|---|
| Produces quantitative data from qualitative sources | Reduces rich content to simplified categories |
| Can process large volumes of material | Category choice is subjective and may embed the researcher's assumptions |
| Reliable — the coding frame can be reapplied by others | May miss wider context and meaning |
| Reveals patterns and trends over time | Cannot by itself explain why patterns exist |
Thematic analysis is the qualitative counterpart: rather than counting categories, the researcher identifies and interprets recurring themes — patterns of meaning — in the text. It is more flexible and captures nuance and context (higher validity), but is more subjective and less reliable, since different researchers may identify different themes. The choice between content and thematic analysis is itself a positivist-versus-interpretivist decision in miniature.
Life (personal) documents are first-person accounts — diaries, letters, autobiographies, photographs — offering a window into subjective experience.
William Thomas and Florian Znaniecki drew on a large collection of personal documents, including hundreds of letters between Polish migrants in America and their families in Poland, to study migration and social change. This pioneering study showed the power of personal documents for understanding the subjective experience of social processes — an early interpretivist landmark.
Strengths: rich personal insight; access to aspects of life other methods miss; historical documents are often the only evidence about past societies. Limitations: subjectivity and weak representativeness; selectivity (people present themselves favourably, and only some documents survive); and difficulties of establishing authenticity, credibility, and meaning — exactly Scott's criteria.
Documents split sociologists along the familiar theoretical line, and the split maps onto the choice of analytic technique.
The interpretivist case. Interpretivists value personal and qualitative documents highly because they are rich in validity and meaning. A diary, letter or autobiography is a first-person account of how the writer understood their world — a direct route to the subjective meanings interpretivists seek. Thomas and Znaniecki's use of migrants' letters is the founding example; their concept of the "definition of the situation" (that people act on how they define reality, not on reality as such) emerged precisely from such documents. For interpretivists, the appropriate analytic tool is thematic analysis, which preserves nuance and context.
The positivist case. Positivists are uneasy with personal documents because they are unrepresentative, subjective, and impossible to standardise or replicate — they fail the tests of reliability and representativeness. Where positivists do use documents, they favour content analysis, which imposes a fixed coding frame to convert qualitative material into reliable, quantifiable, comparable data. The Glasgow University Media Group's counts of how often management versus worker viewpoints were favoured on the news exemplify this quantifying approach (though the GUMG drew radical conclusions from it).
Public and historical documents sit between the two. They can be enormously useful — official reports such as the Macpherson Report on institutional racism, or Ofsted inspection reports, provide authoritative, accessible evidence — but they must be read critically. Public documents are often written to justify a position or protect an institution (a credibility problem in Scott's terms), and historical documents suffer survival bias: the records that happen to have survived may be unrepresentative of those that were lost or destroyed. This is why Scott's four criteria — authenticity, credibility, representativeness, meaning — are the indispensable evaluative framework for any document, and why simply quoting a document without interrogating it scores poorly.
| Content analysis | Thematic analysis | |
|---|---|---|
| Approach | Quantitative — count categories | Qualitative — interpret themes |
| Strength | Reliable, comparable, handles large volumes | Valid, captures nuance and context |
| Weakness | Strips out meaning; category choice is subjective | Less reliable; researcher's reading may differ |
| Favoured by | Positivists | Interpretivists |
In the Methods in Context question, secondary sources are frequently proposed for educational issues — for example, using attendance and exclusion statistics to study anti-school behaviour, or Ofsted reports and school records to study a school's culture. Their applied strengths are genuine: school records are cheap and already collected, cover whole cohorts (high representativeness within the school), and allow comparison over time and between schools; statistics on exclusions or results can reveal class, gender and ethnic patterns of the kind positivists seek.
But you must apply the critical limitations specific to educational documents:
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.