Data collection (sampling, questionnaires)

    Master WJEC GCSE Mathematics Data Collection (4.1) by learning how to design flawless questionnaires and calculate representative samples. This guide will show you how to secure every mark by avoiding common pitfalls and applying examiner-approved techniques for sampling and data presentation.

    Study Notes

    Overview

    Welcome to the essential guide for WJEC GCSE Mathematics topic 4.1: Data Collection. This topic is fundamental to understanding how we gather and interpret information about the world around us. In your exam, you will be tested on your ability to critically evaluate data collection methods, design effective questionnaires, and use sampling techniques to create representative datasets. This is a highly practical area of mathematics, and examiners are looking for candidates who can apply their knowledge to real-world scenarios. You will often see questions that ask you to identify bias, correct poorly designed questions, or calculate sample sizes. Mastering these skills is crucial as they not only secure marks in this section but also link to data representation and analysis topics like charts and averages.

    Key Concepts

    Concept 1: Designing Effective Questionnaires

    A questionnaire is a tool for collecting information. To earn marks, your questions must be clear, unambiguous, and designed to collect the specific data you need. WJEC examiners focus on two key features:

    1. Specific Time Frame: Questions about frequency must be constrained to a specific period. Vague terms like 'often' or 'sometimes' are not acceptable as they are subjective. You must provide a clear time frame to ensure every respondent is answering on the same basis.

      • Bad Example: How often do you read?
      • Good Example: How many books did you read in the last month?
    2. Non-Overlapping and Exhaustive Response Boxes: The options you provide must cover all possible answers without any ambiguity.

      • Non-overlapping: A respondent must not be able to select two different boxes for a single answer. For example, boxes labelled '0-5' and '5-10' are flawed because a person who answers '5' could tick either box.
      • Exhaustive: The boxes must cover every possible answer. This usually means including a box for '0' and an open-ended upper category like '10 or more'.
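The two response-box rules above can be checked mechanically. The following is a minimal sketch, assuming each box is written as a hypothetical `(low, high)` pair of whole numbers, with `high=None` standing for an open-ended box such as '10 or more'. The function name `check_boxes` is illustrative only and is not part of any exam specification.

```python
def check_boxes(boxes):
    """Return a list of problems found in a set of response boxes.

    boxes: list of (low, high) whole-number ranges, both ends inclusive.
           high=None means an open-ended box such as '10 or more'.
    """
    if not boxes:
        return ["no boxes given"]

    problems = []
    ordered = sorted(boxes, key=lambda box: box[0])

    # Exhaustive: the lowest box must include 0.
    if ordered[0][0] > 0:
        problems.append("no box covers 0")

    # Compare each box with the next one up.
    for (lo1, hi1), (lo2, _hi2) in zip(ordered, ordered[1:]):
        if hi1 is None or lo2 <= hi1:
            # Non-overlapping rule broken: one answer fits two boxes.
            problems.append(f"overlap: boxes starting at {lo1} and {lo2} share a value")
        elif lo2 > hi1 + 1:
            # Exhaustive rule broken: some answers fit no box.
            problems.append(f"gap: nothing covers {hi1 + 1} to {lo2 - 1}")

    # Exhaustive: the highest box must be open-ended.
    if ordered[-1][1] is not None:
        problems.append("no open-ended top box")

    return problems


# The flawed boxes from the text: '0-5' and '5-10'.
print(check_boxes([(0, 5), (5, 10)]))
# A corrected set: '0', '1-5', '6-9', '10 or more'.
print(check_boxes([(0, 0), (1, 5), (6, 9), (10, None)]))
```

The first call reports the overlap at 5 and the missing open-ended box; the second returns an empty list because the boxes pass both checks.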

    [Figure: questionnaire design comparison]

    Concept 2: Sampling Methods

    Sampling is the process of selecting a subset of a population to represent the whole group. The goal is to create a representative sample that accurately reflects the characteristics of the entire population.

    • Random Sampling: Every individual in the population has an equal chance of being selected. This can be done using a random number generator or by drawing names from a hat. The method is fair, but by chance it may still produce a sample that is not perfectly representative.
    • Systematic Sampling: You select members at regular intervals from an ordered list (e.g., every 10th person). You must start from a random point. This is straightforward but can be biased if the list has a hidden pattern.
    • Stratified Sampling: This is a more advanced method that ensures subgroups (strata) within a population are represented proportionally. You divide the population into strata based on a shared characteristic (e.g., age, gender, year group) and then calculate the number of people to sample from each stratum to reflect its proportion in the overall population.
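Random and systematic sampling, as described above, can be sketched in a few lines. This is an illustration only, assuming the population is held in an ordered Python list and the sample size is no larger than the population. The function names and the `rng` parameter are hypothetical, added so that a seeded generator can be passed in for repeatable results.

```python
import random


def simple_random_sample(population, n, rng=random):
    """Pick n different members, each equally likely to be chosen."""
    return rng.sample(population, n)


def systematic_sample(population, n, rng=random):
    """Pick every k-th member of an ordered list, from a random start."""
    k = len(population) // n   # sampling interval, e.g. every 10th person
    start = rng.randrange(k)   # random starting point within the first interval
    return [population[start + i * k] for i in range(n)]


# Hypothetical ordered list of 100 people, numbered 1 to 100.
people = list(range(1, 101))

print(simple_random_sample(people, 10))
print(systematic_sample(people, 10))
```

With 100 people and a sample of 10, the interval is 10, so the systematic sample always contains members that are exactly 10 places apart in the list.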

    [Figure: sampling methods]

    Concept 3: Identifying and Avoiding Bias

    Bias is a systematic error in sampling or testing that results in a sample that is not representative of the population. A biased sample leads to inaccurate conclusions. Examiners will often ask you to identify sources of bias.

    • Location/Timing Bias: The place or time a survey is conducted can influence the results. For example, surveying people outside a gym about their fitness habits will likely over-represent people who exercise regularly.
    • Leading Questions: A question phrased in a way that suggests a particular answer is a leading question (e.g., "Don't you agree that school holidays are too short?").
    • Self-Selection Bias: This occurs when individuals volunteer to participate in a survey. Volunteers may have stronger opinions or be more interested in the topic than the general population.

    Mathematical/Scientific Relationships

    Stratified Sampling Formula

    This formula is essential and must be memorised. It is not given on the formula sheet.

    Number to sample from a stratum = (Size of stratum / Size of total population) x Total sample size

    • Size of stratum: The number of individuals in the specific subgroup.
    • Size of total population: The total number of individuals in all strata combined.
    • Total sample size: The desired number of individuals in the final sample.

    Example Calculation:

    A university has 2000 students: 1200 undergraduate and 800 postgraduate. A sample of 100 students is required.

    • Undergraduate sample: (1200 / 2000) x 100 = 0.6 x 100 = 60 students
    • Postgraduate sample: (800 / 2000) x 100 = 0.4 x 100 = 40 students
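The same calculation can be written as a short function that applies the stratified sampling formula to every stratum at once. This is a minimal sketch: the function name `stratified_allocation` and the dictionary layout are assumptions made for illustration. It returns unrounded values and does not decide how to round a result that is not a whole number.

```python
def stratified_allocation(strata_sizes, sample_size):
    """Apply (stratum size / total population) x total sample size to each stratum.

    strata_sizes: dict mapping a stratum name to the number of people in it.
    sample_size:  the total number of people wanted in the sample.
    """
    total = sum(strata_sizes.values())
    # Multiplying before dividing gives the same value as the formula
    # and keeps the arithmetic exact when the answer is a whole number.
    return {
        name: size * sample_size / total
        for name, size in strata_sizes.items()
    }


# The university example from the text.
allocation = stratified_allocation(
    {"undergraduate": 1200, "postgraduate": 800},
    sample_size=100,
)
print(allocation)  # {'undergraduate': 60.0, 'postgraduate': 40.0}
```

A useful check is that the stratum samples add up to the total sample size: here 60 + 40 = 100.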

    [Figure: stratified sampling diagram]

    Practical Applications

    Data collection skills are used everywhere:

    • Market Research: Companies use questionnaires and sampling to understand consumer preferences and test new products.
    • Government Surveys: The census is a large-scale data collection exercise to gather information about the population, which informs policy on housing, healthcare, and transport.
    • Scientific Studies: Researchers use sampling to study the effects of treatments or interventions on a population without needing to test everyone.

    Practice Questions

    Q1

    A manager of a leisure centre wants to find out how satisfied customers are. She decides to survey the first 20 people who arrive on a Monday morning. Give one reason why this sample is likely to be biased.

    1 mark
    foundation

    Hint: Think about who is likely to be at a leisure centre at that specific time.

    Q2

    Design a suitable question that could be used to find out how many times people visit the cinema.

    2 marks
    foundation

    Hint: Remember the two key features of a good question.

    Q3

    A college has 250 students in Year 12 and 150 students in Year 13. The principal wants to take a stratified sample of 80 students. Calculate the number of Year 13 students that should be in the sample.

    3 marks
    standard

    Hint: Use the stratified sampling formula. Total population is 250 + 150.

    Q4

    The table shows the number of members of a golf club, by gender and age.

    | | Male | Female |
    |---|---|---|
    | Junior | 40 | 20 |
    | Adult | 150 | 90 |

    A stratified sample of 60 members is required, based on gender and age. How many adult males should be in the sample?

    4 marks
    challenging

    Hint: This is a two-way stratification. First find the total number of members.

    Q5

    An online magazine asks its readers to volunteer to complete a survey about their reading habits. Explain why this method of sampling may lead to bias.

    2 marks
    standard

    Hint: Think about the type of person who would volunteer for a survey from this specific source.
