Measures of Central Tendency for Grouped Data

CSEC Mathematics: Finding the Center

Essential Understanding: Measures of central tendency tell us where the "center" or "typical" value of a dataset lies. For grouped data, we cannot find exact values, but we can estimate them using class midpoints. The three main measures are the mean, median, and mode, each with its own strengths and applications.

🔑 Key Skill: Estimating Mean from Grouped Data
📈 Exam Focus: Finding Modal Class and Median Class
🎯 Problem Solving: Interpolation for Median

The Three Measures of Central Tendency

📊

Mean (Arithmetic Average)

Definition: The sum of all values divided by the number of values. For grouped data, we use class midpoints as representative values.

Formula:

$$ \bar{x} = \frac{\sum fx}{\sum f} $$

Where: \( f \) = frequency, \( x \) = class midpoint

📈

Mode

Definition: The value that appears most frequently. For grouped data, we identify the modal class (class with highest frequency).

For Grouped Data:

  • Identify: Class with highest frequency
  • Cannot give exact value
  • Best for identifying the most common range
📉

Median

Definition: The middle value when data is ordered. For grouped data, we find the median class and estimate using interpolation.

Median Position:

$$ \text{Median Position} = \frac{n}{2} $$

Where \( n \) = total frequency

Calculating the Mean for Grouped Data

Formula for Grouped Mean

Since we don't have individual values in grouped data, we assume all values in a class are represented by the class midpoint:

$$ \bar{x} = \frac{\sum (f \times x)}{\sum f} $$

Where: \( f \) = frequency, \( x \) = class midpoint, \( \sum f = n \) (total frequency)

📝 Worked Example: Calculating the Mean

Given Frequency Table:

Class Interval Frequency (f) Midpoint (x) f × x
40-49344.5133.5
50-59754.5381.5
60-69664.5387.0
70-79774.5521.5
80-89584.5422.5
90-99294.5189.0
TOTAL302035

Solution:

Step 1: Calculate midpoints: \( x = \frac{\text{lower} + \text{upper}}{2} \)

Step 2: Calculate \( f \times x \) for each class

Step 3: Sum the frequencies: \( \sum f = 30 \)

Step 4: Sum the products: \( \sum fx = 2035 \)

Step 5: Calculate the mean:

$$ \bar{x} = \frac{2035}{30} = 67.83 $$

Estimated mean mark = 67.83 marks

Interactive Mean Calculator

Class Interval Frequency (f) Midpoint (x) f × x
40-4944.5133.5
50-5954.5381.5
60-6964.5387.0
70-7974.5521.5
80-8984.5422.5
90-9994.5189.0
TOTAL302035

Estimated Mean (x̄)

67.83

Finding the Modal Class

Modal Class Identification

The modal class is simply the class interval with the highest frequency.

This is straightforward - no calculation needed, just observation!

📝 Example: Identifying the Modal Class

From our frequency table:

  • 40-49: frequency 3
  • 50-59: frequency 7
  • 60-69: frequency 6
  • 70-79: frequency 7
  • 80-89: frequency 5
  • 90-99: frequency 2

Solution:

Both 50-59 and 70-79 have the highest frequency of 7.

Modal classes are 50-59 and 70-79 (bimodal distribution)

When frequencies are tied, there can be multiple modal classes!

Estimating the Median

Finding the Median Class

The median is the middle value of the ordered dataset. For grouped data:

1
Find Median Position: \( \frac{n}{2} \) where \( n = \sum f \) (total frequency)
2
Find Cumulative Frequency: Add up frequencies running total until you exceed the median position.
3
Identify Median Class: The class where the cumulative frequency first exceeds or equals \( \frac{n}{2} \).
4
Interpolate: Use linear interpolation to estimate the median value within that class.

Median Interpolation Formula

$$ \text{Median} = L + \left(\frac{\frac{n}{2} - CF}{f}\right) \times w $$

Where:

  • \( L \) = Lower boundary of median class
  • \( n \) = Total frequency
  • \( CF \) = Cumulative frequency before median class
  • \( f \) = Frequency of median class
  • \( w \) = Class width

📝 Worked Example: Finding the Median

Using our frequency table:

Class Frequency (f) Cumulative Frequency
40-4933
50-59710
60-69616
70-79723
80-89528
90-99230

Solution:

Step 1: \( n = 30 \), Median position = \( \frac{30}{2} = 15 \)

Step 2: Cumulative frequencies: 3, 10, 16, 23...

Step 3: The 15th value falls in the class 70-79 (cumulative 23 exceeds 15)

- Lower boundary (L) = 69.5

- Cumulative frequency before (CF) = 16

- Frequency of median class (f) = 7

- Class width (w) = 10

Step 4: Interpolation

$$ \text{Median} = 69.5 + \left(\frac{15 - 16}{7}\right) \times 10 $$ $$ \text{Median} = 69.5 + \left(\frac{-1}{7}\right) \times 10 $$ $$ \text{Median} = 69.5 - 1.43 = 68.07 $$

Estimated median = 68.07 marks

Interactive Median Calculator

Class Interval Frequency (f) Cumulative Frequency
40-493
50-5910
60-6916
70-7923
80-8928
90-9930

Modal Class

50-59 & 70-79

Estimated Median

68.07

Key Examination Insights

Common Mistakes

  • Using class limits instead of class boundaries in median formula
  • Forgetting to multiply frequency by midpoint (just summing midpoints)
  • Using n instead of n/2 for median position
  • Confusing cumulative frequency before vs at the median class

Success Strategies

  • Always use class midpoints for mean calculations
  • For median, always use class BOUNDARIES (lower - 0.5, upper + 0.5)
  • Label the median class clearly in your working
  • Mean = Σfx/Σf, Median requires interpolation formula

CSEC Practice Arena

Test Your Understanding

1
For grouped data, what value is used to represent all values in a class when calculating the mean?
The lower class limit
The upper class limit
The class midpoint
The class boundary
Explanation: The class midpoint (average of lower and upper limits) represents all values in that class when calculating the mean of grouped data.
2
A class has frequency 8 and cumulative frequency before it is 22. Total n = 50. What is the median position?
22
25
30
50
Solution: Median position = n/2 = 50/2 = 25. The median falls in this class because the cumulative frequency before (22) is less than 25, and 22 + 8 = 30 exceeds 25.
3
Which formula is correct for estimating the median from grouped data?
L + ((n/2 - CF) / w) × f
L + ((CF - n/2) / f) × w
L + ((n/2 - CF) / f) × w
L + ((f - CF) / n) × w
Explanation: The correct interpolation formula is: Median = L + ((n/2 - CF) / f) × w, where L = lower boundary, CF = cumulative frequency before median class, f = frequency of median class, w = class width.
🎯

CSEC Examination Mastery Tip

Working Method: In the CSEC exam, always show your working for central tendency calculations:

  • Create an extended frequency table with midpoint column and fx column
  • Show Σf and Σfx calculations
  • For median, clearly show cumulative frequency column
  • State which is the modal class and median class
  • Use the interpolation formula with all values substituted
Scroll to Top