Grouped Data and Frequency Tables
CSEC Mathematics: Organizing Data
Essential Understanding: When dealing with large datasets, raw data is difficult to analyze. Grouping data into classes and creating frequency tables allows us to identify patterns, trends, and characteristics that would be hidden in unorganized data. This is the foundation for all statistical analysis in Module 3.
Understanding Data Types
Discrete Data
Definition: Data that can only take specific, separate values. These are usually counts or measurements that cannot be subdivided meaningfully.
Examples:
- Number of students in a class (1, 2, 3...)
- Number of cars passing a point (0, 1, 2...)
- shoe sizes (6, 7, 8, 9...)
Continuous Data
Definition: Data that can take any value within a range. These are usually measurements that can be subdivided infinitely.
Examples:
- Height of students (150.5cm, 162.3cm...)
- Time taken to complete a task
- Weight of packages
Grouped Data
Definition: Data that has been organized into groups (classes) rather than listed individually. Essential when dealing with large datasets or continuous data.
When to Use:
- More than 20 data points
- Continuous data values
- When individual values are not needed for analysis
Essential Terminology for Grouped Data
Key Class Features
Understanding these terms is crucial for constructing and interpreting grouped data tables.
| Term | Definition | Example (for 50-59) |
|---|---|---|
| Class Interval | The range of values that form a group | 50-59 (width = 10) |
| Class Limits | The smallest and largest values that can belong to a class | Lower limit = 50, Upper limit = 59 |
| Class Boundaries | Precise boundaries that separate classes (half-units beyond limits) | 49.5 - 59.5 |
| Class Width | Upper boundary - Lower boundary | 59.5 - 49.5 = 10 |
| Class Midpoint | The center value of a class (used for calculations) | (50 + 59) ÷ 2 = 54.5 |
| Class Frequency | Number of data values in each class | e.g., 8 students scored 50-59 |
Constructing a Frequency Table
Step-by-Step Process
📝 Worked Example: Exam Scores
Problem: The following are the marks obtained by 30 students in a Mathematics test (out of 100):
45, 67, 72, 55, 89, 91, 63, 78, 82, 54, 67, 73, 58, 85, 92, 41, 69, 77, 84, 88, 52, 76, 81, 95, 48, 64, 79, 83, 90, 59
Solution:
Step 1: Find Range
Maximum = 95, Minimum = 41
Range = 95 - 41 = 54
Step 2: Choose Number of Classes
We choose 6 classes (a common choice for 30 data points).
Step 3: Calculate Class Width
Width = 54 ÷ 6 = 9
We round up to 10 for convenience.
Step 4: Determine Class Boundaries
Start at 40 (slightly below minimum of 41)
Classes: 40-49, 50-59, 60-69, 70-79, 80-89, 90-99
Step 5: Tally and Count
Interactive Frequency Table Builder
Data Entry (comma-separated):
Constructed Frequency Table
| Class Interval | Tally | Frequency (f) | Class Midpoint (x) | Class Boundaries |
|---|---|---|---|---|
| 40 - 49 | ||| | 3 | 44.5 | 39.5 - 49.5 |
| 50 - 59 | |||| || | 7 | 54.5 | 49.5 - 59.5 |
| 60 - 69 | |||| | | 6 | 64.5 | 59.5 - 69.5 |
| 70 - 79 | |||| || | 7 | 74.5 | 69.5 - 79.5 |
| 80 - 89 | |||| || | 5 | 84.5 | 79.5 - 89.5 |
| 90 - 99 | || | 2 | 94.5 | 89.5 - 99.5 |
| TOTAL | 30 |
Important Checks
Always verify:
- Sum of all frequencies must equal total number of data points
- Class boundaries should not overlap and should be continuous
- All data values should fall within some class
Key Examination Insights
Common Mistakes
- Confusing class limits with class boundaries (59 is included in 50-59, but 59.5 is the upper boundary)
- Forgetting to include the endpoint in class intervals (always clarify if inclusive or exclusive)
- Calculating midpoint as (lower + upper) instead of using boundaries when precision matters
Success Strategies
- Always check that your class width is consistent throughout
- Use class midpoints for all calculations involving grouped data
- Draw the table first, then fill in data systematically
CSEC Practice Arena
Test Your Understanding
CSEC Examination Mastery Tip
Class Boundary Convention: CSEC examiners expect you to understand that for inclusive class intervals like 10-19, 20-29, the value 19.5 is the upper boundary of 10-19, and 20.5 is the lower boundary of 20-29. This prevents overlap and gaps. Always calculate boundaries as (upper limit + 0.5) and (lower limit - 0.5) for integer data.
