Statistics — Study Notes for SOF IMO

Overview

Statistics is the branch of mathematics dealing with collection, organization, analysis and interpretation of numerical data. For SOF IMO, this topic focuses on measures of central tendency (mean, median, mode) for both ungrouped and grouped data, along with graphical representation of data through bar graphs, histograms, frequency polygons and pie charts.

Statistics problems regularly appear in the Mathematical Reasoning section and occasionally in the Achievers Section where data interpretation meets real-world scenarios. Students must master quick calculation techniques for mean, median and mode, understand frequency distribution tables, and interpret various graphical formats. The topic bridges pure mathematics with everyday applications, making it both conceptually important and practically relevant for competitive exam success.

Strong performance in statistics requires accuracy in arithmetic operations, careful reading of frequency tables, and the ability to extract information from graphs quickly. Students should focus on identifying which measure of central tendency best represents a given dataset and practice converting raw data into grouped frequency distributions.

Key Concepts

**Ungrouped data** consists of individual observations listed separately, while **grouped data** organizes observations into class intervals with their frequencies. Ungrouped data is easier to handle but becomes unwieldy for large datasets.

**Mean (arithmetic average)** represents the sum of all observations divided by their count. For grouped data, we use class marks (midpoints) multiplied by frequencies, making it sensitive to extreme values.

**Median** is the middle value when data is arranged in ascending or descending order, making it resistant to outliers. For even number of observations, median equals the average of the two middle values.

**Mode** is the observation or class interval with the highest frequency. A dataset can be unimodal (one mode), bimodal (two modes) or multimodal, while some datasets have no mode if all frequencies are equal.

**Class mark (midpoint)** for any class interval equals (lower limit + upper limit)/2 and serves as the representative value for that entire class in grouped data calculations.

**Cumulative frequency** is the running total of frequencies up to a particular class, essential for finding the median in grouped data and constructing cumulative frequency curves (ogives).

**Range** measures data spread as the difference between maximum and minimum values, providing a simple indicator of variability alongside central tendency measures.

Practice this topic

Take a full mock →

Q1 · Statistics · EASY
The marks obtained by 8 students in a test are: 12, 15, 18, 12, 20, 15, 12, 16. What is the mode of this data?
Q2 · Statistics · MEDIUM
The mean of 5 numbers is 24. If one number is excluded, the mean of the remaining 4 numbers becomes 22. What is the excluded number?
Q3 · Statistics · HARD
The following frequency distribution shows the ages of 50 people in a locality: Age (in years): 10-20, 20-30, 30-40, 40-50, 50-60 Number of people: 5, 12, 18, 10, 5 What is the median class of this grouped data?
Q4 · Statistics · EASY
The mean of 8, 12, 16, x, 20 and 24 is 16. What is the value of x?
Q5 · Statistics · MEDIUM
The median of the data 15, 12, 18, 21, x, 24 arranged in ascending order is 19. What is the value of x?

Ask Shishya to explain these →

Notes generated on 10 May 2026

Statistics

Statistics — Study Notes for SOF IMO

Overview

Key Concepts

Need more? Ask Shishya

Practice this topic

Formulas / Key Facts

Worked Examples

Common Mistakes

Quick Reference