1. What is information?
Knowing about something is called information.
Class 9 Mathematics | Punjab Curriculum and Textbook Board Syllabus 2025
Knowing about something is called information.
The collection of meaningful information as facts and numerical figures is called data.
Note: The term data handling was first used by Sir Ronald Aylmer Fisher (1890-1962), a pioneer in the field of statistics.
Information handling is the process of collecting, organizing, summarizing, analyzing, and interpreting numerical data.
It can take only specific values. Whole numbers are used to represent discrete data. It is obtained only by counting.
Example: Number of books sold by a shopkeeper, number of patients visiting a hospital in a week.
It can take every possible value in a given interval. Decimal numbers are used to represent continuous data. It is obtained only by measuring.
Example: Mass of students in a class (e.g., 28.5 kg, 26.5 kg, 27.5 kg).
Ungrouped Data | Grouped Data |
---|---|
Data that is not arranged in any systematic order (groups or classes) is called ungrouped data. It is also known as raw data. |
When data is arranged systematically into classes, it is called grouped data. Grouped data organizes raw data into intervals for clearer analysis. |
Example: 10, 5, 8, 12, 15, 20, 25, 30, ... |
Example: Classes: 5-9, 10-14, 15-19, ... with tally marks and frequencies. |
The minimum and the maximum values defined for a class or group are called class limits. The minimum value is called the lower-class limit and the maximum value is called the upper-class limit.
A frequency distribution is a distribution or table that represents classes or groups along with their respective class frequencies.
$$Range = X_{\max} - X_{\min}$$
Example: If the greatest value is 136 and the smallest is 30:
$$Range = 136 - 30 = 106$$
$${Class\ Size = \frac{Range}{Number\ of\ Classes}}$$
$$h = \frac{106}{10}$$
$$h = 10.6$$
$$h \approx 11$$
Example: 30-40, 41-51, 52-62, and so on.
Class boundaries usually are found by the following method:
Note: Class boundaries may also be obtained from the midpoints ($\mathbf{x}$) using the formula:
$$Class\ Boundaries = \ x \pm \frac{h}{2}$$
where $h$ is the difference between any two consecutive values of $x$.
A histogram is a graph of adjacent rectangles constructed on the xy-plane. It is a graph of frequency distribution.
A frequency polygon is a closed geometrical figure displaying a frequency distribution.
A midpoint is the average value of the lower and upper class limits. Midpoint is also known as the class mark. It is calculated by the formula:
$$Midpoint = \frac{Lower\ class\ limit + Upper\ class\ limit}{2}$$
The measure that gives the centre of the data is called measure of central tendency.
Therefore, measure of central tendency is used to find out the middle or central value of a data set. The measures of central tendency are:
Arithmetic Mean
Median
Mode
Weighted Mean
Arithmetic Mean (A.M.) is defined as the value of a variable which is obtained by dividing the sum of all the values (observations) by the number of observations.
The arithmetic mean of a set of values $x_{1},x_{2},x_{3},\ \ldots,x_{n}$ is denoted by $\overline{X}$ (read as "$X$-$bar$") and is calculated as:
$$\overline{X} = \frac{x_{1} + x_{2} + x_{3} + \ \ldots + x_{n}}{n}$$
$$\overline{X} = \frac{\sum X}{n}$$
Arithmetic Mean | |
---|---|
Ungrouped Data | Grouped Data |
Direct Method $$\overline{X} = \frac{Sum\ of\ all\ values\ of\ observation}{no.\ of\ observations}$$ $$\overline{X} = \frac{\sum X}{n}$$ |
Direct Method $$\overline{X} = \frac{\sum fX}{\sum f}$$ |
Indirect Method (i) Shortcut $$\overline{X} = A + \frac{\sum D}{n}$$ $D = X - A$, where $A$ is any assumed value of $X$ called assumed or provisional (ii) Coding Method $$\overline{X} = A + \frac{\sum u}{n} \times h$$ $u = \frac{X - A}{h}$, where $A$ is any assumed value of $X$ called assumed or provisional and $h$ is the class interval size for unequal intervals. |
Indirect Method (i) Shortcut $$\overline{X} = A + \frac{\sum fD}{\sum f}$$ $D = X - A$, where $A$ is any assumed value of $X$ called assumed or provisional and $X$ denotes the midpoint of class or group. (ii) Coding Method $$\overline{X} = A + \frac{\sum fu}{\sum f} \times h$$ $u = \frac{X - A}{h}$, where $A$ is any assumed value of $X$ called assumed or provisional and $h$ is the size of class interval. |
Median is the middle most value in an arranged (ascending or descending order) data set. It is the value which divides the data into two equal parts.
Median is denoted by $\widetilde{X}$ (read as $X$-$tilde$).
Ungrouped Data | Grouped Data |
---|---|
Case 1: When the number of observations is odd $$\widetilde{X} = \left( \frac{n + 1}{2} \right)^{th}\ observation$$ Case 2: When the number of observations is even $$\widetilde{X} = \frac{1}{2}\left\lbrack \left( \frac{n}{2} \right)^{th}observation + \left( \frac{n + 2}{2} \right)^{th}\ observation \right\rbrack$$ |
$$Median = l + \frac{h}{f}\left\lbrack \frac{n}{2} - c \right\rbrack$$ Where:
|
In a data set, the value (observation) which appears or occurs most often is called the mode of the data. It is the most common value.
Mode is denoted by $\widehat{X}$ (read as $X$-$hat$).
Ungrouped Data | Grouped Data |
---|---|
$$Mode = the\ most\ frequent\ observation$$ |
$$Mode = l + \frac{\left( f_{m} - f_{1} \right)}{\left( f_{m} - f_{1} \right)\left( f_{m} - f_{2} \right)} \times h$$ Where:
|
Note:
Arithmetic Mean is used when all the observations are given equal importance or weight, but there are certain situations in which the different observations get different weights.
In this situation, the weighted mean, denoted by ${\overline{X}}_{w}$ is preferred.
$${\overline{X}}_{w} = \frac{\sum WX}{\sum W}$$