| Latest Economics NCERT Notes, Solutions and Extra Q & A (Class 9th to 12th) | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9th | 10th | 11th | 12th | ||||||||||||||||
Chapter 5 Measures Of Central Tendency
This chapter introduces the concept of summarising an entire dataset with a single, representative value, known as a measure of central tendency or an "average". These measures provide a "typical" value that describes the center of a data distribution, making it easier to comprehend and compare different datasets. The chapter focuses on the three most commonly used averages: the Arithmetic Mean, the Median, and the Mode.
Each measure has unique properties and applications. The Arithmetic Mean is the most common average, calculated by summing all observations and dividing by the number of observations. It is simple to calculate but is highly sensitive to extreme values (outliers). The Median is the middle value in an ordered dataset, which divides the data into two equal halves. As a positional average, it is not affected by extreme values, making it a better measure for skewed data. The Mode is the value that occurs most frequently in the dataset and is particularly useful for describing qualitative data or identifying the most popular category. The choice of the most appropriate average depends on the purpose of the analysis and the nature of the data distribution.
Introduction to Measures of Central Tendency
Once data is collected and organised, the next step is often to summarise the entire set of data with a single, representative number. This is the purpose of a measure of central tendency, also known as an "average". These measures provide a numerical summary that helps explain the data in brief.
We use averages in our day-to-day life to understand and describe large sets of data, such as:
- Average marks obtained by students in a class.
- Average rainfall in a region.
- Average income of people in a locality.
To understand the economic condition of a farmer named Baiju in the village of Balapur, we would need to compare his land holding with that of the other 50 farmers in the village. This requires summarising the data on all land holdings into a single, typical value. A measure of central tendency provides such a value.
The Three Most Common Averages
There are several statistical measures of central tendency, but the three most commonly used are:
- Arithmetic Mean: The "average" in the ordinary sense, which considers all values in the dataset.
- Median: The middle value that divides the dataset into two equal halves.
- Mode: The value that occurs most frequently in the dataset.
The choice of which average to use depends on the nature of the data and the purpose of the analysis.
Arithmetic Mean
The Arithmetic Mean is the most commonly used measure of central tendency. It is defined as the sum of the values of all observations divided by the number of observations. It is usually denoted by $\bar{X}$.
Formula for Arithmetic Mean
If there are N observations $X_1, X_2, X_3, \dots, X_N$, the Arithmetic Mean is given by:
$ \bar{X} = \frac{X_1 + X_2 + X_3 + \dots + X_N}{N} $
This can be written in a simpler form using the summation symbol ($\Sigma$):
$ \bar{X} = \frac{\sum X}{N} $
where $\sum X$ is the sum of all observations and $N$ is the total number of observations.
Calculation of Arithmetic Mean
The method for calculating the arithmetic mean varies slightly depending on whether the data is ungrouped or grouped.
1. Arithmetic Mean for Ungrouped Data
-
Direct Method: This is the simplest method, involving summing all observations and dividing by the total number of observations.
Example 1. Calculate the arithmetic mean of the marks: 40, 50, 55, 78, 58.
Answer:
$ \bar{X} = \frac{40 + 50 + 55 + 78 + 58}{5} = \frac{281}{5} = 56.2 $
The average mark is 56.2.
-
Assumed Mean Method: When the number of observations or the figures are large, this method simplifies calculations. An "assumed mean" (A) is chosen, and deviations (d) of each observation from this mean are calculated. The formula is:
$ \bar{X} = A + \frac{\sum d}{N} $, where $d = X - A$
-
Step Deviation Method: To further simplify calculations with large numbers, deviations are divided by a common factor (c). The formula is:
$ \bar{X} = A + \frac{\sum d'}{N} \times c $, where $d' = \frac{X - A}{c}$
2. Arithmetic Mean for Grouped Data
For grouped data (both discrete and continuous series), the frequency (f) of each observation or class is considered.
- Direct Method (Discrete Series):
$ \bar{X} = \frac{\sum fX}{\sum f} $
- Direct Method (Continuous Series): Here, the mid-point (m) of each class interval is used as X.
$ \bar{X} = \frac{\sum fm}{\sum f} $
- The Assumed Mean and Step Deviation methods can also be applied to grouped data by incorporating the frequency (f) in the formulas:
Assumed Mean: $ \bar{X} = A + \frac{\sum fd}{\sum f} $
Step Deviation: $ \bar{X} = A + \frac{\sum fd'}{\sum f} \times c $
Weighted Arithmetic Mean
Sometimes, it is necessary to assign different levels of importance, or "weights" (W), to different observations. The Weighted Arithmetic Mean is calculated as:
$ \bar{X}_w = \frac{\sum WX}{\sum W} $
For example, when calculating an average price increase, more weight might be given to essential commodities that form a larger part of a consumer's budget.
Median
The Median is the positional value of a variable that divides the distribution into two equal parts. One part comprises all values greater than or equal to the median, and the other comprises all values less than or equal to it. In simple terms, it is the "middle" element when the data is arranged in ascending or descending order.
A key property of the median is that it is not affected by extreme values (outliers), making it a better measure of central tendency for skewed data compared to the mean.
Computation of Median
1. Ungrouped Data
To find the median, first, arrange the data in ascending or descending order. The position of the median is found using the formula:
Position of Median = Size of $ \left( \frac{N+1}{2} \right)^{th} $ item, where N is the number of items.
- If N is an odd number, the median is the middle value.
Example: For the data 1, 3, 4, 6, 8, 10, 12, the median is the 4th item, which is 6. - If N is an even number, there will be two middle values. The median is the arithmetic mean of these two middle values.
Example: For the data 25, 28, ..., 45, 46, ..., 65, 72, the median is $ \frac{45+46}{2} = 45.5 $.
2. Discrete Series
For a discrete series, first calculate the cumulative frequency (c.f.). Locate the position of the $ \left( \frac{N+1}{2} \right)^{th} $ item in the cumulative frequency column. The corresponding value of the variable is the median.
3. Continuous Series
For a continuous series, first identify the median class, which is the class where the $ \left( \frac{N}{2} \right)^{th} $ item lies. The median is then calculated using the following formula:
$ Median = L + \frac{\frac{N}{2} - c.f.}{f} \times h $
where:
- $L$ = Lower limit of the median class.
- $N$ = Total frequency ($\sum f$).
- $c.f.$ = Cumulative frequency of the class preceding the median class.
- $f$ = Frequency of the median class.
- $h$ = Magnitude (width) of the median class interval.
Quartiles and Percentiles
- Quartiles are measures that divide the data into four equal parts.
- First Quartile ($Q_1$): 25% of the items are below it. $Q_1 = \text{size of } \left( \frac{N+1}{4} \right)^{th} \text{ item}$.
- Second Quartile ($Q_2$): This is the Median itself.
- Third Quartile ($Q_3$): 75% of the items are below it. $Q_3 = \text{size of } 3 \left( \frac{N+1}{4} \right)^{th} \text{ item}$.
- Percentiles divide the distribution into one hundred equal parts (e.g., $P_1, P_2, \dots, P_{99}$). $P_{50}$ is the median.
Mode
The Mode is the value that occurs most frequently in a data series. It represents the most typical or fashionable value in a distribution. It is denoted by $M_o$. The mode is particularly useful for describing qualitative data or for identifying the most popular item in a set.
Example: For a shoe manufacturer, the modal shoe size is the most important average because it indicates the size with the maximum demand.
Computation of Mode
1. Discrete Series
In a discrete series, the mode is simply the value with the highest frequency. A distribution can be:
- Unimodal: Has one mode. (e.g., 1, 2, 3, 4, 4, 5)
- Bimodal: Has two modes. (e.g., 1, 2, 2, 3, 4, 4, 5)
- Multimodal: Has more than two modes.
- No Mode: If no value appears more frequently than any other.
2. Continuous Series
For a continuous series, first identify the modal class, which is the class with the highest frequency. The mode is then calculated using the following formula:
$ M_o = L + \frac{D_1}{D_1 + D_2} \times h $
where:
- $L$ = Lower limit of the modal class.
- $D_1$ = Difference between the frequency of the modal class and the frequency of the preceding class.
- $D_2$ = Difference between the frequency of the modal class and the frequency of the succeeding class.
- $h$ = Class interval of the modal class.
To use this formula, the class intervals should be equal and the series should be in an exclusive form.
Relative Position of Mean, Median, and Mode
The relative position of the Arithmetic Mean ($\bar{X}$), Median ($M_i$), and Mode ($M_o$) depends on the shape of the frequency distribution.
-
Symmetrical Distribution: In a perfectly symmetrical distribution (like a bell-shaped curve), the Mean, Median, and Mode are all equal.
$ \bar{X} = M_i = M_o $
-
Asymmetrical (Skewed) Distribution: In a skewed distribution, these three values are not equal. The median is always located between the arithmetic mean and the mode.
- For a positively skewed distribution (tail to the right): $ \bar{X} > M_i > M_o $
- For a negatively skewed distribution (tail to the left): $ \bar{X} < M_i < M_o $
Conclusion: Choosing the Right Average
Measures of central tendency summarise a dataset into a single, most representative value. The choice of which average to use is crucial and depends on the purpose of the analysis and the nature of the data distribution.
- The Arithmetic Mean is simple to calculate and is based on all observations, but it is heavily influenced by extreme values (outliers).
- The Median is a better summary for data with extreme values as it is a positional average and not affected by them. It is also suitable for open-ended distributions.
- The Mode is the most appropriate measure for describing qualitative data or identifying the most common value.
Both the Median and Mode can be determined graphically, which is another advantage in certain contexts.
NCERT Questions Solution
Question 1. Which average would be suitable in the following cases?
(i) Average size of readymade garments.
(ii) Average intelligence of students in a class.
(iii) Average production in a factory per shift.
(iv) Average wage in an industrial concern.
(v) When the sum of absolute deviations from average is least.
(vi) When quantities of the variable are in ratios.
(vii)In case of open-ended frequency distribution.
Answer:
Question 2. Indicate the most appropriate alternative from the multiple choices provided against each question.
(i) The most suitable average for qualitative measurement is
(a) arithmetic mean
(b) median
(c) mode
(d) geometric mean
(e) none of the above
(ii) Which average is affected most by the presence of extreme items?
(a) median
(b) mode
(c) arithmetic mean
(d) none of the above
(iii) The algebraic sum of deviation of a set of n values from A.M. is
(a) n
(b) 0
(c) 1
(d) none of the above
Answer:
Question 3. Comment whether the following statements are true or false.
(i) The sum of deviation of items from median is zero.
(ii) An average alone is not enough to compare series.
(iii) Arithmetic mean is a positional value.
(iv) Upper quartile is the lowest value of top 25% of items.
(v) Median is unduly affected by extreme observations.
Answer:
Question 4. If the arithmetic mean of the data given below is 28, find (a) the missing frequency, and (b) the median of the series:
| Profit per retail shop (in Rs) | Number of retail shops |
|---|---|
| 0-10 | 12 |
| 10-20 | 18 |
| 20-30 | 27 |
| 30-40 | - |
| 40-50 | 17 |
| 50-60 | 6 |
Answer:
Question 5. The following table gives the daily income of ten workers in a factory. Find the arithmetic mean.
| Workers | A | B | C | D | E | F | G | H | I | J |
|---|---|---|---|---|---|---|---|---|---|---|
| Daily Income (in Rs) | 120 | 150 | 180 | 200 | 250 | 300 | 220 | 350 | 370 | 260 |
Answer:
Question 6. Following information pertains to the daily income of 150 families. Calculate the arithmetic mean.
| Income (in Rs) | Number of families |
|---|---|
| More than 75 | 150 |
| ,, 85 | 140 |
| ,, 95 | 115 |
| ,, 105 | 95 |
| ,, 115 | 70 |
| ,, 125 | 60 |
| ,, 135 | 40 |
| ,, 145 | 25 |
Answer:
Question 7. The size of land holdings of 380 families in a village is given below. Find the median size of land holdings.
| Size of Land Holdings (in acres) | Number of families |
|---|---|
| Less than 100 | 40 |
| 100–200 | 89 |
| 200–300 | 148 |
| 300–400 | 64 |
| 400 and above. | 39 |
Answer:
Question 8. The following series relates to the daily income of workers employed in a firm. Compute (a) highest income of lowest 50% workers (b) minimum income earned by the top 25% workers and (c) maximum income earned by lowest 25% workers.
| Daily Income (in Rs) | Number of workers |
|---|---|
| 10–14 | 5 |
| 15–19 | 10 |
| 20–24 | 15 |
| 25–29 | 20 |
| 30–34 | 10 |
| 35–39 | 5 |
Answer:
Question 9. The following table gives production yield in kg. per hectare of wheat of 150 farms in a village. Calculate the mean, median and mode values.
| Production yield (kg. per hectare) | Number of farms |
|---|---|
| 50–53 | 3 |
| 53–56 | 8 |
| 56–59 | 14 |
| 59–62 | 30 |
| 62–65 | 36 |
| 65–68 | 28 |
| 68–71 | 16 |
| 71–74 | 10 |
| 74–77 | 5 |
Answer: