MEASURES OF DISPERSION

WEEK 4

SUBJECT: FURTHER MATHEMATICS

CLASS: S.S 1 THIRD TERM

TOPIC: MEASURES OF DISPERSION

CONTENT:

  1. Range
  2. Inter-quartiles
  3. Mean deviation
  4. Standard deviation
  5. Variance
  6. Coefficient of variation

Range

Measures of Dispersion

The measure of dispersion (also called measure of variation) is concerned with the degree of spread of the numerical value of a distribution.

The Range: This is the difference between the maximum and minimum values in the data.

Examples 1:

Find the range of the data 6, 6, 7, 9, 11, 13, 16, 21 and 32

Solution: The maximum item is 32

The minimum item is 6

∴ Range = 32 – 6 = 26

Examples 2:

The following shows the distribution of mails obtained by 20 students in a test:

58 55 52 56 64 71 64 53 54 67

66 68 78 52 65 57 61 65 59 52

Find the range.

Solution

The largest value in the distribution = 78

The smallest value in the distribution = 52

Range = 78 – 52

= 26

Inter-quartiles

The difference between the values of the third and first quartiles is called the inter-quartile range. In equation form, we state this as:

Inter-quartile range = Q3 – Q1

Inter-quartile range is a measure of spread.

Quartile deviation

One-half the interquartile rang is another measure of spread and is called quartile deviation. Another name for quartile deviation is semi-interquartilerange. In equation form, we state this as:

Quartile deviation (Semi-interquartile range)=(Q3 – Q1)

Recall that Q1, Q2and Q3 can be obtained from a cumulative frequency curve.

Compute to find their position on the cumulative frequency (CF) axis using the following formulae:

(a) For lower quartile or first quartile () we use

(b) For median quartile or second quartile (), we use

(c) For upper quartile or third quartile (), we use (Total frequency or last CF)

Cumulative frequency

Upper class boundaries

STEP 2: Locate the point on the cumulative frequency axis and draw a horizontal line from this point to intersect the Ogive.

STEP 3: At the point it intersect the Ogive, draw a line parallel to the cumulative frequency axis to intersect the horizontal axis.

STEP 4: Read the value of the desired quartile at the point of intersection of the vertical line and the horizontal axis.

Inter-quartile range =

Semi inter-quartile range =

Inter –Percentile Range:- This is the difference between the 90th percentile and 10th percentile. Usually denoted by 10 90 percentile Range = P90 P10.

Note: Interpolation formula on lower and upper quartiles should be studied/taught by student/teacher exhaustively.

CLASS ACTIVITY

Choose the correct answer from the options:

  1. The highest mark in a mathematics class is 80% and the lowest mark in the class is 25%. What is the range of the marks in the class?
  2. 105% (b) 52.5% (c) 55% (d) 6.85%
  3. In a distribution, the lower quartile is found to be 41.5 and the upper quartile is 55.2. What is the interquartile range of the distribution?
  4. 41.5 (b) 13.7 (c)48.35 (d) 6.85

Mean deviation

Also called average deviation of a set of numbers x1 ,x2…….xn is defined by

Where is the arithmetic average of the numbers and is the absolute value of the deviation of from .n is n is the number of terms.

Example 1: Find the mean deviation of the numbers 3,8,4,5,1,9

Arithmetic mean,

=

= 5

M.D. |3-5|+|8-5|+|4-5|+|5-5|+|1-5|+|9-5|)

MEAN DEVIATION OF A GROUP DATA.

Example2:Find the mean deviation of the distribution:

4,3,5,2,6,3,7,5,1,5,4,6,5,1,8.

x F f(x) |x – F|x – |
1 2 2
2 1 2
3 2 6
4 2 8
5 4 20
6 2 12
7 1 7
8 1 8
15 65 16

STANDARD DEVIATION:

The standard deviation of a distribution xi,x2…….xn denoted by S is defined by the formula;

For grouped data

Example 1:

Find the standard deviation of the set of number 8,9,4,3,1,5

Solution:

X
8 3 9
9 4 16
4 -1 1
3 -2 4
1 -4 16
5 0 0
Total 46

Using alternative method

Solution:

X X
8 64
9 81
4 16
3 9
1 1
5 25
30 196

= 2.77

Example 2:

The table below shows the masses of 200 students in a school. Calculate the standard deviation.

Mass in kg F
0-9 24
10-19 68
20-29 63
30-39 18
40-49 10
50-59 8
60-69 5
70-79 5

Solution :

Mass in kg F c- f(
0-9 24 4.5 -40 -960
10-19 68 14.5 -30 -2040
20-29 63 24.5 -20 -1260
30-39 18 34.5 -10 -180
40-49 10 44.5 0 0
50-59 8 54.5 10 80
60-69 5 64.5 20 100
70-79 4 74.5 30 120
200 -4140

Let =44.5

= 44.5 +

X f
4.5 -19.3 24 372.49 8939.76
14.5 -9.3 68 86.49 5881.76
24.5 0.7 63 0.49 30.87
34.5 10.7 18 114.49 2060.82
44.5 20.7 10 428.49 4284.9
54.5 30.7 8 942.49 7539.92
64.5 40.7 5 1656.49 8282.45
74.5 50.7 4 2560.49 10241.96
200 47261.00

.

Using the alternative method

or

X F Fx2 F(x)
4.5 24 20.25 486 108
14.5 68 210.25 14297 987
24.5 63 600.25 37815.75 1543.5
34.5 18 1990.25 21424.5 621
44.5 10 1980.25 19802.5 445
54.5 8 2970.25 23762 436
64.5 5 4160.25 20801.25 322.5
74.5 4 5550.25 22201 298

; ; = 22657600;

Variance

This is also called the mean squared deviation. Variance is therefore the square of the standard deviation. This can be computed using the formula,

This means that the variance of the example above is (15.38)2.

Example 1:

The table gives the distribution of marks obtained by a group of 10 students in a test. Calculate the variance of the distribution.

Marks 1 3 5 7 9
No. of students 4 2 2 1 1

Solution

x x2 f fx f x2
1 1 4 4 4
3 9 2 6 18
5 25 2 10 50
7 49 1 7 49
9 81 1 9 81

Ʃf = 10, Ʃfx = 36, Ʃfx2= 202,

Let the variance be s2

=

= 20.2 – 3.62

= 20.2 – 12.96

= 7.24

Example 2:

Calculate the variance of the numbers 19, 21, 27 and37

Solution

X
19 361
21 441
27 729
37 1369

,

Let the variance be then

=

= 725 – 676

= 49

CLASS ACTIVITY

  1. Marks of some students is tabulated below :
X 0 1 2 3 4 5 6 7 8 9 10
F 1 2 2 0 5 6 8 5 3 2 1

Use this to find:

  1. Range
  2. Mean deviation
  3. Standard deviation of the distribution
  4. Use this set of numbers 5, 3, 6, 8, 3 to answer questions (iii)—–(v)
  5. The standard deviation S.D is

(a) 1 (b) 1.9 (c) 1.6 (d) 0.8

  1. The mean deviation is
  2. 1.6 (b) 1.0 (c) 1.6 (d) 0.8
  3. The variance of the distribution is
  4. 1 (b) 0.64 (c) 2.56 (d) 3.6

Coefficient of variation

So far we have studied the various measures of dispersion of data in distributions. We can see that the most reliable of them all is the standard deviation. The standard deviation though reliable yet it has its own deficiency in that if we which one is most widely spread, the data that has bigger values will have a bigger standard deviation than the other one with lesser values. Any decision taking from such comparison will be misleading. In order to overcome this, we then find coefficient of variation. This one is independent of magnitude for better comparison.

How then do we find coefficient of variation?

Coefficient of variation= X 100%

Coefficient of standard deviation =

Example 1: given two sets of the following numbers, find which one is more widely dispersed.

A = 2, 3, 1, 4, 5, 6, 7, 25, 9, 8

B = 32, 45, 40, 38, 47, 50, 43, 36, 44, 35

Solution

Let A represents the mean of A

Let B represents the mean of A

Let the standard deviation for these sets be represented by SA, SB.

A = = 7

B = = 41

SA =

= 6.48

SB =

SB = = 5.46

The coefficient of standard deviation of A =

=

The coefficient of standard deviation of B =

=

The coefficient of variation is multiplied by 100%. A = 92.6% and B = 13.3%

From the calculation Data A is more widely dispersed than data B.

Let us get a new set of B by multiplying by 5 throughout and call it data C.

= 160 + 225 + 200 + 190 + 235 + 250 + 215 + 180 + 220 + 175 ÷ 10 = 205

Sc =

= 2050 + 400 + 25 + 225 + 900 + 2050 + 100 + 625 + 225 + 900

= 7450 ÷ 205

= 27.29

The coefficient of standard deviation of C = 0.133

C .V = 13.3%

From the example above coefficient of variation does not depend on the large numbers. This makes if more reliable.

CLASS ACTIVITY

  1. An epidemic affected a village and the following was the record
Age 10 – 12 12 – 14 14 – 16 16 – 18 18 – 20 20 – 22 22 -24
No Affected 5 12 20 26 12 1 4
  1. Which age – group was most affected
  2. Calculate the standard deviation
  3. Calculate also the coefficient variation (C.V)
  4. Find the Mean, Standard deviations and coefficient of variations of the following distribution
X 4 5 6 7 8 9 10
F 1 3 4 9 8 3 2
  1. What is the Mean, Standard deviations and coefficient of variations of the following distribution?
X 5 6 7 8 9 10 11
F 1 3 4 9 8 3 2

What can you notice about questions 2 and 3?

  1. The following values of a quantity x were obtained experimentally.
X 12 13 14 15 16 17 18 19 20
F 2 1 4 7 10 12 8 5 1

Calculate the mean and the standard deviation. Hence compute the coefficient of standard deviation and coefficient of variation.

  1. Compute (i) the variance (ii) the standard deviation of the following test scores.
Marks 10 12 13 14 15 16
Frequency 18 4 6 12 6 4

PRACTICE QUESTIONS

  1. The table below shows the distribution of the waiting times for some customers in a certain petrol station.

Waiting time (in min) Number of customers

1.5 – 1.9 3

2.0 – 2.4 10

2.5 – 2.9 18

3.0 – 3.4 10

3.5 – 3.9 7

4.0 – 4.4 2

 

(a) Write down the class boundaries of the distribution.

(b) Construct a cumulative frequency curve for the data.

(c) Using your graph, estimate:

(i) the interquartile range of the distribution.

(ii) the proportion of customers who would have waited for more than 3 minutes.

  1. The frequency distribution of the weight of 100 participants in a high jump competition is as shown below:
Weight (kg) 20-29 30-39 40-49 50-59 60-69 70-79
Number of participants 10 18 22 25 16 9

(a) Construct the cumulative frequency table

(b) Draw the cumulative frequency curve

(c) From the curve, estimate the:

(i) median

(ii) semi-interquartile range

(iii) probability that a participant chosen at random weighs at least 60kg.

  1. The following table shows the distribution of marks obtained by some students in an examination.
Marks 0-9 10-19 20-29 30-39 40 -49 50 – 59 60-69 70-79 80-89 90-99
No. of students 50 50 40 60 100 100 50 25 15 10
  1. Construct a cumulative frequency table for the distribution.
  2. Draw an ogive for the distribution.
  3. Use your graph in (b) to determine the;
  4. Semi-inter quartile range;
  5. Number of students who failed, if the pass mark for the examination is 37;
  6. Probability that a student selected at random scored between 20% and 60%.