Chapter Fourteen: Measures of central tendency and variability
In the previous chapter we examined how raw data can be organized and represented by statistics so that they can be easily communicated and understood. The two types of statistics necessary for representing a frequency distribution are measures of central tendency and measures of variability.
Measures of central tendency are statistics or numbers expressing the most typical or representative scores in a distribution. Measures of variability are statistics representing the extent to which scores are dispersed (or spread out) numerically. The overall aim of this chapter is to examine the use of several types of measures of central tendency and variability commonly used in the health sciences. As quantitative evidence arising from investigations is presented in terms of these statistics, it is essential to understand these concepts.
When the data are nominal (i.e. categories), the appropriate measure of central tendency is the mode. The mode is the most frequently occurring score in a distribution. Therefore, for the data shown in Table 13.1, the mode is the ‘females’ category. The mode can be obtained by inspection of grouped data (the largest group being the mode). As we shall see later, the mode can also be calculated for continuous data.
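As a minimal sketch (with hypothetical categories, not the data of Table 13.1), the mode of nominal data can be obtained by counting the frequency of each category:

```python
# Sketch: finding the mode of nominal (categorical) data by counting frequencies.
# The categories below are hypothetical, for illustration only.
from collections import Counter

sex = ["female", "male", "female", "female", "male", "female"]
counts = Counter(sex)
mode_category, mode_frequency = counts.most_common(1)[0]  # most frequent category
print(mode_category, mode_frequency)  # female 4
```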
With ordinal, interval or ratio scaled data, central tendency can also be represented by the median. The median is the score that divides the distribution in half: half of the scores fall below the median, and half above it. That is, if the scores are arranged in an ordered array, the median is the middle score. With a large number of cases, it may not be feasible to locate the middle score simply by inspection. To locate the middle score, we can use the formula (n + 1)/2, where n is the total number of cases in a sample. This formula gives us the position of the middle score; we can then count to that position from either end of an ordered array.
In general, if n is odd, the median is the middle score; if n is even, then the median falls halfway between the two centre scores. The formula (n + 1)/2 is again used to tell us which score in an ordered array will be the median. For example:
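A minimal sketch, using assumed scores rather than the text's example, of locating the median with the (n + 1)/2 rule:

```python
# Sketch: locating the median of an ordered array with the (n + 1)/2 rule.
# The scores are assumed for illustration.
scores = sorted([12, 7, 9, 15, 11])      # odd n = 5
position = (len(scores) + 1) / 2         # (5 + 1)/2 = 3, so the 3rd score
print(scores[int(position) - 1])         # 11

scores = sorted([12, 7, 9, 15, 11, 6])   # even n = 6
position = (len(scores) + 1) / 2         # 3.5, so halfway between the 3rd and 4th scores
print((scores[2] + scores[3]) / 2)       # (9 + 11)/2 = 10.0
```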
For a grouped frequency distribution, the calculation of the median is a little more complicated. If we assume that the variable is continuous (for example, time, height, weight or level of pain), we can use a formula for calculating the median. This formula (explained in detail below) can be applied to ordinal data, provided that the variable being measured has an underlying continuity. For example, in a study of the measurement of pain reports we obtain the following data, where n = 17:
The above data can be represented by a bar graph (Fig. 14.1).
Here we can obtain the mode simply by inspection. The mode = 2 (the most frequent score). For the median, we need the ninth score, as this will divide the distribution into two equal halves (see Table 14.1). By inspection, we can see that the median will fall into category 3. Assuming underlying continuity of the variable and applying the previously discussed formula, we have:

Median = XL + i(n/2 − cum fL)/fi

where XL = real lower limit of the class interval containing the median, i = width of the class interval, n = number of cases, cum fL = cumulative frequency at the real lower limit of the interval and fi = frequency of cases in the interval containing the median.
Substituting into the above equation:
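Since the frequencies of Table 14.1 are not reproduced above, the following minimal sketch simply shows how the substitution is carried out, with assumed frequencies for the category 3 interval (real limits 2.5–3.5):

```python
# Sketch: median of a grouped, continuous distribution using
# Median = XL + i(n/2 - cum fL)/fi. The frequencies are assumed for illustration.
def grouped_median(x_lower, width, n, cum_f_below, f_in_interval):
    return x_lower + width * (n / 2 - cum_f_below) / f_in_interval

# Assume the median interval is 2.5-3.5 (real limits of category 3), with
# 7 cases below 2.5 and 5 cases within the interval, and n = 17.
print(grouped_median(x_lower=2.5, width=1, n=17, cum_f_below=7, f_in_interval=5))  # 2.8
```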
The mean (x̄ or μ) is defined as the sum of all the scores divided by the number of scores. The mean is, in fact, the arithmetic average for a distribution. The mean is calculated by the following equations:

x̄ = Σx/n (for a sample)

μ = Σx/N (for a population)

where Σx = the sum of the scores, x̄ = the mean of a sample, μ = the mean of a population, x = the values of the variable, that is the different elements in a sample or population, and n or N = the number of scores in a sample or population.
The formula simply summarizes the following ‘advice’:
To calculate the average or mean of a set of scores (x̄), add together all the scores (Σx) and divide by the number of cases (n).
Therefore, given the following sample scores:
When n or N is very large, the average is still calculated with the formula above, but usually with the assistance of a computer.
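A minimal sketch of the calculation, using assumed scores:

```python
# Sketch: the mean as the sum of the scores divided by the number of scores.
# The scores are assumed for illustration.
import statistics

scores = [4, 7, 5, 9, 10]
print(sum(scores) / len(scores))   # Σx/n = 35/5 = 7.0
print(statistics.mean(scores))     # the same result, using the standard library
```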
The mode can be used as a measure of central tendency for any level of scaling. However, since it takes into account only the most frequent score, it is not generally a satisfactory way of presenting central tendency. For example, consider the two sets of scores, A and B, shown in Table 14.2. It can be seen, either by inspection or by sketching a graph, that the two distributions are quite different, yet their modes are the same, i.e. 1.
The median divides distributions into two equal halves, and is appropriate for ordinal, interval or ratio data. For interval or ratio data, however, the mean is the most appropriate measure of central tendency. The reason for this is that in calculating this statistic, we take into account all the values in the study sample. In this way, it gives the best representation of the average score. Clearly, it is inappropriate to use the mean with nominal data, as the concept of ‘average’ does not apply to discrete categories. For example, what would be the average of 10 males and 20 females?
There is some justification for using the median as a measure of central tendency when the variable being measured is continuous. However, this is controversial, and the mean is generally preferred. On the other hand, when a distribution is highly skewed, the median might be more appropriate than the mean for representing the ‘typical’ score. Consider the distribution:
Clearly, the median and the mode are less sensitive to extreme scores, while the mean is pulled towards extreme scores. This might be a disadvantage. For example, there are seven people working in a small factory, with the following incomes per week:
The distribution of wages is highly skewed by the high income of the owner of the factory ($1900). The mean, $500, is higher than six of the seven scores; it is in no way typical of the distribution. In cases like this the median is more representative of the distribution.
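A minimal sketch of this point: only the owner's wage ($1900), the lowest wage ($100) and the mean ($500) come from the example above; the remaining wages are assumed so that the mean works out to $500.

```python
# Sketch: a single outlier pulls the mean towards it, but barely moves the median.
# Only the $1900 and $100 wages and the $500 mean come from the text;
# the other wages are assumed.
import statistics

wages = [100, 200, 250, 300, 350, 400, 1900]
print(statistics.mean(wages))    # 500 - inflated by the owner's wage
print(statistics.median(wages))  # 300 - closer to a 'typical' wage
```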
Figure 14.2 illustrates the relationships between the skew of frequency distributions and the three measures of central tendency discussed in this chapter. We should remember that the mode will always be at the highest point of the curve, the median will divide the area under the curve into two halves, and the mean is the average of all the scores in the distribution. Also, the greater the skew in a distribution, the more the measures of central tendency are likely to differ.
We have seen that a single statistic can be used to describe the central tendency of a frequency distribution. This information alone, however, is insufficient to characterize a distribution; we also need a measure of how much the scores are dispersed or spread out. The variability of discrete data is of little relevance, as the degree of variability is limited by the number of categories defined by the investigator at the beginning of measurement.
Consider the following two hypothetical distributions representing the IQs of two groups of intellectually disabled children:
It is evident that although the two means are equal (x̄A = x̄B = 60), the variability of the scores of Group A is greater than that of Group B. Insofar as IQ is related to the activities appropriate for these children, Group A will provide a greater challenge to the therapist working with the children.
The three statistics commonly used to indicate the numerical value of variability are the range, the variance and the standard deviation.
The range is the difference between the highest and lowest scores in a distribution. As we mentioned, given the IQ data above, the ranges are:
Although the range is easy to calculate, it is dependent only on the two extreme scores. In this way, we are not representing the typical variability of the scores. That is, the range might be distorted by a small number of atypical scores or ‘outliers’. Consider, for instance, the differences in the range for the data given. The earlier example of the distribution of wages shows that just one outlying score in a distribution has an enormous impact on the range ($1900 − $100 = $1800). Obviously, some measure of average variability would be a preferable index of variability.
A convenient measure of variability might be the average deviation about the mean. Consider Group B shown previously. Here x̄ = 60. To calculate the average variability about the mean, we subtract the mean from each score, sum the individual deviations, and divide by n, the number of measurements (see Table 14.3).
Table 14.3 Average variability about mean for Group B
x | x − x̄ |
---|---|
57 | −3 |
58 | −2 |
59 | −1 |
60 | 0 |
61 | +1 |
62 | +2 |
63 | +3 |
Therefore: Σ(x − x̄) = (−3) + (−2) + (−1) + 0 + 1 + 2 + 3 = 0
This is a general result; the sum of the deviations about the mean is always zero. You can demonstrate this for the deviations of Group A. The problem can be solved by squaring the deviations, as the square of a negative number is always positive. This statistic is called the sum of squares (SS) and is always a positive number. This leads to a new statistic called the variance.
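A minimal sketch using the Group B scores from Table 14.3, showing that the deviations sum to zero while the squared deviations give the sum of squares:

```python
# Sketch: deviations about the mean sum to zero; squaring them gives the
# sum of squares (SS). Scores are Group B from Table 14.3.
scores = [57, 58, 59, 60, 61, 62, 63]
mean = sum(scores) / len(scores)         # 60.0
deviations = [x - mean for x in scores]
print(sum(deviations))                   # 0.0
print(sum(d ** 2 for d in deviations))   # SS = 28.0
```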
The variance (σ² or s²) is defined as the sum of the squared deviations about the mean divided by the number of cases:

σ² = Σ(x − μ)²/N (for a population)

s² = Σ(x − x̄)²/(n − 1) (for a sample)
We divide by n − 1 when calculating the variance for a sample, where s² is used as an estimate of the population variance. Dividing by n would result in an estimate that is too small, given that a degree of freedom has been lost in calculating x̄.
For the IQ example shown above, the variance is calculated as shown in Table 14.4.
Table 14.4 Calculation of variance
x | x − x̄ | (x − x̄)² |
---|---|---|
57 | −3 | 9 |
58 | −2 | 4 |
59 | −1 | 1 |
60 | 0 | 0 |
61 | +1 | 1 |
62 | +2 | 4 |
63 | +3 | 9 |
x̄ = 60 | Σ(x − x̄) = 0 | Σ(x − x̄)² = 28 |
Substituting into the formula:
The problem with this measure of variability is that the deviations were squared; in this sense, we are overstating the spread of the scores. By taking the square root of the variance, we arrive at the most commonly used measure of variability for continuous data: the standard deviation.
The standard deviation (σ or s) is defined as the square root of the variance:
Therefore, the standard deviation for Group B is:
The size of the standard deviation reflects the spread or dispersion of the frequency distribution representing the data. Clearly, the larger σ or s, the more spread out the scores are about the mean of the distribution. Calculation of the variance or the standard deviation is extremely tedious for large n using the method shown above. Statistics texts provide a variety of calculation formulae for deriving these statistics. However, the common use of computers in research and administration makes it superfluous to discuss these calculation formulae in detail.
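As a minimal sketch of how such calculations are delegated to software, Python's statistics module gives both the population form (dividing the sum of squares by N) and the sample form (dividing by n − 1), here applied to the Group B scores:

```python
# Sketch: variance and standard deviation of the Group B scores.
import statistics

scores = [57, 58, 59, 60, 61, 62, 63]
print(statistics.pvariance(scores))  # 4.0    -> SS/N (population form)
print(statistics.variance(scores))   # ~4.67  -> SS/(n - 1) (sample estimate)
print(statistics.pstdev(scores))     # 2.0    -> square root of the population variance
print(statistics.stdev(scores))      # ~2.16  -> square root of the sample variance
```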
We have seen previously that if we are summarizing ordinal data, or interval or ratio data that are highly skewed, then the median is the appropriate measure of central tendency. The interquartile and semi-interquartile ranges are used as the measures of dispersion when the median is the appropriate measure of central tendency for a distribution. The interquartile range is the distance between the scores representing the 25th (Q1) and 75th (Q3) percentile ranks in a distribution.
It is appropriate here to define what we mean by percentiles (sometimes called centiles). The percentile or centile rank of a given score specifies the percentage of scores in a distribution falling at or below that score. As an illustration, consider Figure 14.3, in which:
The distances A and B represent the distances between the median and Q1, and between the median and Q3, respectively. When a distribution is symmetrical or normal, the distances A and B will be equal. However, when a distribution is skewed, the two distances will be quite different. The semi-interquartile range (sometimes called the quartile deviation) is half the distance between the scores representing the 25th (Q1) and 75th (Q3) percentile ranks in a distribution. Let us look at an example. If we have a sample where n = 16 and the values of the variable are:
clearly, a frequency distribution of these data is not even close to normal as the distribution is not symmetrical and the mode is at the maximum value. Therefore, the median is selected as the appropriate measure for central tendency, and we should use the interquartile range as the measure of dispersion.
Looking at the data, we find that:
where * denotes the 25th centile (first quartile, Q1), † denotes the 50th centile: median (second quartile, Q2) and ‡ denotes the 75th centile (third quartile, Q3).
The score that cuts off the first 25% of scores (25th centile) is the first quartile (Q1). Since we have n = 16, Q1 will cut off the first four scores (25% of 16 is 4).
The third quartile (Q3) is the score which cuts off 75% of the scores. As n = 16, Q3 will cut off 12 scores (75% of 16 is 12).
Therefore, the semi-interquartile range is:
The larger the semi-interquartile range, the more the scores are spread out about the median.
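A minimal sketch, using assumed scores rather than the n = 16 example above; note that the library's interpolation method may place the quartiles slightly differently from the simple counting rule used in the text.

```python
# Sketch: interquartile and semi-interquartile range for a small sample.
# The scores are assumed for illustration.
import statistics

scores = sorted([2, 3, 3, 4, 4, 5, 5, 5, 6, 6, 7, 7, 8, 9, 9, 10])  # n = 16
q1, q2, q3 = statistics.quantiles(scores, n=4)  # 25th, 50th and 75th centiles
iqr = q3 - q1                                   # interquartile range
semi_iqr = iqr / 2                              # semi-interquartile range
print(q1, q2, q3, iqr, semi_iqr)
```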
In this chapter, we discussed two essential types of statistics for representing frequency distributions: measures of central tendency and measures of variability. The measures of central tendency outlined were the mode and the median for discrete data, and the mean for continuous data. The measures of variability were the range, the average deviation, the variance and the standard deviation. These statistics condense the data to the point where the distribution of the raw scores can be meaningfully represented by just two statistics. That is, the raw data representing the outcomes of investigations or clinical measurements are expressed in this manner. We have seen that the mean and the standard deviation are most appropriate for interval or ratio data. The median and the semi-interquartile range are used when the data are measured on an ordinal scale, or when interval or ratio data have a highly skewed distribution. The mode represents the most frequent score. The chapter focused on the use and meaning of these concepts rather than on the calculations involved, as these calculations are now made by computers. In Chapter 15, we discuss the application of the mean and standard deviation for relating specific scores to an overall distribution.
Explain the meaning of the following terms:
No. of visits (x) | No. of patients (f) |
---|---|
7 | 3 |
6 | 6 |
5 | 6 |
4 | 10 |
3 | 21 |
2 | 0 |
1 | 4 |