Page 31 

Chapter Three Sampling methods and external validity

CHAPTER CONTENTS

Introduction 31
What is sampled in a study 32
Basic issues in sampling 32
Representative samples 32
Incidental samples 33
Random and systematic samples 34
Sample size 35
Sampling error 36
Sampling issues in qualitative research 37
Purposive sampling 37
External validity and sampling 39
Summary 40
Self-assessment 40
True or false 40
Multiple choice 41

Introduction

Research in the health sciences usually involves the collection of information from a sample of participants, rather than on the entire population in which the investigator is interested. Studies that involve an entire population or group are called census studies but these are relatively rare and generally expensive to perform. A sample drawn from the target population is studied because it is usually impossible or too costly to study entire populations. For instance, when individuals who have conditions such as diabetes, cerebral palsy or emphysema are being studied, it is not possible to study everyone because of the large size of such populations. Also because many people do not seek treatment or may be wrongly diagnosed, we may not be able to identify all members of the entire population in order to study them. Therefore, in most research, the researcher studies a subset or sample of the target population, and then attempts to generalize the findings to the population from which the participants were drawn. This general principle applies to both qualitative and quantitative studies.

The aim of this chapter is to examine ways that samples can be drawn to permit the investigator to make valid generalizations from the study sample to the target population. We will also consider the question of generalizing the findings of an investigation to other samples and situ-ations. This is referred to as ‘external validity’ or ‘generalizability’.

  Page 32 

The specific aims of this chapter are to:

1. Define what is meant by sampling and representative samples.
2. Outline the relative advantages and disadvantages of commonly used sampling methods.
3. Discuss the relationship between sampling error and sample size.
4. Examine the concept of external validity for generalizing research findings to other settings.

What is sampled in a study

While this chapter will focus on the selection of the research participants in a study, many other things are also selected or sampled. These include:

1. the information to be collected by the researcher
2. the procedures used for the collection of the information
3. where the research is conducted, e.g. in a field setting or in a structured research setting
4. the clinicians and researchers who are involved.

Many researchers focus on the selection of the research participants as the key or only issue in maximizing the generalizability of their research, and they do not pay enough attention to the other factors they are sampling or the context in which the research is being conducted. It is not at all unusual to see studies that employ large and sophisticated participant samples, yet with only one or two highly selected clinicians involved in the research in perhaps only one health setting. While the study sample may be highly representative, the context in which the research is conducted may not be and it may be that the researchers and clinicians involved in the study have a particularly unusual or idiosyncratic approach to their work that is not reflective of others. It is our contention that in many qualitative studies there is strong consider-ation of the research context and its impact upon the research findings. However, there is often less emphasis upon sampling of research participants. This impacts upon the ability to generalize the findings more broadly from the actual research participants to other groups.

Basic issues in sampling

As we have discussed, often, because of the numbers involved, it is not within the resources of the researcher to study the whole target population. In any event, in most situations it would be wasteful to study all of the population. If a sample is representative, one can generalize validly from the sample’s results to the population without going to the expense of studying everyone.

The population is the target group of individ-uals or cases in which the researcher is interested. Examples of valid study populations include: all English women under 25; all children with diagnosed spina bifida in the state of Alberta; all the students at a particular Australian college. The researcher defines the population to which he/she wishes to generalize. Note that a population need not consist of human or animal subjects. Objects or events can also be sampled, as shown in Table 3.1.

Table 3.1 Examples of populations and samples

Population Possible sample selection
All working podiatrists in a state 50 podiatrists selected for a study of job satisfaction
All working pathologists in a state 25 pathologists selected for detailed tax evaluation by an inspector
The temperature of a patient during a 24-hour period Hourly measurements of the patient’s temperature recorded by staff
Stuttering in a child’s speech Number of stutters made during 5 minutes of reading a standard piece of material
All patients in a state with frontal lobe damage 30 patients with frontal lobe damage selected for evaluating a rehabilitation programme
All surgical gauzes held by a given hospital 10 gauzes selected by a bacteriologist to test for sterility

As shown in Table 3.1, a population is an entire set of persons, objects or events which the researcher intends to study. A sample is a subset of the population. Sampling involves the selection of the sample from the population.

Representative samples

There is a variety of different ways by which one can select the sample from the population. These are called sampling methods.

The ultimate aim of all sampling methods is to draw a representative sample from the population. The advantage of a representative sample is clear: one can confidently generalize from a representative sample to the rest of the population without having to take the trouble of studying the rest of the population. If the sample is biased (not representative of the population) one can generalize less validly from the sample to the population. This might lead to quite incorrect conclusions or inferences about the population. This would mean that the results obtained in the study would not neces-sarily generalize to other studies using the same population. Figure 3.1 illustrates the concept of a representative sample.

image

Figure 3.1 A simple representative sample.

  Page 33 

Figure 3.1 illustrates a hypothetical population composed of three different types of study participants or categories of participants. A representative sample is a precise miniaturized representation of the population. An unrepresentative or biased sample does not adequately represent the key groups or characteristics in the population, and this may lead to mistaken conclusions about the state of the population.

The selection of the appropriate sampling method depends upon the aims and resources of the researchers. For instance, if someone is designing a very expensive health or social welfare programme on the basis of a survey of clients’ needs, it is imperative that the researcher uses a good sampling method and obtains a representative sample of the clients, so that appropriate conclusions may be reached about the population. Good sampling methods are somewhat more expensive and more difficult to implement than poor methods but they are worth it. The main sampling methods used in health research are incidental and random sampling.

Incidental samples

Incidental sampling is the cheapest, easiest and most commonly used sampling method in clin-ical studies. It involves the selection of the most accessible and available members of the target population. For example, a researcher who stands in the middle of a city street and quizzes people about their health status is practising incidental sampling. However, it is quite likely that this sample would not be representative of the general voting population. There would probably be an over-representation of businessmen and white-collar workers, and an under-representation of factory workers and housewives. The sample is likely to be unrepresentative and biased.

A further example of incidental sampling might involve a researcher surveying the needs of a group of spina bifida children at a local community health centre. Their measured needs may be representative of those of other spina bifida children, but then again they may not if these children are not typical of the wider population of children with spina bifida.

Thus, incidental sampling is cheap and easy to implement but may give a biased sample that is not representative of the population.

Quota sampling

Sometimes it is known in advance that there are important subgroups within the population that need to be included in the sample. Two such important groups within the human population are males and females. Further, it is known that they occur in the ratio of approximately 49:51 in the general population. Our researcher might decide that it is very important that the sexes are proportionally represented in the sample. Thus, the researcher would set two quotas (of 49 male and 51 female respondents in a sample of 100) and sample accordingly in a city street. This is still a form of incidental sampling but has some significant advantages over simple incidental sampling because the study sample’s composition on this key demographic variable is guaranteed to match that of the target population.

More sophisticated examples involving more than two groups can be accommodated as shown in Table 3.2. We can see from Table 3.2 that if our sample were to be representative regarding both sex and occupational status, in a sample of 100 people we would need 19 blue-collar males, 15 blue-collar females, and so on.

Table 3.2 Distribution of percentages of gender and occupational variables in the general population

image

Quota sampling still has a number of shortcomings: before it can be used, one has to know which population groups are likely to be important to a particular question and the exact proportions of the various groups in the population. Sometimes we may not know these proportions. Also, the members of the sample within the quotas are still incidentally chosen. The blue-collar males, for example, selected in a city centre on a weekday may still be quite different from those working elsewhere. However, quota sampling is better than simple incidental sampling.

Random and systematic samples

Random sampling

This is one of the best but probably more expensive sampling methods to implement in drawing a study sample. A random sample is one in which all members of the population have an equal chance of selection. Thus a random sample is more likely to be representative of the relevant population than an incidental sample.

The procedure for drawing a random sample involves:

1. the construction of a list of all members of the population
2. using a method such as dice, coins, hat or random number tables to select randomly from the list the number of members required for the sample.

A simple example of a random sample is provided by a common raffle, where names on (preferably!) equal-size papers are put in a hat, shaken and selected ‘blind’. Many national and state lotteries use numbered balls that are drawn randomly from a barrel. Another way to draw a random sample is to construct a list of all the members of the population and assign a number to each element. Then a table of random numbers, generated by a com-puter, could be used to select a random sample.

Cases that are selected from a list using a random selection method constitute a random sample. Sometimes, because of issues such as refusal to participate in the study, dropping out and failure to satisfy the sample inclusion criteria, it is necessary to select replacements for some cases that are unavailable for the study. This can introduce bias into the sample, as the people who refuse may differ consistently from those who accept or volunteer. This impact on sampling validity is sometimes called the volunteer effect.

However, random sampling methods have a number of important advantages over incidental or non-random methods:

Because the exact sizes of the sample and population are known, it is possible to estimate exactly how representative the sample is; that is, the size of the sampling error. This cannot usually be done with non-random sampling methods.
Because random samples are usually more representative than non-random samples, the sample size needed for good representation of the population is smaller.

The major disadvantages of random sampling methods are:

The researcher needs to be able to list every member of the population. Often this is impossible because the full extent of the population is not known. For example, it would be very difficult to sample randomly from the population of Canadians with coronary disease, because no such list exists. Even if it did exist it would be constantly changing.
Cost. It is usually easier to use conveniently available groups. Random sampling usually involves considerable planning and expense, especially with large populations.

Stratified random sampling

This involves the same approach as quota sampling with set quotas from specific subgroups, except that each quota is filled by randomly sampling from each subgroup, rather than sampling incidentally. For example, if one was drawing a sample stratified with respect to sex, one would prepare a list of all females and all males in the target population and then sample randomly from these lists with the numbers of each group in the sample corresponding to the population proportions. These groups collectively are called the strata.

The advantages of stratified random sampling are:

All the important groups are proportionally represented.
The exact representativeness of the sample is known. This has important statistical ramifications.

The disadvantages are:

A list of all members of the population, their characteristics and the proportions of the important groups within the population need to be known.
Cost.
The gain in sample accuracy is usually small in comparison to simple random sampling.

Area sampling

In area sampling, one samples on the basis of location of cases. For example, on the basis of census data the investigator may select several areas in a city or county with known characteristics, such as high or low unemployment rates. The areas could then be further divided into specific streets and the occupants of, say, every third house contacted for participation in the study. In other words, the locations are randomly selected and then one interviews the occupants of those locations. This can be a very effective, cheap method of sampling in social surveys. It does not require a list of the individual members of the population, merely the location where they live. Recent research has shown that where people live can be an important factor in their health status and their use of health services. The study of the impact of where people live upon health and social issues is sometimes called (geo-) spatial analysis.

Systematic sampling

This involves working through a list of the population and choosing, say, every 10th or 20th case for inclusion in the sample. It is not a truly random technique but it will usually give a representative sample. It is based on the (usually justified) supposition that cases are not added to the list in a systematic way which coincides with the sampling system. Provided a list of cases is available, systematic sampling is an easy and convenient sampling method. In clinical practice we are using systematic sampling when, for instance, we measure temperature and blood pressure every hour.

Sample size

One of the most poorly understood aspects of sampling is the number of cases that should be included in a study sample. The whole issue of sampling is conceptualized somewhat differently for qualitative and quantitative studies, although the issue of to what extent the results of a study are generalizable is the same whatever the study design.

It is obvious that in one sense the more cases selected the merrier (or the better the sample representativeness), but the costs associated with the sampling and data collection must be weighed up against the greater generalizability that is generally associated with larger samples. Also, some studies may involve discomfort, pain or even danger to patients or laboratory animals. Therefore, in this case it is ethically and logistically desirable to ensure that no more than the bare minimum of subjects is used to achieve the desired sample accuracy. Health researchers ‘walk a tightrope’ in deciding the optimum sample sizes for their studies. However, there are some principles available to guide the researcher.

  Page 36 

First of all, let us say quite definitely that there is no magic number that we can point to as an ‘optimum’ sample size. Rather, the optimum sample size depends on the characteristics of an investigation in the context of which the sample is drawn. In general, the optimum sample size is one which is adequate for making correct generalizations from the sample to the target population. Let us discuss these issues by introducing the concept of sampling error.

Sampling error

In the quantitative research framework, sampling error is the discrepancy between the true population parameter and the sample statistic. For example, if I happen to know from census data that the actual average age of males in a district is 35 years and the average age of a sample of males I have surveyed from the district is 30 years, then I have a sampling error of 5 years. However, if we do not know the actual population parameters (which is commonly the case), we can only estimate the probable sampling error.

Sampling error is related to sample size by the following relationship:


image


What the above equation claims is that the greater the sample size (n), the smaller the probability sampling error. In fact, the sampling error in a study sample is inversely proportional to the square root of the sample size.

From this relationship it can be seen that doub-ling the sample size would only result in a reduction of the error by a factor of the square root of 2 (1.414). Similarly, a ninefold increase in sample size would result in only a threefold reduction in the sampling error. Figure 3.2 illustrates this point by showing the graphical relationship between probable sampling error and sample size.

image

Figure 3.2 The relationship between sample size and sampling error. The scaling but not the form of the curve will alter with the variability of the data.

It can be seen from this graph that not much is gained from a sample size of over, say, n1. Yet the cost of the sampling and data collection can be very high with large numbers (such as n2), for relatively little gain in reducing sampling error. In some research situations, even large probable sampling errors have relatively little potential influence on our decisions. In such situations, we can live with a relatively small sample size. In other situations, we need large samples to justify our confidence in the truth of our decisions.

As an illustration, suppose that we are attempting to predict the outcome of an election fought between two political parties, A and B. A representative sample of 100 respondents is polled before the election. Say the outcome is as follows:

Intends to vote for A Intends to vote for B Estimated sampling error
25% 75% 10%

In this instance, the estimated sampling error is very small in relation to the size of the effect (that is, the difference between the percentage of intended votes for the two parties). We can predict confidently, assuming that the respondents were truthful and don’t change their minds before the elections, that political party B will romp into government. Increasing sample size, say to 10 000, would enormously increase the cost of the survey. The corresponding reduction in sampling error would not justify this cost, as we would still come to the same conclusion. However, say the following sample statistics were obtained in the pre-election poll of n = 100 respondents:

Intends to vote for A Intends to vote for B Estimated sampling error
48% 58% 10%
  Page 37 

Now the same level of estimated sampling error is too large, in relation to the apparent size of the effect, to make a decision concerning which party is likely to win the election. The poll would have to be repeated using a substantially increased sample size to reduce the sampling error.

The example illustrates the notion that the adequacy of the sample size is affected by the specific situation that we are trying to research. A sample size which is adequate in one situation may be inadequate in another. One benefit of a previous pilot study is that it allows the researcher to estimate the size of the phenomenon under study, and thereby make a more educated guess concerning the sample size required. The concept of statistical ‘power’ or the ability to detect real effects or differences is discussed later in this text.

Sampling issues in qualitative research

Qualitative research shares with quantitative research a concern with the extent to which findings from one study can be generalized to other settings and people. Therefore the representativeness of the information obtained from the sample is important in qualitative research. However, there are differences in the way in which a representative sample is conceptualized in qualitative research.

The key difference is the way in which the population is defined. In quantitative research we make the assumption that there is a true knowable state of the population. The true state of the population is represented by an actual parameter, such as ‘49% of the Australian population are females’ or ‘the average IQ of Canadian high school students is 104’. Qualitative research is not concerned with measuring quantities or counting frequencies but with the experiences and meanings of these experiences for individuals and communities. Sampling in qualitative research is referred to as purposive; the researchers deliberately select the participants who are best placed to provide the information for understanding the personal meanings of health-related events. For example, what is it like to survive a heart attack? What does it mean for patients to undergo transplantation surgery? Our sample provides the information which enables us to understand the emotional processes of coping with a heart attack or the psychological demands of being an organ recipient.

Let us consider an example of this approach. Say that you are intending to conduct qualitative research for clarifying issues relevant to the problem of family or ‘domestic’ violence. The first step would be to clarify the specific aspect of the problem you were studying and define the issues in which you were interested. For example, say that you wanted to collect data concerning the impact of physical violence on the lives of women. A key issue in defining the population is the cultural and historical setting in which it is enmeshed. The moral, legal and health-related problems involved in domestic violence are varied across cultures and change with time. In Western societies it is only relatively recently that violence in domestic settings has been recognized as a serious crime and this influences the experiences of the people involved in the events. Therefore, it is essential to define carefully the cultural characteristics of the group being studied and to keep in mind that we must be very cautious in generalizing the findings from one study to other settings and people.

Purposive sampling

Qualitative sampling is designed to be purposive, i.e. to select cases for inclusion in the research that are likely to be illustrative of particular issues or circumstances. Patton (1990) has proposed a typology of sampling that is widely used in qualitative research. Table 3.3 shows nine of the commonly used purposive sampling methods. Interested readers can follow up in Patton (1990) or Liamputtong Rice & Ezzy (1999) for more detailed information.

Table 3.3 Nine of the commonly used purposive sampling methods

Sampling method Brief description of sampling strategy
Extreme or deviant case sampling The cases in these methods are chosen because of their deviance and the hope that this deviance may illustrate issues about more regular or mainstream cases
Maximum variation sampling Purposefully picking a wide range of variation on dimensions of interest. Documents unique or diverse variations that have emerged in adapting to different conditions. Identifies important common patterns that cut across variations
Homogenous sampling Focuses, reduces variation, simplifies analysis, facilitates group interviewing
Stratified purposeful sampling Illustrates characteristics of particular subgroups previously found to be of interest. Facilitates comparisons of issues across groups
Snowball or chain sampling Identifies cases of interest from contacts who know of suitable interview participants; that is, people who are likely to be good sources of information
Theory-based or operational construct sampling Finding manifestations of theoretical construct of interest so as to elaborate and examine the construct
Opportunistic sampling Following new leads during fieldwork, taking advantage of the unexpected, flexibility for discovering new issues
Random purposive sampling (Still small sample size.) Adds credibility to sample when potential purposeful sample is larger than one can handle. Reduces judgment within a purposeful category. Unlike qualitative research, this strategy is not used for generalizations or representativeness
Convenience sampling Similarly to quantitative sampling, this strategy saves time, money and effort. Poorest rationale, lowest credibility for findings. Yields information-poor cases
  Page 38 

To illustrate qualitative sampling we have selected four strategies. Considering the example of physical violence against women in domestic settings, what would be the researcher’s purpose for selecting the following strategies?

Extreme or deviant case sampling. We would select respondents who experienced severe, even life-threatening, physical violence. Alternatively, we could select women who experienced minimal violence (such as verbal abuse). Both extremes would illustrate the consequences of violence from the victims’ perspective and the understanding of these issues could contribute to developing sensitive and effective interventions.
Maximum variation sampling. Here we would select across all the categories and severities of domestic violence which have been previously identified. Our purpose here is to identify the common themes emerging from the victims’ experiences. This approach would also help to understand how different types of violence might result in varying outcomes from the individual’s perspective.
Snowball or chain sampling. The assumption here is that people sharing similar important experiences might become aware of each other in a community. In particular, key informants who are sympathetic to the researcher’s work might be helpful in identifying ‘information-rich’ cases. For example, people who are actively involved in victim support groups would be a good source for further contacts with possible participants. In addition, such key informants can act as ‘gatekeepers’, protecting participants from unsafe researchers. However, ‘snowball’ sampling can lead to the biased selection of participants and an inaccurate representation of the issues.
  Page 39 
Theory-based sampling. A researcher might hold the theory that the occurrence and experience of family violence might be shaped by patriarchal traditions. Here the researcher might purposely select couples who have more-or-less equal relationships and couples where there has been an imbalance of power. The theoretical construct of interest might be ‘power’ or ‘oppression’ and this construct might be manifested in the ways in which women experience and interpret violence.

To summarize, we want to make sure the people we are interviewing or the situations we are observing are representative of the targeted issues. We want to gain as complete a picture of the issues from the perspectives of our participants as possible. The usual approach to qualitative sampling is to interview or observe people in situations that fit our criteria until we have reached a point where little or no new observations are being made. This is called saturation.

External validity and sampling

The term external validity refers to the extent to which the results of an investigation can be generalized to other samples or situations. External validity can be classified into two types: population and ecological (Huck et al 1974).

Population validity

This refers to generalizing the findings from the sample to the population from which it was drawn. We have already examined the importance of having a representative sample in the generalizing of results from a sample to a population. However, an investigator in the health sciences might face another problem; that the accessible population from which the sample was taken might not be the same as the target population, that is, the one of general interest.

Let us illustrate this point with an example. A physiotherapist working in a large private maternity hospital intends to examine the effectiveness of a new antenatal exercise procedure for pregnant women for controlling levels of pain during delivery. A random sample of 50 pregnant women is chosen for the investigation from the population attending the hospital. The sample is then randomly assigned into two groups: one receiving the new antenatal exercise procedure, the other receiving the traditional programme. The researcher finds a statistically and clinically significant difference between the two procedures, such that the new programme is shown to be effective.

Strictly speaking, these findings can be generalized only to the population of women who attend the hospital. If the target population is all women who are having babies, then the generalization will lack external validity, because women attending different hospitals or having children at home had no chance of being included in the study sample. They might have different characteristics and these different characteristics may interact with the treatment in different ways to that of the sample. For instance, women who chose to deliver at home might respond better than those who chose to go to hospital but we would not know this from this study.

Ecological validity

There is another facet of external validity: the situation in which an investigation is carried out might not be generalizable to other situations. This is called ecological validity. Consider the following examples:

1. It has been shown that for clinical pain (due to disease or injury), morphine is an excellent analgesic. However, in laboratory studies of pain induced by electric shocks, morphine had little effect on subjects’ reports of pain threshold. Clearly, generalizing from laboratory to clin-ical settings, or vice versa, has to be done with extreme caution (Beecher 1959).
2. Coronary arteriography involves the insertion of a small-gauge catheter into the coronary arteries, and injection of a dye for X-ray visualization. It was initially reported that the mortality rates for this rather dangerous-sounding practice were only 0.1% (1 per 1000) in a first-class medical institution. However, later reports from various other institutions showed mortality rates as high as 8% (80 per 1000) (Taylor 1979). Clearly, the effectiveness of treatments or the usefulness of clinical evaluations can well depend on who implements them.

The above examples illustrate the caution necessary in generalizing findings.

Summary

Appropriate sampling strategies ensure the external validity of studies and their findings. The aim of sampling strategies is to ensure the selection of a sample that is representative of the population of objects, persons or events the investigator aims to study. Incidental and quota samples are chosen for convenience, but these sampling strategies do not guarantee a representative sample. Random and stratified random sampling methods ensure that all important groups and characteristics in the population have the best chance to be selected and included in the study sample. Random sampling strategies are the most desirable to obtain a representative sample, although random sampling is not always feasible or desirable in health research. Area sampling and systematic sampling are also strategies that can be used to obtain representative samples.

An adequate sample size reduces the chance of large sampling errors. The probable sampling error is inversely proportional to the square root of the sample size. It was argued that optimum sample size is not the maximum number of obtainable subjects or a constant number or proportion. Rather, it has to be estimated for a specific investigation on the basis of the parameters of the phenomenon being studied and the study circumstances. Two types of external validity were discussed: population and ecological. External validity is related to inference, which involves using evidence from a limited set of elements to formulate general propositions.

Various approaches to purposive sampling in qualitative research were reviewed and an example of their application to a hypothetical study situation was presented.

Self-assessment

Explain the meaning of the following terms:

area sample
biased sample
ecological validity
external validity
incidental sample
population
population validity
random sample
random sampling
representative sample
sample
sampling error (probable)

True or false

1. The basic idea underlying sampling is to select a representative sample from which the investigator can make inferences to the population.
2. A sample is said to be random when it is not representative of the population.
3. If a population contains 50% males and 50% females, and our sample has 10% males and 90% females, then our sample is said to be biased.
4. When you take a patient’s blood pressure daily, you are in fact sampling from a population of potential blood pressure readings.
5. If the patient’s blood pressure fluctuates considerably during the day, then a single reading will be an inadequate sample.
6. Stratified random sampling involves the selection of the most accessible elements of the population.
7. Say that important subgroups of the population are known and subjects are sampled incidentally in proportion to these subgroups. This sampling method is known as area sampling.
8. A random sample is one in which 50% of the elements of a population have equal chances of being sampled.
9. Random sampling in health sciences is the least expensive and time-consuming strategy for selecting a sample.
10. When you take blood pressures hourly, you are in fact using a random sampling method.
11. The larger the sample size, the larger the sampling error.
12. The problem of internal validity might emerge when we generalize results obtained in one research setting to another.
13. An incidental sample cannot be, in principle, representative of the population.
  Page 41 
14. If a sample is representative, it yields valid data for making generalizations about the population from which it was drawn.
15. The main difference between random and quota sampling is that in quota sampling particular subgroups of the population are represented proportionately.
16. Incidental sampling generates less bias than systematic sampling.
17. Area sampling involves selecting an area at random and assessing the inhabitants of that area.
18. A sample that is unbiased is a representative sample of the population from which it is drawn.
19. Sample error decreases as the sample size increases.
20. If the sample size is halved, the sampling error will be doubled.
21. If a sample is large (say n > 500) then the sample must be representative.
22. Generalizing findings from laboratory to clinical settings raises questions of ecological validity.
23. The problem of population validity refers to a population which contains invalid elements.

Multiple choice

1. As sample size increases:
a the sampling error decreases
b the ecological validity of the investigation increases
c the population becomes more accessible
d the sample becomes more biased.
2. A representative sample:
a consists of at least 500 cases
b must be a random sample
c is defined as the inverse of the square root of the sample size
d reflects precisely the crucial dimensions of a population.
3. An incidental sample is:
a not necessarily biased
b generally obtained through costly and difficult sampling procedures
c used only in non-experimental investigations
d none of the above.
An investigator wishes to study individuals suffering from agoraphobia (fear of open spaces). The investigator places an advertisement in the paper asking for subjects. One hundred replies are received, of which the investigator randomly selects 30. However, only 15 subjects actually turn up for their appointment.

Questions 4–6 refer to the above information.

4. Which of the following statements is true?
a The final 15 subjects are likely to be a representative sample of the subjects selected by the investigator.
b The final 15 subjects are likely to be a representative sample of the population of agoraphobics.
c The randomly selected 30 subjects are likely to be a representative sample of those agoraphobics who replied to the newspaper advertisement.
d None of the above are true.
5. The problem with drawing a representative sample of subjects with clinical conditions such as agoraphobia is that:
a the subjects who consent to participate may be unrepresentative of the target population
b no sampling strategies are appropriate
c no complete lists of sufferers’ names are usually available
d a and c.
6. The basic problem confronted by the investigator is that:
a the accessible population might be different from the target population
b the sample has been chosen using an uneth-ical method
c the sample size was too small
d agoraphobics are impossible to study in a scientific way.
7. Say that it is known that coronary disease occurs twice as frequently among males as females and three times more commonly among over 50-year-olds than those under 50. Given a sample of 120 obtained by quota sampling, how many subjects would you expect to be female and under 50?
a 60
b 40
c 30
d 10.
8. Referring to the population in Question 7, and given a sample obtained by stratified random sampling, how many females over 50 would you expect in the sample?
a 40
b 30
c 10
d 5.
  Page 42 
9. If a study is externally valid then:
a its results can be generalized to other equivalent settings
b it must have been an experiment
c quota sampling must have been used
d all the subjects in the sample must have been equivalent.
10. A random sample is one in which:
a all the elements had an equal chance of selection
b a chance method was used to select the elements included in the sample
c both a and b
d neither a nor b.
11. When we say that a study lacks ecological validity we are implying that:
a the study was carried out in a laboratory
b the results cannot be generalized to other settings
c the target population is different to the accessible population
d all of the above.
12. Pilot studies are useful for establishing:
a the approximate sample size necessary for the investigation
b the appropriateness of the sampling model being employed
c both a and b
d neither a nor b.
13. If a pilot study indicates that the effect is likely to be small in relation to the sampling error then the investigator should:
a abandon the research project
b use a relatively small sample
c use a relatively large sample
d use an incidental method of sampling.