PROTEUS

              

 

 

 

 

 

 

Return to ITMD COURSE PAGE

These are the practice questions that will give you experience in tackling the methods we are discussing.  The ones you should be able to do at this stage are shown with numbers highlighted.... 

PLEASE NOTE: Some of these questions are presented in a "full" form- suitable for a longer working time (as in a take-home assignment).  The questions you will see in the exam will NOT be more complicated than these, and MOST of them will be much simpler, or have more of the answer already calculated for you.  

  The answers to these problems are given here. 

_______________________________________________________________________________________________________________________

1.  You are investigating risk factors for contracting Yersinia enterocolitica infection. A group of 22 patients has been identified as well as a group of 40 non-Yersinia controls.  The following data include possible exposures in the previous month.  The figures shown are the number in each group who answered 'yes'.

  Do you have a pet? Have you drunk raw milk? Have you travelled to Mexico? Have you handled raw pork? Have you had any other illness?
Cases 10 5 8 9 3
Controls 12 7 13 7 8

 

2. Before the analysis can proceed with data from a questionnaire, the age distribution of the two groups must be checked to help confirm unbiased sampling.  Type A respondents (N=120) have a mean age of 32.0 years, with a standard deviation of 7.2 years.  Type B respondents (N=130) have a mean age of 33.2 years with a standard deviation of 7.5 years. How likely is it that the differences in the mean ages occurred by chance?

 

3. Pre-testing and post-testing of knowledge at a patient-lifting training course for nurses has yielded the following summary.  Note that the differences (d) = post-test minus pre-test. The mean difference (đ) = 13 points, 22 df,  P>0.05      Which of the following is/are correct?

    [i] that is all but 5% of cases, there was a significant increase in knowledge of 13 points

   [ii] that more than 5% of the people scored 13 points more on the post-test than on the pre-test.

  [iii] that each of these 23 people could be expected to score 13 points more on the post test in 5% of repeated trials.

  [iv] that an overall mean improvement of 13 points could be expected (if the null hypothesis were correct) more than 5% of the time.

 

6.  The following data are taken from a food-specific attack rate table and represent one food (of many).  Display the values in a contingency table and choose a suitable method of testing the null hypothesis of "no association".  Report your findings in clear terms.

Ate food Did not eat food
ill not ill total ill not ill total
5 1 6 5 5 10

 

7.  The time between taking a water sample and having the sample analyzed is being investigated.  Beginning with a known suspension of coliforms, and with temperature remaining constant, the following data are collected.  [THE ADDITIONAL INFO HAS NOW BEEN ADDED TO THIS QUESTION]

Time stored (hours) 16 24 1 8 4 12
Colony count (per ml) 128 115 150 140 141 135

 In addition, the following results have been obtained (meaning that you do NOT have to start the analysis from the beginning (although the practice is very valuable!)     

       Regression coefficient :  -1.42324

       Value of Y when X = zero :  150.252 colonies

       Std error of the regr. coeff. (Sb) : 0.119431

 On a scattergram, draw the regression line, and test the null hypothesis that beta=zero. (Don't forget to also check that the CL around the slope do NOT include the line of b=0 )   Predict the colony count after 20 hrs.  Is the count dependent upon time? summarize clearly.  [For a bonus point DRAW the confidence limits as lines on the scattergram; using any convenient (x,y) common point for all FOUR lines]  

 

 

8.   In buildings where very little fresh air is introduced the CO2 concentration increases over the working day.  You are studying the relationship, if any between (X) the CO2 conc. (in ppm) in the air of 16 government buildings, and (Y) the number of complaints about 'stale air', 'bad air', etc. per person, per year.  The regression is linear and is described by the following:  Y' = 324 + 0.016 (X)   The confidence limits are calculated as CL(95%) = 0.016 +/- (t 0.05, 14df )(0.11).

       (a) test the null hypothesis that Beta = zero

       (b) what is the estimated number of complaints/p/yr when CO2 is 300 ppm

       (c) How would you summarize the relationship?

 

9.   An excursion to Ethoipia has resulted in illness among a party of 22 Canadians.  The risk factors are shown, together with the number of people who were exposed to each factor, and the number ill in both exposed and non-exposed groups.  Calculate the odds ratio for each factor, and test the null hypothesis for the factor having the strongest association.

1.  food: dates 8 ill of 18 who ate (2 ill who did not eat)
2. food: couscous 6 ill of 13 who ate (4 ill who did not eat)
3. drink: local beer 3 ill of 7 who drank (7 ill who did not drink)
4. river rafting 9 ill of 12 exposed (1 ill of those not exposed)
5. contact with camels 8 ill of 18 exposed (2 ill of those not exposed)

 

10.   The following figures represent results of a trial to measure the body length in cm (Y) of a rodent species as a function of age (X) in days after birth.

   Regression coefficient: 0.272 cm/day

   Value of Y' when X is zero 0.72 cm

   Standard error of the estimate (Syx ):   0.22 cm

   Range of age:  3  to 17 days

   N=13  

   Range of body length: zero to 5 cm

   Standard error of the regr coeff. : 0.0135 cm/day

(a) draw the scattergram using suitable axes

(b) draw the regression line on the scattergram

(c) place upper and lower confidence limits around the regr line so as to pass through the MEAN of X

(d) Test the null hypothesis that beta =zero

(e) What is the probability that these results could have occurred by chance (in the absence of any real relationship)?

(f) Predict body length when age = 8 days

(g) Is body length dependent upon age for these animals in their first 17 days.  How would you summarize?

 

11.   In a workplace where lead solder is used, the concentration of lead fumes (Y) is being measured as a function of the number of hours (X) that soldering is being carried out.  The test begins on a Monday after two days of no operations, and runs for 30 hrs non-stop.  The lead measurements are in micrograms per litre of air and are taken each hour for 30 hrs.

The relationship is described by the model Y' = 24 + 5.1(X) and the 95% confidence limits are calculated as CL(95%)= b +/- (t0.05,28df)(2.1).

(a) Draw the regression line and confidence limits on a suitable scattergram (even through you don't have the original data)

(b) Test the Ho: that beta = zero

(c) What is the estimated conc. of lead in the air at 10,20, and 30 hrs?

(d) How would you summarize the relationship shown?

 

13.   You are investigating the effects of lighting upon accidents in a car-assembly plant.  There are two assembly lines, identical, except for the lighting. Line B has been fitted with new, broad spectrum lighting at greater intensity.  Line A retains an older form of lighting.  The number of accidents per month are recorded for 2 years, and compared using a t-test. 

The results are summarized as follows:   meanA — meanB = 3.4       t = 2.104        df_____      P_________

    (i)  Complete the summary line by adding the df value, and the probability of such a difference occurring by chance alone

   (ii)  What is the value of the standard error of the difference between two means?

  (iii) Answer this question in clear, everyday language:  "Does the new lighting seem to be associated with a reduction in accidents?"

 

15.     A new device for measuring carbon dioxide is being compared with a standard device.  Nine trials are run, each involving a measurement with both devices.  The data are as follows:

Trial 1 2 3 4 5 6 7 8 9
Standard Device 450 480 510 500 470 520 475 480 490
New device 460 485 505 510 470 525 485 500 510

 

16.      In an attempt to compare the bacteriological quality of two types of imported, bottled water, 20 bottles are sampled of each type.  The results show that standard deviation of bacterial counts from each type were the same:  16.0 cells/ml. but the mean counts were different: 105 cells/ml (type Y) and 85 cells/ml (type Z),    How would you summarise the findings?   

 

17.      Twelve medical students from University A are being compared with 11 medical students from University B in an pharmacology knowledge quiz scored out of a possible 20.   Is there convincing evidence that the two groups are different?

A 12,   15,   19,   11,   18,   14,   13,   10,   16,   17,   15,   14
B     15,   11,   17,   18,   19,   17,   19,   19,   12,   13,   16

 

18.     Health programs are being evaluated for costs.  The rabies control program and the food safety program are being compared on average person-hours expended per incident.   There were 11 rabies control incidents and 9 food incidents in 2011.  Analyze the data and comment on the difference in hours, if any, between the program. 

RABIES 6,   8,  11, 4,  12,   18,  15,   13,   9,   11,   10
FOOD SAFETY     8,   15,   26,   11,   16,   17,   30,   20,   17,  

 

19   An automated device for counting bacterial colonies on an agar plate is being compared to an experienced lab technician.  A series of six specially prepared petri dishes are counted by both methods and the results compared.  Are you able to determine if the machine and the lab technician differed to any significant degree?

Plate 1 2 3 4 5 6
Technician 53 12 65 14 20 14
New device 50 11 62 14 17 11

  

20   After returning home from an international conference,  twenty-four delegates appear to have fallen victim to a mysterious respiratory ailment.  Possible risk factors for these people and those for forty non-ill conference attendees are being examined.  Calculate the odds ratios for each factor.  For the factor with the strongest association only, test the null hypothesis of "no association".  Summarize for the five factors fully.

1.  Attended opening reception 8 ill of 18 who participated
2. visited local theatre 5 ill of 10 who participated 
3. swam in hotel pool 16 ill of 20 who participated 
4. went on sightseeing trip 6 ill of 16 who participated 
5. Attended closing dinner 5 ill of 15 who participated

 

21   After data collection, but before analysis, the data are checked to ensure that they are unbiased. Determine if the following figures are similar in the two groups prior to the commencement of a study:

  STUDY GROUP ( N=30 ) CONTROL GROUP ( N=30 )
Age (yrs) mean: 35.2 SD: 6.4 mean: 38.0 SD: 7.0
Weight (Kg) mean: 70.0 SD: 9.5 mean: 68.2 SD: 9.3
Blood pressure (mm) mean: 120 SD: 15 mean: 145 SD: 17

 

22.   Is there evidence of a difference in monthly accidence rates at the following two plants between January and December 2009?  Both plants are of similar design in all respects except that "Epsilon" has 'non-slip' floors and "Omega" does not.

  JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC
EPSILON 3 5 1 0 3 1 1 3 4 3 5 6
OMEGA 3 4 0 1 1 0 3 3 2 6 4 5

 

23.   Roach bait-and-trap stations are being evaluated.  A test-location is set up in each of five infested buildings.  At each station a carbohydrate-based bait and a protein-based bait are used, and the numbers of trapped roaches are recorded for each bait and at each station.  Analyse the data and report fully.

  1 2 3 4 5
Carbohydrate bait 11 15 8 13 9
Protein bait 10 17 5 10 10

   

24.   Days to full recovery is the dependent variable in a trial of two treatment methods for 13 workers diagnosed with a spinal condition.  Analyse the data and determine if one method is significantly more efficient that the other:

HEAT ELECTRICAL STIMULATION
13,   10,   7,   6,   11,   15   days 14,   13,   12,   17,   8,   12,   6   days

 

25   Increased ventilation in the workplace is being examined as an aid to worker's respiratory health.  Ten cabinet makers working in an improved ventilation environment are compared with eight working in poorly-ventilated workshop under otherwise similar conditions.  Their lung functions are measured on a scale from 1 (poor) to 8 (perfect) as shown below.  Analyze the data and report on any apparent difference between the two groups.  What comments might you make about the design of this study?   What other factors would you need to know?

  Lung function of workers
Good ventilation 6,    7,    3,    7,   4,    4,    3,    6,    6,    5
Poor ventilation 4,     5,     2,     6,     3,     2,     4,     5

 

26   State the null hypothesis and summarize the following, giving a probability (P) value                      

Nursing students:   mean GPA: 2.85
Engineering students:   mean GPA: 3.15
t: 2.191,     89 df,      P___________

 

27    State the null hypothesis and summarize the following, giving a probability (P) value

Toronto residents' mean Kg garbage/person.day:    1.38
Scarborough residents' mean Kg garbage/person.day:    1.43
t: 1.729,    2720 df,      P___________

 

28   Workers in a fibre-glass production facility are being tested for hypersensitivity to the resin catalysts being used. Twenty workers exposed to fibre-glass resin and 40 workers not exposed to resins are given sensitivity tests.  Fifty-five percent of the exposed group are found to be sensitive, but only 12 of the not-exposed workers were found to be sensitive to the resins. Analyze and summarize fully.

 

29   An antiseptic cream is being used in a food plant in an attempt to control the presence of Staph. aureus bacteria.  After six weeks the wokers' handsare swabbed to compare the flora.  Of 40 workers who have used the cream, 36 were free from Staph. aureus on their hands.  Of 48 who had not used the cream, two-thirds were free from Staph. aureus. What can you state about the cream's effectiveness in controlling this bacterium?

30  The ability of a new ultra-violet flying insect trap to attract more insects that an ordinary 15 w light bulb is being investigated.  At the end of 10 days, the flies-per-day totals are compared as follows:

   Ultra-violet trap : mean: 74 flies (std.dev. : 16 )

  15w light bulb : mean: 42 flies (std.dev. : 18 )

How would you evaluate these results using an appropriate test of the null hypothesis?

 

31   Twelve (12)   samples of water are taken at each of two water storage reservoirs.  The nitrate content (in parts per million) was recorded as follows. Assuming unbiased data, state the null hypothesis, and test it at 0.05 level of significance.

Reservoir A mean: 35.2 ppm SD:  8.4  ppm
Reservoir B mean: 27.0 ppm SD: 7.1  ppm

 

34   The foods prepared in a commissary are being sampled to determine the incidence of Staphylococcus aureus, and to track the possible source of the bacteria.  The data collected include product (sandwiches [S], pies [P], wraps [W] ), production line (#1 or #2), and the shift (am, pm) in which the food was prepared. Analyse the data, and consider also the distribution of the dependent variable!

sample line type shift S. aureus count/ml

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

1

2

1

1

2

1

1

2

2

1

2

1

2

1

1

2

1

1

2

2

1

1

2

2

S

P

P

W

S

W

P

S

W

P

S

W

P

S

S

W

P

S

W

P

S

P

S

W

A

A

A

A

A

P

P

P

A

P

A

P

P

A

P

A

P

A

P

P

A

P

A

P

30

100

300

90

10

100

600

0

10

300

0

100

0

0

90

0

100

0

3

30

10

300

0

10

An explanatory note re the above question......  The Dependent variable (the count of S. aureus) is NOT "normally distributed" and this is an important assumption for these tests.  The reason of course is that bacerial multiply exponentially, and this is reflected in the strange distribution of values.  Analysis involving this variable would require the data to be transformed into a normal distribution by means of a logarithm.  So each value would have +1 added (to avoid the problem of trying to take a Log of zero), and then the Logs of the values would be used in all calculations with this variable.  Don't worry, I would not spring this on you at the final exam!

 

35.  The health unit is investigating the spread of a water-borne gastro-enteritis at a school.  The following data have been collected and your job is to test the association between water consumption and the risk of becoming ill.

                           * Total number of students who were ill: 37    

                          * Total number of students in the study: 89

                         * - of those 26 who usually drink more than 5 cups of water/day: 15 ill

                        * - of those 30 who usually drink more than 5 cups of water/day: 14 ill

                       * - the remainder usually drink less than one cup/day

 

38   The following are taken from different outbreaks. (So the totals are different). For each item, make an appropriate test of the null hypothesis, and include odds ratio results in your interpretation. (THREE separate discussions needed).

    Milk pudding    ate: 15,    (ill: 10),      did not eat: 18,    (ill: 9)    

    Ham Pie          ate: 31,    (ill: 13),      did not eat:  26,   (ill:20)

    Egg Salad        ate: 11,    (ill: 6),        did not eat: 7,      (ill:2)  

 

39   In a study of indoor air quality, workers are asked to rate air-"freshness" of thebuilding on a five-point scale in which '1' was 'very comfortable' and '5' was 'very uncomfortable'. Also collected at the same time are the relative humidity (RH%) and air velocity (VEL) in cm/sec.  Determine if any relationship exists between the reported rating and either of these two factors.  Also consider if these two factors might be themselves related; how would you test that idea?

RH% VEL (cm/sec) employee rating

30

27

42

52

36

24

10

16

30

40

45

58

43

28

42

48

22

62

3.2

3.8

4.7

5.5

1.9

2.6

2.0

1.1

4.9

 

40   You are attempting to determine the effects of acidified water on leaching of lead from plumbing fixtures.  The following trials are carried out using various pH levels in contact with lead-soldered fittings for 24 hrs.  Describe the effects, if any.

  water pH     3.5     5.5     7.5     2.5     3.0      2.0      7.0     3.0      4.0
 lead concentration (ppm)    120      6       15      170     130     210      10      110       7

 

41   The following data are collected from a workplace because a suggestion has been made that exposure to styrene increases the risk of dermatitis. Carry out an appropriate analysis to determine if there is any convincing evidence of this association.  Of workers exposed to styrene vapour, 14 have dermatitis and 22 do not.  Of the workers not exposed to styrene vapour, only one in six suffer from dermatitis.  Total workers in the study: 72.

 

42   As part of the evaluation of a safety-education program, a pre- and post-test are administered.  The following scores were recorded from a class of eight persons. Determine the effectiveness of the knowledge-acquisition. (Dependent variable = percentage correct answers). Analyze and interpret fully.

Individual Pre-test % score Post-test % score

1

2

3

4

5

6

7

8

45

55

52

18

29

70

59

66

60

65

82

27

35

83

54

80

 

43   In evaluating the effectiveness of different methods of presenting health and safety education sessions, fifteen learners are assigned at random into one of three methods of presentation.  Using ANOVA, test the null hypothesis that the methods are equal in teaching effectiveness.

Method used Scores (%)
Lecture 63, 59, 71, 59, 65
B Hands-on participation 82, 92, 75, 74, 87
C Audio-visual media 71, 82, 66, 71, 73

 

46   The following data relate to the chlorine residual in a swimming pool at various times after it has been treated.

Number of hours Chlorine residual  1. Draw a scattergram

 2. Fit a regression line to the data

 3. Use the predicting equation to determine the Chlorine residual at the 11th hour

 4. Test the Ho: beta=zero

 5. Clearly summarize your findings

2

4

6

8

10

12

2.2

1.9

1.6

1.3

1.0

0.8

   

 

47   A bisulphite based formaldehyde-scavenging compound is being tested in eight houses containing urea-formaldehyde foam insulation.  A still-air measurement of formaldehyde vapour is taken prior to treatment and another measurement taken seven days after the treatment with bisulphite.  The following data are presented to you with the request to analyze and comment upon the effectiveness of the treatment.

House

Test before treatment

 (ppm formaldehyde )

Test after treatment

(ppm formaldehyde

1

2

3

4

5

6

7

8

7.0

4.6

6.0

6.2

5.6

3.1

4,2

6.1

3.8

2.5

2.0

2.9

3.0

1.6

2.0

3.5

 

 

49   In a study of dietary intake of metals, the following measurements for aluminum and zinc in the blood of 12 persons are recorded (micrograms/100 ml blood)

Person 1 2 3 4 5 6 7 8 9 10 11 12
ZINC 83 71 60 54 90 80 68 50 56 62 51 60
ALUMINUM 22 14 16 22 08 10 16 20 15 16 19 17

a. Make a scattergram of the data

b. Describe any relationship between the variables

c. Compute the correlation coefficient

d. Would you be justified in stating that there is a cause-and-effect mechanism operating here?

Test the correlation coefficient from (c) and determine if it differs significantly from zero

 

50   In 1963, an outbreak of type E botulism occurred in Kentucky and Alabama.  Antitoxin for type E botulism was used, one of the first trials in North America.  Of the 17 cases, five died. Of the twelve survivors, eight had been treated with type E antitoxin.  None of the five fatalities had received the antitoxin.     Arrange the data into a form suitable for analysis.  Test the null hypothesis that there was no association between survival and having received the antitoxin.

 

51    A new disinfectant compound is being evaluated for use in dairy plants.  Ten different types of stainless steel milk-contact surface are inoculated with a suspension of E. coli bacteria in milk.  The surfaces are treated on the left side with the new compound, and on the right side with the standard compound.   The counts are then compared after suitable incubation.  Use a suitable means of analysis on the data. (Dependent variable: percentage of live E coli cells remaining).

Surface 1 2 3 4 5 6 7 8 9 10
New compound 9 5 8 5 11 3 10 15 0 2
Standard compound 9 15 9 18 12 17 14 8 22 18

 

52     The mean number of dead insects (out of original 50) is shown for each pesticide.  Note that the ANOVA table has been largely completed.  Finish the table, and report fully. 

  Pesticide Mean kill/50 Trials per group  
A 20 15
B 38 15
C 12 15

A.N.O.V.A  TABLE

Source of variation Sum of Squares degrees of freedom Mean square F ratio P
Between groups 486.01        
Within groups 1655.24        
Total          

 

53   The results of a test of muscle strength are delivered to you.  You establish a null hypothesis and carry out an (unpaired) t-test on ten measures

  With treatment    Without treatment                     (five in each group) because nothing suggests that the data were paired. 

          7.8                     8.9                                     Later, you are told that the data were gathered from the SAME FIVE PERSONS,

          5.0                     7.1                                     before and after the treatment.  The correct analysis now becomes a t-test

          2.1                     4.5                                     for PAIRED data.  Carry out BOTH analyses.  Compare the two results.  What values

          5.5                     7.7                                     are the same?  Comment on the apparent discrepancy in statistical significance.  Why is this so?

        10.0                   11.9

 

54   The following foods are being examined for association with illness.  Deal with each food separately.  Calculate the odds ratio and test the null hypothesis by means of appropriate analytical methods.  Summarize fully. [Note: IF Fisher's test is required, please generate a complete and accurate "P" statement]

Food ATE THE FOOD DID NOT EAT THIS FOOD
  TOTAL ILL TOTAL ILL
ALPHALPHA 17 9 17 7
BEETS 21 4 13 12
CRUNCHY BARS 14 11 20 5

  

56   The effect, if any, of using iodophor hand-dips in supermarket delicatessen departments is being evaluated.  A total of 174 swab-tests are taken from the hands of employees working with no hand-dips, with Salmonellae species being reported as 'present' or 'absent'.  Eighty (80) swabs are taken from the hands of employees in stores where the dip has been introduced. Salmonellae are isolated from 24 non-dipped hands, and 22 "dipped" hands.  Comment on the effectiveness, if any, of the hand dips.