Page 163 - Statistics for Environmental Engineers

P. 163

L1592_frame_C19.fm Page 161 Tuesday, December 18, 2001 1:53 PM

Assessing the Difference of Proportions

KEY WORDS bioassay, binomial distribution, binomial model, censored data, efﬂuent testing, normal
distribution, normal approximation, percentages, proportions, ratio, toxicity, t-test.

Ratios and proportions arise in biological, epidemiological, and public health studies. We may want to
study the proportion of people infected at a given dose of virus, the proportion of rats showing tumors
after exposure to a carcinogen, the incidence rate of leukemia near a contaminated well, or the proportion
of ﬁsh affected in bioassay tests on efﬂuents. Engineers would study such problems only with help from
specialists, but they still need to understand the issues and some of the relevant statistical methods.
A situation where engineers will use ratios and proportions is when samples have been censored by
a limit of detection. A data set on an up-gradient groundwater monitoring well has 90% of all observations
censored and a down-gradient well has only 75% censored. Does this difference indicate that contami-
nation has occurred in the groundwater ﬂowing between the two wells?

Case Study

Biological assays are a means of determining the toxicity of an efﬂuent. There are many ways such tests
might be organized: species of test organism, number of test organisms, how many dilutions of efﬂuent
to test, speciﬁcation of response, physical conditions, etc. Most of these are biological issues. Here we
consider some statistical issues in a simple bioassay.
Organisms will be put into (1) an aquarium containing efﬂuent or (2) a control aquarium containing
clean water. Equal numbers of organisms are assigned randomly to the control and efﬂuent groups. The
experimental response is a binary measure: presence or absence of some characteristic. In an acute
bioassay, the binary characteristic is survival or death of the organism. In a chronic bioassay, the organisms
are exposed to nonlethal conditions and the measured response might be loss of equilibrium, breathing
rate, loss of reproductive capacity, rate of weight gain, formation of neoplasms, etc.
In our example, 80 organisms (n 1 = n 2 = 80) were exposed to each treatment condition (control and
efﬂuent) and toxicity was measured in terms of survival. The data shown in Table 19.1 were observed.
Are the survival proportions in the two groups so different that we can state with a high degree of
conﬁdence that the two treatments truly differ in toxicity?

The Binomial Model

The data from a binomial process consist of two discrete outcomes (binary). A test organism is either
dead or alive after a given period of time. An efﬂuent is either in compliance or it is not. In a given
year, a river ﬂoods or it does not ﬂood. The binomial probability distribution gives the probability of
observing an event x times in a set of n trials (experiment). If the event is observed, the trial is said to
be successful. Success in this statistical sense does not mean that the outcome is desirable. A success
may be the death of an organism, failure of a machine, or violation of a regulation. It means success in

158 159 160 161 162 163 164 165 166 167 168