Prof Randi Garcia
January 27, 2021
Male albino laboratory rats are used routinely in many kinds of experiments. This experiment was designed to determine the requirements for protein and amino acid threonine in their food. Specifically, the experiment is interested in testing the combinations of eight levels of threonine (.2 through .9% of diet) and five levels of protein (8.68, 12, 15, 18, and 21% of diet). Baby rats were separated into five groups of 40 to form groups of approximately the same weight. The 40 rats in each group were randomly assigned to each of the 40 conditions. Body weight and food consumption were measured twice weekly, and the average daily weight gain over 21 days was recorded.
Male albino laboratory rats are used routinely in many kinds of experiments. This experiment was designed to determine the requirements for protein and amino acid threonine in their food. Specifically, the experiment is interested in testing the combinations of eight levels of threonine (.2 through .9% of diet) and five levels of protein (8.68, 12, 15, 18, and 21% of diet). 200 baby rats were randomly assigned to each of the 40 conditions. Body weight and food consumption were measured twice weekly, and the average daily weight gain over 21 days was recorded.
This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use three subjects, and each subject will be given the drug three times, once for each method. The area under the time-concentration curve is recorded for each subject after each method of drug delivery.
This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use nine subjects, and randomly assign subjects to one of the three delivery methods. The area under the time-concentration curve is recorded for each subject after being given the drug.
Modern zoos try to reproduce natural habitats in their exhibits as much as possible. They try to use appropriate plants, but these plants can be infested with inappropriate insects. Cycads (plants that look vaguely like palms) can be infected with mealybugs, and the zoo wishes to test three treatments: 1) water, 2) horticultural oil, and 3) fungal spores in water. Nine infested cycads are taken to the testing area. Three branches are randomly selected from each tree, and 3 cm by 3 cm patches are marked on each branch. The number of mealybugs on the patch is counted. The three treatments then get randomly assigned to the three branches for each tree. After three days the mealybugs are counted again. The change in number of mealybugs is computed (\( before-after \)).
treatment | tree1 | tree2 | tree3 | tree4 | tree5 |
---|---|---|---|---|---|
oil | 4 | 29 | 14 | 14 | 7 |
spores | -4 | 29 | 4 | -2 | 11 |
water | -9 | 18 | 10 | 9 | -6 |
Draw the factor diagram, labeling inside outside factors.
\[ {y}_{ij}={\mu}+{\tau}_{i}+{\beta}_{j}+{e}_{ij} \]
Source | SS | df | MS | F |
---|---|---|---|---|
Treatment | \( \sum_{i=1}^{a}b(\bar{y}_{i.}-\bar{y}_{..})^{2} \) | \( a-1 \) | \( \frac{{SS}_{T}}{{df}_{T}} \) | \( \frac{{MS}_{T}}{{MS}_{E}} \) |
Blocks | \( \sum_{j=1}^{b}a(\bar{y}_{.j}-\bar{y}_{..})^{2} \) | \( b-1 \) | \( \frac{{SS}_{B}}{{df}_{B}} \) | \( \frac{{MS}_{B}}{{MS}_{E}} \) |
Error | \( \sum_{i=1}^{a}\sum_{j=1}^{b}({y}_{ij}-\bar{y}_{i.}-\bar{y}_{.j}+\bar{y}_{..})^{2} \) | \( (a-1)(b-1) \) | \( \frac{{SS}_{E}}{{df}_{E}} \) |
mealybugs
tree treatment bugs_change
1 tree1 water -9
2 tree1 spores -4
3 tree1 oil 4
4 tree2 water 18
5 tree2 spores 29
6 tree2 oil 29
7 tree3 water 10
8 tree3 spores 4
9 tree3 oil 14
10 tree4 water 9
11 tree4 spores -2
12 tree4 oil 14
13 tree5 water -6
14 tree5 spores 11
15 tree5 oil 7
mod <- lm(bugs_change ~ treatment + tree, data = mealybugs)
anova(mod)
Analysis of Variance Table
Response: bugs_change
Df Sum Sq Mean Sq F value Pr(>F)
treatment 2 218.13 109.07 2.9963 0.106846
tree 4 1316.40 329.10 9.0412 0.004603 **
Residuals 8 291.20 36.40
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use three subjects, and each subject will be given the drug three times, once for each method. The area under the time-concentration curve is recorded for each subject after each method of drug delivery.
In the bioequivalence example, because the body may adapt to the drug in some way, each drug will be used once in the first period, once in the second period, and once in the third period.
Treatments:
timeslot 1 | timeslot 2 | timeslot 3 | |
---|---|---|---|
subject 1 | A 1799 | C 2075 | B 1396 |
subject 2 | C 1846 | B 1156 | A 868 |
subject 3 | B 2147 | A 1777 | C 2291 |
Factor diagram for the Latin Square??
The actual data structure for analysis is “long” format
subject | treatment | period | group | c_curve |
---|---|---|---|---|
1 | solution | 1 | A | 1799 |
1 | capsule | 2 | C | 1846 |
1 | tablet | 3 | B | 2147 |
2 | capsule | 1 | C | 2075 |
2 | tablet | 2 | B | 1156 |
2 | solution | 3 | A | 1777 |
3 | tablet | 1 | B | 1396 |
3 | solution | 2 | A | 868 |
3 | capsule | 3 | C | 2291 |
We can make a parallel dot graph
And check for equal standard deviations
library(mosaic)
sd <- favstats(c_curve ~ treatment, data = bioequivalence)[,8]
max(sd)/min(sd)
[1] 2.387418
\[ {y}_{ijk}={\mu}+{\alpha}_{i}+{\beta}_{j}+{\tau}_{k}+{e}_{ijk} \]
Source | SS | df | MS | F |
---|---|---|---|---|
rows | \( \sum_{i=1}^{p}p(\bar{y}_{i..}-\bar{y}_{...})^{2} \) | \( p-1 \) | \( \frac{{SS}_{A}}{{df}_{A}} \) | \( \frac{{MS}_{A}}{{MS}_{E}} \) |
columns | \( \sum_{j=1}^{p}p(\bar{y}_{.j.}-\bar{y}_{...})^{2} \) | \( p-1 \) | \( \frac{{SS}_{B}}{{df}_{B}} \) | \( \frac{{MS}_{B}}{{MS}_{E}} \) |
treatment | \( \sum_{k=1}^{p}p(\bar{y}_{..k}-\bar{y}_{...})^{2} \) | \( p-1 \) | \( \frac{{SS}_{T}}{{df}_{T}} \) | \( \frac{{MS}_{T}}{{MS}_{E}} \) |
Error | \( \sum_{i=1}^{p}\sum_{j=1}^{p}({y}_{ijk}-\bar{y}_{i..}-\bar{y}_{.j.}-\bar{y}_{..k}+2\bar{y}_{..})^{2} \) | \( (p-1)(p-2) \) | \( \frac{{SS}_{E}}{{df}_{E}} \) |
ls_mod <- lm(c_curve ~ treatment + period + subject, data = bioequivalence)
anova(ls_mod)
Analysis of Variance Table
Response: c_curve
Df Sum Sq Mean Sq F value Pr(>F)
treatment 2 608891 304445 67.733 0.014549 *
period 2 928006 464003 103.231 0.009594 **
subject 2 261115 130557 29.047 0.033282 *
Residuals 2 8990 4495
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
bioequivalence <- bioequivalence %>%
mutate(fitted = fitted(ls_mod),
residuals = residuals(ls_mod))
ggplot(bioequivalence, aes(x = fitted, residuals)) +
geom_point() +
geom_hline(yintercept = 0, color = "red")
A plant breeder wishes to study the effects of soil drainage and variety of tulip bulbs on flower production. Twelve 3m by 10m experimental sites are available in the test garden–each is a .5m deep trench. You can manipulate soil drainage by changing the ratio of sand to clay for the soil you put in a trench. After talking to your collaborator, you decided that four different levels of soil drainage would suffice. You'll be testing 15 different types of tulips, and measuring flower production in the spring.