Latin Square Design

Prof Randi Garcia
January 27, 2021

Reading contemplation question

  1. Imagine a latin square design where the factor of interest has four levels. How many levels will you need of the blocking factor 1? How many levels of blocking factor 2?

Announcements

  • Quiz 1 due on Thursday 11:55p
  • HW6 due on Tuesday 9:20a
  • MP2 starts tonight!

Agenda

  • ANOVA for CB[1]
  • Latin square designs

Example

Google slides

Male albino laboratory rats are used routinely in many kinds of experiments. This experiment was designed to determine the requirements for protein and amino acid threonine in their food. Specifically, the experiment is interested in testing the combinations of eight levels of threonine (.2 through .9% of diet) and five levels of protein (8.68, 12, 15, 18, and 21% of diet). Baby rats were separated into five groups of 40 to form groups of approximately the same weight. The 40 rats in each group were randomly assigned to each of the 40 conditions. Body weight and food consumption were measured twice weekly, and the average daily weight gain over 21 days was recorded.

Example - Basic Factorial Design [2]

Male albino laboratory rats are used routinely in many kinds of experiments. This experiment was designed to determine the requirements for protein and amino acid threonine in their food. Specifically, the experiment is interested in testing the combinations of eight levels of threonine (.2 through .9% of diet) and five levels of protein (8.68, 12, 15, 18, and 21% of diet). 200 baby rats were randomly assigned to each of the 40 conditions. Body weight and food consumption were measured twice weekly, and the average daily weight gain over 21 days was recorded.

Example

This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use three subjects, and each subject will be given the drug three times, once for each method. The area under the time-concentration curve is recorded for each subject after each method of drug delivery.

Example - Basic Factorial Design

This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use nine subjects, and randomly assign subjects to one of the three delivery methods. The area under the time-concentration curve is recorded for each subject after being given the drug.

Design Principal: Blocking

  • Blocking is using a factor that is not of research interest – But there may be differences across blocks on the response variable
  • A “block” is a level of a blocking factor
  • We use blocking to improve precision/statistical power of our factor of interest

Three Ways to Block

  1. Sort units into similar groups
    • Albino rats
  2. Subdivide larger chunks of material into sets of smaller pieces
    • Mealybug trees
  3. Reuse subjects or material in each of several times slots
    • Drug study

Complete Block Design, CB[1]

  • Experimental material are separated into groups (or reused) to create similar units
  • Then each unit within a block is then is assigned one level of the factor of interest
  • “Complete Block” means that every block x treatment combination is tested

Inappropriate Insects

Modern zoos try to reproduce natural habitats in their exhibits as much as possible. They try to use appropriate plants, but these plants can be infested with inappropriate insects. Cycads (plants that look vaguely like palms) can be infected with mealybugs, and the zoo wishes to test three treatments: 1) water, 2) horticultural oil, and 3) fungal spores in water. Nine infested cycads are taken to the testing area. Three branches are randomly selected from each tree, and 3 cm by 3 cm patches are marked on each branch. The number of mealybugs on the patch is counted. The three treatments then get randomly assigned to the three branches for each tree. After three days the mealybugs are counted again. The change in number of mealybugs is computed (\( before-after \)).

Inappropriate Insects

treatment tree1 tree2 tree3 tree4 tree5
oil 4 29 14 14 7
spores -4 29 4 -2 11
water -9 18 10 9 -6

Draw the factor diagram, labeling inside outside factors.

Formal ANOVA for CB[1]

\[ {y}_{ij}={\mu}+{\tau}_{i}+{\beta}_{j}+{e}_{ij} \]

Source SS df MS F
Treatment \( \sum_{i=1}^{a}b(\bar{y}_{i.}-\bar{y}_{..})^{2} \) \( a-1 \) \( \frac{{SS}_{T}}{{df}_{T}} \) \( \frac{{MS}_{T}}{{MS}_{E}} \)
Blocks \( \sum_{j=1}^{b}a(\bar{y}_{.j}-\bar{y}_{..})^{2} \) \( b-1 \) \( \frac{{SS}_{B}}{{df}_{B}} \) \( \frac{{MS}_{B}}{{MS}_{E}} \)
Error \( \sum_{i=1}^{a}\sum_{j=1}^{b}({y}_{ij}-\bar{y}_{i.}-\bar{y}_{.j}+\bar{y}_{..})^{2} \) \( (a-1)(b-1) \) \( \frac{{SS}_{E}}{{df}_{E}} \)

Data Analysis Structure

mealybugs
    tree treatment bugs_change
1  tree1     water          -9
2  tree1    spores          -4
3  tree1       oil           4
4  tree2     water          18
5  tree2    spores          29
6  tree2       oil          29
7  tree3     water          10
8  tree3    spores           4
9  tree3       oil          14
10 tree4     water           9
11 tree4    spores          -2
12 tree4       oil          14
13 tree5     water          -6
14 tree5    spores          11
15 tree5       oil           7

Formal ANOVA

mod <- lm(bugs_change ~ treatment + tree, data = mealybugs)

anova(mod)
Analysis of Variance Table

Response: bugs_change
          Df  Sum Sq Mean Sq F value   Pr(>F)   
treatment  2  218.13  109.07  2.9963 0.106846   
tree       4 1316.40  329.10  9.0412 0.004603 **
Residuals  8  291.20   36.40                    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Bioequivalence of drug delivery

This experiment is interested in the blood concentration of a drug after it has been administered. The concentration will start at zero, then go up, and back down as it is metabolized. This curve may differ depending on the form of the drug (a solution, a tablet, or a capsule). We will use three subjects, and each subject will be given the drug three times, once for each method. The area under the time-concentration curve is recorded for each subject after each method of drug delivery.

Latin Square Design

In the bioequivalence example, because the body may adapt to the drug in some way, each drug will be used once in the first period, once in the second period, and once in the third period.

  • We can use a Latin Square design to control the order of drug administration
  • In this way, time is a second blocking factor (subject is the first)

Latin Square Design

Treatments:

  • Solution is treatment A
  • Tablet is treatment B
  • Capsule is treatment C
timeslot 1 timeslot 2 timeslot 3
subject 1 A 1799 C 2075 B 1396
subject 2 C 1846 B 1156 A 868
subject 3 B 2147 A 1777 C 2291

Factor diagram for the Latin Square??

Latin Square Design

The actual data structure for analysis is “long” format

subject treatment period group c_curve
1 solution 1 A 1799
1 capsule 2 C 1846
1 tablet 3 B 2147
2 capsule 1 C 2075
2 tablet 2 B 1156
2 solution 3 A 1777
3 tablet 1 B 1396
3 solution 2 A 868
3 capsule 3 C 2291

Informal ANOVA for Latin Square

We can make a parallel dot graph

plot of chunk unnamed-chunk-6

And check for equal standard deviations

library(mosaic)

sd <- favstats(c_curve ~ treatment, data = bioequivalence)[,8]

max(sd)/min(sd)
[1] 2.387418

Formal ANOVA for the Latin Square

\[ {y}_{ijk}={\mu}+{\alpha}_{i}+{\beta}_{j}+{\tau}_{k}+{e}_{ijk} \]

  • \( {\mu} \) is the benchmark
  • \( {\alpha}_{i} \) is the row effect
  • \( {\beta}_{j} \) is the column effect
  • \( {\tau}_{k} \) is the treatment effect
  • There are p rows, columns, and treatments
Source SS df MS F
rows \( \sum_{i=1}^{p}p(\bar{y}_{i..}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{A}}{{df}_{A}} \) \( \frac{{MS}_{A}}{{MS}_{E}} \)
columns \( \sum_{j=1}^{p}p(\bar{y}_{.j.}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{B}}{{df}_{B}} \) \( \frac{{MS}_{B}}{{MS}_{E}} \)
treatment \( \sum_{k=1}^{p}p(\bar{y}_{..k}-\bar{y}_{...})^{2} \) \( p-1 \) \( \frac{{SS}_{T}}{{df}_{T}} \) \( \frac{{MS}_{T}}{{MS}_{E}} \)
Error \( \sum_{i=1}^{p}\sum_{j=1}^{p}({y}_{ijk}-\bar{y}_{i..}-\bar{y}_{.j.}-\bar{y}_{..k}+2\bar{y}_{..})^{2} \) \( (p-1)(p-2) \) \( \frac{{SS}_{E}}{{df}_{E}} \)

Formal ANOVA for the Latin Square

ls_mod <- lm(c_curve ~ treatment + period + subject, data = bioequivalence)

anova(ls_mod)
Analysis of Variance Table

Response: c_curve
          Df Sum Sq Mean Sq F value   Pr(>F)   
treatment  2 608891  304445  67.733 0.014549 * 
period     2 928006  464003 103.231 0.009594 **
subject    2 261115  130557  29.047 0.033282 * 
Residuals  2   8990    4495                    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual Plot

bioequivalence <- bioequivalence %>%
  mutate(fitted = fitted(ls_mod), 
         residuals = residuals(ls_mod))

ggplot(bioequivalence, aes(x = fitted, residuals)) +
  geom_point() +
  geom_hline(yintercept = 0, color = "red")

Residual Plot

plot of chunk unnamed-chunk-10

Tulips

A plant breeder wishes to study the effects of soil drainage and variety of tulip bulbs on flower production. Twelve 3m by 10m experimental sites are available in the test garden–each is a .5m deep trench. You can manipulate soil drainage by changing the ratio of sand to clay for the soil you put in a trench. After talking to your collaborator, you decided that four different levels of soil drainage would suffice. You'll be testing 15 different types of tulips, and measuring flower production in the spring.