Measures of
Association
Fundamental Epidemiological Concepts and Approaches
Kiffer G. Card, PhD, Faculty of Health Sciences, Simon Fraser University
Learning objectives for this lesson:
- Calculate and interpret the risk ratio, incidence rate ratio, and odds ratio
- Compute risk difference, attributable fraction (exposed), and population attributable measures
- Understand when to use each measure of association
- Correctly distinguish between strength of association and statistical significance
- Understand the basis for hypothesis tests and confidence intervals
This course was developed by Kiffer G. Card, PhD, as a companion to Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.
Introduction & Ratio Measures of Association
⏱ Estimated reading time: 15 minutes
Learning Objectives
- Explain why measures of association are used in epidemiology.
- Set up incidence risk and incidence rate data in 2×2 tables.
- Calculate and interpret the risk ratio (RR), incidence rate ratio (IR), and odds ratio (OR).
- Describe the relationships among RR, IR, and OR.
Why Measure Association?
Measures of association assess the magnitude of the relationship between an exposure (a potential cause) and a disease. Unlike measures of statistical significance, which are heavily dependent on sample size, measures of association indicate the strength of the effect — how much more (or less) likely disease is in exposed compared to non-exposed groups.
Strength vs. Significance
A measure of association tells you how strongly an exposure is linked to disease. A P-value tells you how likely the observed data would be under the null hypothesis of no association. A strong association can be non-significant (small sample), and a weak association can be highly significant (large sample). Always report both.
Data Layout
Depending on study design, disease frequency can be expressed as incidence risk, incidence rate, prevalence, or odds. For risk data, the standard 2×2 table is:
| Exposed | Non-exposed | Total | |
|---|---|---|---|
| Diseased | a1 | a0 | m1 |
| Non-diseased | b1 | b0 | m0 |
| Total | n1 | n0 | n |
For rate data, the denominator is person-time at risk rather than the number of individuals:
| Exposed | Non-exposed | Total | |
|---|---|---|---|
| Number of cases | a1 | a0 | m1 |
| Person-time at risk | t1 | t0 | t |
Three Ratio Measures of Association
Click each card to learn more:
Worked Example: Brazil Water Cistern Study
Diarrhea & Water Cistern Presence
| Water Cistern | No Cistern | Total | |
|---|---|---|---|
| Diarrhea Present | 194 | 303 | 497 |
| Diarrhea Absent | 1,588 | 1,314 | 2,902 |
| Total | 1,782 | 1,617 | 3,399 |
- RR = (194/1782) / (303/1617) = 0.109 / 0.187 = 0.58
- OR = (194 × 1314) / (303 × 1588) = 0.53
Both measures indicate that having a water cistern is protective against diarrhea (values < 1). The RR of 0.58 means the risk is 42% lower among those with cisterns.
Worked Example: Migraine Incidence Rates
Gender and Migraine (Ages 30–40)
| Female | Male | Total | |
|---|---|---|---|
| Cases of migraine | 131 | 44 | 175 |
| Person-months | 250 | 236 | 486 |
IR = (131/250) / (44/236) = 0.524 / 0.186 = 2.81
The rate of migraine is 2.81 times higher in females than males aged 30–40.
Relationships Among RR, IR, and OR
In general, IR values are further from the null (1) than RR values, and OR values are even further away. This can be visualised on a number line:
Figure 6.1 — General relationships among RR, IR, and OR. OR is always furthest from the null value of 1.
When the disease is rare (prevalence or incidence risk < 5%), OR approximates RR. This is because when a1 is small relative to n1, the denominator of the odds (b1) is approximately equal to n1, and similarly for the non-exposed group. In the cistern example, the overall risk was 14.6%, so OR (0.53) was more extreme than RR (0.58).
RR and IR will be close to each other if the exposure has a negligible impact on the total time at risk in the study population. This occurs when the disease is rare or when IR is close to the null value (IR ≈ 1).
OR is a good estimator of IR under certain conditions in case-control studies. If controls are selected using cumulative or risk-based sampling (all non-cases after cases have occurred), then OR estimates IR only if the disease is rare. If controls are selected using density sampling (a control selected from non-cases each time a case occurs), then OR is a direct estimate of IR regardless of disease rarity.
Key Takeaways
- Measures of association quantify the strength of the exposure-disease relationship, unlike P-values which reflect sample size.
- RR compares risks, IR compares incidence rates, and OR compares odds between exposed and non-exposed groups.
- OR is the only measure that can be computed from case-control studies due to its symmetry property.
- When disease is rare (<5%), OR ≈ RR. IR values are further from the null than RR, and OR values further still.
1. The odds ratio (OR) is the only ratio measure of association applicable to case-control studies because:
2. A risk ratio of 0.58 for diarrhea in a cistern study indicates:
3. Under what condition does OR best approximate RR?
✦ Pass the knowledge check with 100% to continue
Measures of Effect in the Exposed Group
⏱ Estimated reading time: 15 minutes
Learning Objectives
- Distinguish between “ratio” (relative) and “difference” (absolute) measures of association.
- Calculate and interpret the risk difference (RD) and incidence rate difference (ID).
- Calculate and interpret the attributable fraction in the exposed (AFe).
- Explain the concept of vaccine efficacy as a special case of AFe.
Ratio vs. Difference Measures
The ratio measures from Section 1 (RR, IR, OR) tell us the relative strength of association, but they do not indicate the absolute number of cases attributable to the exposure. Difference (absolute effect) measures address this gap by computing how many additional cases occur because of the exposure.
Why Both Matter
Even when an exposure is very strongly associated with disease (high RR), if the exposure is rare in a population, it may contribute very few cases. Conversely, a relatively weak risk factor (modest RR) that is common can be responsible for many cases. Difference measures capture this “public health impact.”
Risk Difference (RD) — Attributable Risk
The risk difference (RD), also called attributable risk, is simply the risk in the exposed group minus the risk in the non-exposed group:
Similarly, the incidence rate difference (ID) is the difference between two incidence rates:
Interpretation of Difference Measures
- RD or ID < 0 → Exposure is protective
- RD or ID = 0 → No effect of exposure
- RD or ID > 0 → Exposure is positively associated with disease
RD indicates the increase (or decrease) in the probability of disease in the exposed group, beyond the baseline risk. It tells you: “For every X exposed individuals, how many additional cases occur because of the exposure?”
Example: Smoking & Low Birth Weight
From a cohort of 5,000 women followed through pregnancy:
| Smoker | Non-smoker | Total | |
|---|---|---|---|
| Low birth weight | 40 | 331 | 371 |
| Normal birth weight | 311 | 4,318 | 4,629 |
| Total | 351 | 4,649 | 5,000 |
- Risk in exposed: RE+ = 40/351 = 0.114
- Risk in non-exposed: RE− = 331/4649 = 0.071
- RD = 0.114 − 0.071 = 0.043
For every 100 women who smoked, approximately 4.3 had a low-birth-weight baby due to the fact that they smoked (assuming causal relationship).
Attributable Fraction in the Exposed (AFe)
The AFe expresses the proportion of disease in exposed individuals that is due to the exposure, assuming the relationship is causal. It can be viewed as the proportion of disease in the exposed group that would be avoided if the exposure were removed.
AFe ranges from 0 (where risk is equal, RR = 1) to 1 (where all disease in the exposed group is due to the exposure, RR = ∞). In case-control studies, AFe can be approximated by substituting OR for RR.
Worked Example: AFe for Smoking
From the smoking example above:
- RR = 0.114 / 0.071 = 1.60
- AFe = (1.60 − 1) / 1.60 = 0.60 / 1.60 = 0.375 (37.5%)
Among women who smoked, 37.5% of the low-birth-weight cases were attributable to smoking. Alternatively: 0.043 / 0.114 = 0.377 ≈ 37.7% (slight rounding difference).
Vaccine Efficacy
Vaccine efficacy is a special form of AFe, where “not vaccinated” is the exposure (factor positive) and “vaccinated” is the comparison group. If 20% of unvaccinated individuals develop disease versus 5% of vaccinated individuals:
Vaccine Efficacy Calculation
- RD = 0.20 − 0.05 = 0.15
- AFe = 0.15 / 0.20 = 0.75 (75%)
The vaccine has prevented 75% of the cases of disease that would have occurred in the vaccinated group if the vaccine had not been used.
AFe vs. Etiologic Fraction
The etiologic fraction is the proportion of cases in the exposed group for which exposure was a component of the sufficient cause. While AFe measures the excess fraction, the etiologic fraction can be higher because exposure may contribute to cases even when the baseline risk would have produced them eventually. In general, AFe provides a lower bound for the etiologic fraction.
Reflection
In a cohort study, a new environmental pollutant is found to have an RR of 3.0 for respiratory disease. The risk of respiratory disease in the non-exposed population is 2%. Calculate the RD and AFe. If 1,000 people are exposed, how many additional cases would you expect due to the exposure? Discuss why RD and AFe give different but complementary perspectives.
Minimum 20 characters required.
Key Takeaways
- RD (attributable risk) measures the absolute increase in risk due to exposure; the null value is 0.
- AFe = (RR − 1)/RR gives the proportion of disease in the exposed that is due to the exposure.
- Vaccine efficacy is a special case of AFe where the “exposure” is being unvaccinated.
- AFe provides a lower bound for the etiologic fraction.
1. If the risk of disease is 12% in the exposed group and 4% in the non-exposed group, the risk difference (RD) is:
2. A vaccine efficacy of 75% means:
3. AFe = (RR − 1)/RR. If RR = 2.5, what is AFe?
✦ Pass the knowledge check with 100% and complete the reflection to continue
Population-Level Measures & Study Design
⏱ Estimated reading time: 12 minutes
Learning Objectives
- Calculate and interpret the population attributable risk (PAR) and population attributable fraction (AFp).
- Explain how the prevalence of exposure affects population-level measures.
- Identify which measures of association can be computed from each study design.
From the Exposed Group to the Entire Population
While RD and AFe describe the effect of exposure among exposed individuals, public health decisions often require understanding the impact of an exposure on the entire population. Two key population-level measures address this:
Population Attributable Risk (PAR)
The PAR is the increase in overall population risk attributable to the exposure. It reflects both the strength of the association and the frequency of the exposure in the population.
Population Attributable Fraction (AFp)
The AFp indicates the proportion of disease in the entire population that is attributable to the exposure, and which would be avoided if the exposure were removed (assuming causation and no confounding).
Why Exposure Prevalence Matters
A strong risk factor (high RR) that is rare in the population will have a small AFp. A weaker risk factor (modest RR) that is common may have a large AFp. For example, intravenous drug use has a very high RR for HIV, but if it is rare in the population, eliminating it would prevent few total cases. A modestly elevated risk factor like poor diet, affecting millions, may account for more total cases.
Worked Example: Smoking & Low Birth Weight (Population Level)
From the cohort of 5,000 women (351 smokers, 4,649 non-smokers):
- Overall risk: p(D+) = 371/5000 = 0.074
- Risk in non-exposed: 331/4649 = 0.071
- PAR = 0.074 − 0.071 = 0.003
- AFp = 0.003 / 0.074 = 0.041 (4.1%)
Only 4.1% of all low-birth-weight babies in the population were attributable to smoking. The low AFp is because very few women (351/5000 = 7%) smoked during the 2nd trimester, despite the relatively strong association (RR = 1.60).
Confounding and AFp
If confounding is present, adjusted estimates of RR should be used. The AFp can then be estimated using:
where pd is the proportion of cases exposed to the risk factor, and aRR is the adjusted risk ratio. For multiple exposure categories, a summation formula is used.
Study Design and Measures of Association
Not all measures can be computed from all study designs. The following table summarises which measures are available:
| Measure | Cross-sectional | Cohort | Case-control |
|---|---|---|---|
| RR | ✓ | ✓ | |
| IR | ✓ | ||
| OR | ✓ | ✓ | ✓ |
| RD | ✓ | ✓ | |
| AFe | ✓ | ✓ | ✓b |
| PAR | ✓ | ✓a | |
| AFp | ✓ | ✓a | ✓c |
a Requires independent estimate of p(D+) or p(E+). b Estimated using OR. c Requires OR and independent estimate of p(E+|D+).
Reflection
Consider two risk factors for a disease: Factor A has RR = 5.0 and affects 2% of the population. Factor B has RR = 1.5 and affects 40% of the population. Calculate AFp for each factor using the formula AFp = p(E+)(RR − 1) / [p(E+)(RR − 1) + 1]. Which factor would you prioritise in a public health intervention, and why?
Minimum 20 characters required.
Key Takeaways
- PAR = overall population risk − risk in unexposed; it reflects both strength and prevalence of exposure.
- AFp = PAR / p(D+); it gives the proportion of all disease in the population attributable to the exposure.
- A common risk factor with modest RR can have a larger AFp than a rare factor with high RR.
- Different study designs support different measures: only OR is available from case-control studies.
1. A risk factor has RR = 4.0 but affects only 1% of the population. The AFp is:
2. Which measure cannot be computed directly from a case-control study?
3. PAR differs from RD in that:
✦ Pass the knowledge check with 100% and complete the reflection to continue
Hypothesis Testing & Confidence Intervals
⏱ Estimated reading time: 15 minutes
Learning Objectives
- Explain the concepts of standard error, null hypothesis, and P-value.
- Describe the four common test statistics for evaluating associations.
- Interpret confidence intervals for measures of association.
- Distinguish between statistical significance and the strength of association.
Standard Error
The standard error (SE) provides a measure of the precision of a point estimate — how much uncertainty exists in the estimate. For difference measures (RD, ID), the variance can be computed directly:
For ratio measures (RR, IR, OR), the variance is computed on the log scale using Taylor series approximations:
Hypothesis Testing
Significance testing is based on specifying a null hypothesis about the population parameter. The null hypothesis typically states there is no association:
- For difference measures (RD, ID): H0: θ = 0
- For ratio measures (RR, IR, OR): H0: θ = 1
An alternative hypothesis can be 1-tailed or 2-tailed. In general, 2-tailed hypotheses are preferred because 1-tailed hypotheses are harder to justify.
Limitations of P-values
P-values are often dichotomised into “significant” or “non-significant” at α = 0.05, but this entails a huge loss of information. A P-value of 0.049 and 0.051 lead to different conclusions despite being virtually identical. Always report the actual P-value and a confidence interval, which conveys both significance and precision.
Test Statistics
Click each card to explore:
Confidence Intervals
A confidence interval (CI) reflects the level of uncertainty in a point estimate. A 95% CI means that if the study were repeated many times under identical conditions, 95% of the computed CIs would contain the true parameter value.
Computing CIs
For difference measures, the CI is computed directly:
For ratio measures, the CI is computed on the log scale and then exponentiated:
The CI is symmetrical about lnθ but not about θ itself — this is why confidence intervals for ratio measures appear asymmetric.
Interpreting CIs for Measures of Association
- For RR, IR, OR: if the 95% CI includes 1, the association is not statistically significant at α = 0.05.
- For RD, ID: if the 95% CI includes 0, the association is not statistically significant.
However, this “surrogate significance test” is an under-use of the CI. The CI also shows the range of plausible effect sizes, which is far more informative than a binary significant/non-significant classification.
Example CIs from the Textbook
| Measure | Point Estimate | 95% CI |
|---|---|---|
| RD (smoking) | 0.043 | (0.009, 0.077) |
| RR (smoking) | 1.601 | (1.174, 2.182) |
| OR (smoking) | 1.678 | (1.154, 2.387) |
| ID (migraine) | 0.338 | (0.232, 0.443) |
| IR (migraine) | 2.811 | (1.983, 4.050) |
None of the CIs for ratio measures include 1, and none for difference measures include 0, confirming statistical significance for all associations.
Reflection
A study reports an OR of 1.45 with a 95% CI of (0.92, 2.28). A second study reports an OR of 1.15 with a 95% CI of (1.02, 1.30). Compare these two findings in terms of: (a) strength of association, (b) statistical significance, and (c) precision. Which finding might be more concerning from a public health perspective, and why?
Minimum 20 characters required.
Key Takeaways
- Standard errors quantify precision; they are computed differently for difference vs. ratio measures.
- Hypothesis testing uses a null hypothesis (no effect) and a test statistic to generate a P-value.
- Four common test statistics: Pearson χ², exact tests, Wald, and likelihood ratio tests.
- Confidence intervals are more informative than P-values: they show the range of plausible effect sizes.
- For ratio measures, the CI containing 1 (or 0 for differences) indicates non-significance at the corresponding α level.
1. A 95% confidence interval for an odds ratio is (1.2, 3.8). This means:
2. Why are confidence intervals for ratio measures (like OR) asymmetric around the point estimate?
3. Which test statistic is generally considered superior in regression settings?
✦ Pass the knowledge check with 100% and complete the reflection to continue
Final Review & Assessment
⏱ Estimated time: 20 minutes
Lesson Summary
In this lesson, you explored the key measures used to quantify the association between an exposure and a disease outcome. Let’s review:
Three ratio measures compare disease frequency between exposed and non-exposed groups: risk ratio (RR), incidence rate ratio (IR), and odds ratio (OR). RR and IR are computed from cohort studies; OR is the only measure applicable to case-control studies due to its symmetry. When disease is rare, OR approximates RR.
Risk difference (RD) measures the absolute increase in risk due to exposure. The attributable fraction (AFe) gives the proportion of disease in exposed individuals attributable to the exposure. Vaccine efficacy is a special case of AFe. AFe provides a lower bound for the etiologic fraction.
PAR and AFp extend effect measures to the population level. They depend on both the strength of association and the prevalence of exposure. A common weak risk factor can have greater population impact than a rare strong one. Different study designs support different measures.
Standard errors quantify precision. Hypothesis tests (Pearson χ², exact tests, Wald, LRT) produce P-values. Confidence intervals are more informative than P-values, showing the range of plausible effect sizes. CIs for ratio measures are computed on the log scale, producing asymmetric intervals.
Reflection
A colleague presents findings from a case-control study showing OR = 2.3 (95% CI: 1.1, 4.8) for the association between a workplace chemical exposure and bladder cancer. She concludes the chemical “causes 2.3 times the risk of bladder cancer.” Evaluate this statement. What can and cannot be concluded from this study? Discuss the roles of strength of association, statistical significance, the rare disease assumption, and the distinction between OR and RR.
Minimum 20 characters required.
Final Assessment
Complete all 15 questions below with 100% accuracy to finish this lesson. You must also complete the reflection above before submitting.
1. Measures of association differ from measures of statistical significance in that they:
2. In a cohort study, 150 of 2,000 exposed individuals and 75 of 2,000 non-exposed individuals develop the disease. What is the risk ratio?
3. The odds ratio can be calculated from a case-control study because:
4. RD = 0.043 in the smoking and low-birth-weight example means:
5. If RR = 4.0, the attributable fraction in the exposed (AFe) is:
6. Vaccine efficacy of 80% indicates:
7. A common risk factor with a modest RR may have a larger AFp than a rare risk factor with a high RR because:
8. PAR is best described as:
9. The null value for the risk ratio (RR) is:
10. A 95% CI for RR of (0.85, 1.32) suggests:
11. Confidence intervals for OR are asymmetric around the point estimate because:
12. In the formula var(ln OR) = 1/a1 + 1/a0 + 1/b1 + 1/b0, increasing all cell counts will:
13. Which of the following measures CANNOT be estimated from a case-control study, even with external data?
14. The Pearson χ² test is most appropriate when:
15. A study finds RR = 1.8 (P = 0.40). Which interpretation is most appropriate?
✦ Complete the final reflection above before submitting