Confounding and Causal Inference

Fundamental Epidemiological Concepts and Approaches

Learning objectives for this lesson:

Apply criteria to identify potential confounders in observational studies
Use restricted sampling and matching to prevent confounding
Implement matching in both cohort and case-control study designs
Use causal diagrams (DAGs) to identify confounders needing control
Apply stratified analysis (Mantel-Haenszel) to control confounding and assess interaction
Understand propensity scores, instrumental variables, and marginal structural models
Evaluate the potential of unmeasured confounders using sensitivity analysis
Interpret the effects of controlling different types of extraneous variables

This course was developed by Dr. Kiffer G. Card, Faculty of Health Sciences, Simon Fraser University based on Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.

Reference

Glossary: Key Terms, People & Concepts

📚 Reference page, available throughout the lesson

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments. Type in the search box to filter entries.

Causal Structure

Confounder / Confounding A variable associated with the exposure and a cause of the outcome (other than via the exposure) whose imbalance distorts the exposure-outcome association. Classically, satisfies the three criteria: associated with exposure, independent risk factor for outcome, not on the causal pathway. For a modern structural definition see VanderWeele & Shpitser (2013).

Mediator A variable on the causal pathway between exposure and outcome (E → M → Y). Adjusting for a mediator removes part of the total causal effect and can introduce collider bias.

Collider A variable that is a common effect of two or more variables (E → C ← Y). Conditioning on a collider can open a non-causal path and induce spurious associations.

Directed Acyclic Graph (DAG) A graphical representation of assumed causal relationships between variables, used to identify confounders, colliders, mediators, and minimally sufficient adjustment sets. Introduced to epidemiology by Greenland, Pearl, & Robins (1999). The structure is usually drawn as a directed acyclic graph.

Backdoor Path A non-causal path from exposure to outcome that begins with an arrow into the exposure. Open backdoor paths produce confounding; the backdoor criterion (Pearl, 1995) identifies sets of variables that, when conditioned on, block all such paths.

d-Separation A graph-theoretic criterion for determining whether two variables are conditionally independent given a set of other variables, given a DAG structure.

Counterfactual / Potential Outcomes The framework in which each individual has a potential outcome under each level of exposure. The causal effect is the contrast between these counterfactual outcomes; in observational data, only one is observed.

Exchangeability (No Unmeasured Confounding) The assumption that exposed and unexposed groups would have had the same outcome distribution had they received the same exposure. Conditional exchangeability holds within levels of measured covariates.

Positivity The assumption that, within every covariate stratum, every level of exposure has a positive probability of being observed. Required for valid standardisation and IPW.

Effect Modification (Interaction) When the magnitude of an exposure-outcome effect differs across levels of a third variable. Effect modification is a feature of the causal system, not bias to be removed.

Time-Varying Confounding When a confounder is itself affected by past exposure and influences future exposure and outcome (treatment-confounder feedback). Standard regression fails; marginal structural models or g-methods are required.

Methods to Control Confounding

Restriction Limiting study eligibility to a single level of a confounder (e.g., only never-smokers). Eliminates confounding by that variable but reduces generalisability and may not address other confounders.

Matching Selecting controls with the same values of confounders as cases (or unexposed with same values as exposed). In case-control designs, requires conditional analysis; in cohort designs, can produce balanced cohorts directly.

Stratification & Mantel-Haenszel Computing exposure-outcome estimates separately within strata of a confounder, then combining via a weighted summary (Mantel-Haenszel). Reveals effect modification but has limited capacity for many confounders.

Standardisation Computing the marginal effect that would have been observed if every individual had a specified exposure value, by averaging stratum-specific outcomes weighted by a chosen population structure (direct standardisation).

Multivariable Regression Adjustment Including confounders as covariates in a regression model for the outcome. Provides conditional effect estimates and assumes correct model specification (functional form, no unmeasured confounders).

Propensity Score The conditional probability of exposure given measured covariates. Used for matching, stratification, weighting, or covariate adjustment to balance confounders between exposure groups (Rosenbaum & Rubin, 1983; reviewed in Austin, 2011).

Inverse Probability Weighting (IPW) Each observation is weighted by the inverse of its probability of receiving the observed exposure, creating a pseudo-population in which exposure is independent of measured confounders (Hernán, Brumback, & Robins, 2000).

Marginal Structural Model (MSM) A model for the marginal distribution of counterfactual outcomes, fit using IPW. Designed to handle time-varying confounding affected by prior exposure (Robins, Hernán, & Brumback, 2000).

G-Methods (g-Formula, g-Estimation, IPW) A family of methods developed by Robins (1986) for estimating causal effects in the presence of time-varying confounders affected by prior exposure.

Instrumental Variable (IV) A variable that affects exposure, has no direct effect on the outcome, and shares no causal ancestors with the outcome other than through exposure (Angrist, Imbens, & Rubin, 1996; Hernán & Robins, 2006). Mendelian randomisation (Davey Smith & Ebrahim, 2003) is the most prominent IV approach in epidemiology.

Negative Control An exposure or outcome chosen because it shares the suspected unmeasured-confounder structure with the primary analysis but should be causally null. A non-null negative-control association suggests residual confounding.

Key People

Judea Pearl (1936– ) Computer scientist whose work on causal graphs, the backdoor criterion (Pearl, 1995), do-calculus, and the structural causal model framework provided the formal language now used to reason about confounding via DAGs.

James M. Robins (1950– ) Harvard biostatistician who developed g-methods (Robins, 1986), marginal structural models, and structural nested models, the analytic backbone for handling time-varying confounding affected by prior exposure.

Sander Greenland (1951– ) Epidemiologist whose extensive writing on confounding, collapsibility, bias analysis, and DAGs helped translate causal-inference theory into routine epidemiologic practice.

Miguel A. Hernán (1968– ) Harvard epidemiologist; co-author with Robins of Causal Inference: What If and a leading voice in framing observational analyses as “target trials” (Hernán & Robins, 2016).

Donald B. Rubin (1943– ) Statistician who formalised the potential-outcomes framework (the Rubin Causal Model) and co-developed propensity-score methods with Paul Rosenbaum (Rosenbaum & Rubin, 1983).

No matching entries. Try a different search term.

Section 1 of 5

Introduction & Pre-Analysis Control of Confounding

⏱ Estimated reading time: 25 minutes

Lesson 12 · HSCI 341

Confounding and Causal Inference

The course's closing thread: moving from an observed association to a defensible claim about cause and effect.

Three criteria

What makes a variable a confounder?

C must cause D (top criterion), be associated with E (left arrow), and must not sit on the E→D causal path.

Design strategy 1

Restriction

Restrict enrollment to a single level of the confounder. Confounding by that variable becomes impossible because it no longer varies.

Complete restriction

Restrict to females in a cervical cancer study: gender confounding is fully eliminated.

Partial restriction

Restrict to ages 60–69: age confounding is reduced but residual variation remains.

Prefer the low-risk group when restricting on a dichotomous confounder, to avoid confusing interactions with confounding.

Design strategy 2

Matching: cohort versus case-control

Cohort matching

Occurs at baseline. Makes exposure independent of the matched variable. No analytical correction needed.

Case-control matching

Occurs after disease. Introduces selection bias (usually toward the null). Must be corrected analytically via stratified or conditional analysis.

The key distinction is timing: pre-exposure matching creates balance; post-disease matching creates bias.

Matched analysis

Discordant pairs and McNemar's test

Eq 12.2: Matched odds ratio

\[ \color{#0B7B6B}{\text{OR}_{\text{match}}} = \frac{\color{#C2410C}{u}}{\color{#1D4ED8}{v}} \]

OR_match matched odds ratiou exposed-case / unexposed-control pairsv unexposed-case / exposed-control pairs

Eq 12.3: McNemar's chi-squared

\[ \color{#0B7B6B}{\chi^2} = \frac{(\color{#C2410C}{u} - \color{#1D4ED8}{v})^2}{\color{#C2410C}{u} + \color{#1D4ED8}{v}} \]

χ² McNemar test statisticu one kind of discordant pairv the other kind of discordant pair

where \(u\) = pairs with an exposed case and unexposed control, \(v\) = pairs with an unexposed case and exposed control.

Carry forward

Next: detecting and adjusting

Three criteria: causes disease, associated with exposure, not on the causal pathway.
Restriction prevents confounding; tradeoff is reduced generalisability.
Matching in case-control studies introduces selection bias requiring analytical correction.

Introduction and Overview

An earlier lesson closed the second leg of the bias triad with validity in observational studies. This lesson takes the third, confounding, and pulls together the full causal inference framework. Confounding is the systematic distortion of an exposure–outcome association by a third variable that influences both. The four content sections walk through the topic in the order an investigator would actually approach it. This section covers strategies that prevent confounding before data analysis (restriction, matching). A later section turns to detecting confounding in observed data and stratified analysis (Mantel–Haenszel). A later section introduces the analytic alternatives: multivariable regression, instrumental variables, propensity scores. A later section closes with what to do about confounders you can't measure, and how to think structurally about the relationships among extraneous variables.

Learning Objectives

Define confounding and apply the three criteria for identifying a confounder (cause of disease; precedes and is associated with exposure; not an intervening variable or effect of disease).
Distinguish a population confounder from a sample confounder and decide when each warrants control.
Apply restriction as a design-stage strategy and explain its tradeoffs for generalisability.
Implement matching (frequency or individual) at the design stage and identify the analytic implications for case-control vs cohort studies.

12.1 Introduction

A central focus of epidemiological research is to identify factors that contribute to the occurrence of disease. Randomised controlled trials (RCTs) provide a probabilistic basis for balancing factors between groups. However, in observational studies we cannot randomly assign exposures, so confounding is always a concern.

What Is Confounding?

Confounding can be described as the mixing together of the effects of 2 or more factors. When confounding is present, we might think we are measuring the association between an exposure and an outcome, but the observed measure also includes the effects of one or more extraneous factors. These extraneous factors that produce the bias are called confounders or confounding factors.

A quick example to fix the idea: people who carry a cigarette lighter have far higher lung-cancer rates, yet the lighter itself causes nothing. Smokers are the people who carry lighters, and smoking is what raises cancer risk, so smoking gets mixed into any comparison of lighter-carriers with non-carriers. Smoking is the confounder here. Once you compare lighter-carriers and non-carriers who smoke the same amount, the apparent lighter effect vanishes. That is what it means to control for a confounder.

12.1.1 Which Extraneous Factors Are Confounders?

A factor is a confounder if:

It is a cause of the disease, or a surrogate for a cause, and
It precedes and is associated with the exposure in the source population, and
Its distribution across exposure levels cannot be determined by the exposure (i.e., it is not an intervening factor) or by the disease (i.e., it is not a result of the disease)

Important Distinction

Population confounder: known or regularly reported to be a confounder in the target population, should be controlled regardless of sample data.

Sample confounder: appears to be a confounder in the study data but may not truly be one in the population. We should not control for it unless there is substantive evidence.

Example 12.1: A Demonstration of Confounding

Investigating the relationship between Streptococcus pneumoniae (STREP) and childhood respiratory disease (CRD), with RSV (respiratory syncytial virus) as a potential confounder:

	STREP+	STREP−	OR
CRD+	240	40	3.3 (crude)
CRD−	6260	3460	3.3 (crude)

When stratified by RSV status, the stratum-specific ORs are both 2.0, while the crude OR is 3.3. The >30% difference indicates confounding by RSV is present. The stratum-specific OR of 2.0 is the best estimate of the causal association.

12.2 Control of Confounding Prior to Data Analysis

We can prevent and control confounding using three general procedures:

ExclusionClick to explore

MatchingClick to explore

Analytic ControlClick to explore

12.3 Matching on Confounders

In a cohort study, matching makes the exposure independent of the matched extraneous variable so there can be no confounding. The matched variable(s) can still exert an effect on the outcome, but it has the same effect in both exposure groups.

Because the outcome (e.g., disease) has not happened at the time of matching, the matching process is independent of the outcome. No analytical control of the matched confounder is necessary, and there is no bias in the summary table.

In case-control studies, the disease has already occurred when matching takes place. Matching will actually introduce a selection bias. The stronger the exposure-confounder association, the greater the bias (generally toward the null).

This bias must be controlled by stratified or matched analysis; the matched variable(s) must be included in the analytical approach.

Overmatching

Do not match unless you are certain the variable is a confounder. Matching on a variable strongly associated with exposure but not a confounder leads to overmatching, giving the distribution of exposure in controls greater similarity to cases than in the source population, which can reduce precision.

Frequency vs. Pair Matching

Feature	Frequency Matching	Pair Matching
Method	Overall distribution made equal	Individual-level matching (1:m)
Analysis	Stratified (MH procedure)	Matched-pair analysis (McNemar’s test)
Interaction	Can assess interaction	Difficult to assess interaction
Best when	Confounder has few levels	Many variables or refined categories
Control-to-case ratio	Variable	Fixed (1:1, 1:4, etc.); minimal gain beyond 4:1

Analysing Matched Data

For pair-matched data in a case-control study with 1:1 matching, we analyse the four possible exposure patterns. Only the discordant pairs (case exposed/control unexposed, or case unexposed/control exposed) contribute information:

Eq 12.2 and 12.3: Matched OR and McNemar’s test

OR_match = u / v

McNemar’s χ² = (u − v)² / (u + v)

In a pair-matched case-control study only the discordant pairs carry information. The matched odds ratio is the ratio of the two kinds of discordant pair, and McNemar’s test asks whether that ratio departs from one.

where u = pairs where case is exposed and control is not, and v = pairs where case is not exposed and control is.

Why do the other pairs drop out? If a case and their matched control were both exposed (or both unexposed), the pair shows the same exposure on each side, so it says nothing about whether exposure differs between cases and controls. Only the discordant pairs, where exactly one of the two was exposed, carry that information.

Reflection

Why does matching in case-control studies introduce selection bias while matching in cohort studies does not? Think about the timing of when disease occurs relative to the matching process.

Model answerMatching in case-control studies selects controls based on matching variables after disease has occurred. If the matching variable is associated with exposure, this introduces a selection bias because exposed and unexposed groups within the case sample are no longer comparable on that variable, because you have conditioned on something potentially associated with both exposure and outcome at the time of selection. In cohort studies, matching happens at baseline, before exposure-outcome time has elapsed. The matched groups are comparable on the matching variable at time-zero, and disease occurs subsequently, so no collider conditioning occurs. The timing distinction is structural: in case-control, matching is on a post-exposure variable; in cohort, matching is on a pre-exposure variable. Standard remedy in case-control: explicitly model the matching variable in the analysis (conditional logistic regression) and avoid over-matching on factors strongly correlated with exposure.

Minimum 20 characters required.

✓ Reflection saved

Section 2 of 5

Detection of Confounding & Stratified Analysis

⏱ Estimated reading time: 25 minutes

Section 2 of 5

Detection of Confounding & Stratified Analysis

Directed acyclic graphs, the change-in-estimate rule, and the Mantel-Haenszel stratified estimator.

Structural reasoning

Directed acyclic graphs

The backdoor path runs through C. Conditioning on C blocks that path and removes the confounded portion of the E-D association.

The practical rule

Change-in-estimate: 20–30% threshold

Percent change on log scale

\[ \color{#0B7B6B}{\% \Delta} = \frac{|\ln(\color{#C2410C}{\hat{\text{OR}}_c}) - \ln(\color{#1D4ED8}{\hat{\text{OR}}_a})|}{|\ln(\color{#C2410C}{\hat{\text{OR}}_c})|} \times 100 \]

%Δ change in estimateOR_c crude odds ratioOR_a adjusted odds ratio

where \(\hat{\text{OR}}_c\) is the crude odds ratio and \(\hat{\text{OR}}_a\) is the adjusted (Mantel-Haenszel) odds ratio. A change exceeding 20–30% indicates meaningful confounding.

The stratified estimator

Mantel-Haenszel adjusted odds ratio

Eq 12.7: Mantel-Haenszel OR

\[ \color{#0B7B6B}{\text{OR}_{\text{MH}}} = \frac{\displaystyle\sum_j \frac{\color{#C2410C}{a_{1j}}\, \color{#C2410C}{b_{0j}}}{\color{#6D28D9}{n_j}}}{\displaystyle\sum_j \frac{\color{#1D4ED8}{a_{0j}}\, \color{#1D4ED8}{b_{1j}}}{\color{#6D28D9}{n_j}}} \]

OR_MH pooled adjusted odds ratioa_1jb_0j concordant cross-product, stratum ja_0jb_1j opposite cross-product, stratum jn_j stratum total

\(a_{1j}\), \(b_{0j}\), \(a_{0j}\), \(b_{1j}\) are the stratum-\(j\) cell counts (exposed cases, unexposed non-cases, unexposed cases, exposed non-cases), and \(n_j\) is the stratum total. Verify homogeneity of stratum-specific ORs before interpreting \(\text{OR}_{\text{MH}}\).

When strata disagree

Interaction and effect modification

Additive interaction

\[\text{RD}_{10} + \text{RD}_{01} \neq \text{RD}_{11}\]
Effects may share biological pathways.

Multiplicative interaction

\[\text{RR}_{10} \times \text{RR}_{01} \neq \text{RR}_{11}\]
Most common scale in epidemiology.

When interaction is present, report stratum-specific estimates. A single summary OR is misleading.

A mathematical complication

Non-collapsibility of the odds ratio

The crude odds ratio can differ from stratum-specific odds ratios even when no confounding is present. This is non-collapsibility.

When it matters

Most common when the outcome is frequent. A 20–30% change may look like confounding but reflect the mathematical property.

Risk-based measures

Risk ratio and risk difference are collapsible. If outcome frequency is high, they are preferable.

Carry forward

Next: methods that scale up

DAGs identify which variables to control and which to leave alone.
Change-in-estimate on the log scale: >20–30% signals confounding.
\(\text{OR}_{\text{MH}}\) pools across strata after confirming homogeneity.
When strata disagree, report them separately: effect modification is not bias.

Introduction and Overview

An earlier section covered the strategies that prevent confounding before any data are looked at. This section takes over once the data are in: how do we tell whether a candidate variable is actually confounding the exposure–outcome relationship, and how do we adjust for it via stratification? Mantel–Haenszel is the workhorse method here and the conceptual ancestor of every multivariable regression you'll meet in a later course.

Learning Objectives

Use directed acyclic graphs (DAGs) to identify which extraneous factors must be controlled and which must not.
Distinguish intervening variables, colliders, and confounders, and explain why controlling each has different consequences.
Apply the change-in-estimate rule (20–30% threshold) to decide whether a candidate variable is confounding.
Compute the Mantel-Haenszel pooled odds ratio across strata, including its 95% CI and a test of interaction (Breslow-Day or Woolf).

12.4 Detection of Confounding

12.4.1 Using Causal Diagrams (DAGs)

Identifying which potential confounders need to be controlled can be accomplished using directed acyclic graphs (DAGs) (Greenland, Pearl, & Robins, 1999). The process:

▸ INTERACTIVE STORY: THE BACKDOOR PATH
Open full screen ↗

Watch the front door (causal) and back door (confounded) of a DAG, then close the backdoor by conditioning. Next ▶ advances scenes.

A 7-scene visualization of Pearl's backdoor criterion: nodes E, Y, and confounder C; the causal front door E→Y; the spurious backdoor through C; conditioning on C as the door slamming shut; and the ice-cream/drownings example to ground the abstraction.

Draw the diagram using the principles from Chapter 1
Eliminate all arrows emanating from the exposure factor of interest (CIG)
If any paths still connect the exposure to the outcome, the causally prior factors and non-intervening variables on these paths must be controlled
Connect marginally independent factors that become conditionally associated when a common effect is controlled (shown as a dashed line)

Example 12.4: DAG for Smoking & Birth Weight

In studying the effect of cigarette smoking (CIG) on birth weight (BWT), with RACE, COLLEGE, TBO (total birth order), and WTGAIN as additional factors:

After removing direct causal arrows from CIG, the path from CIG to BWT through WTGAIN remains, but WTGAIN is an intervening variable and should not be controlled
TBO needs to be controlled (causal path from TBO to CIG)
Controlling TBO makes COLLEGE and RACE conditionally associated, so either (or both) must be controlled to break the remaining pathway

12.4.2 Change in Measure of Association

A practical approach: compare the crude OR (OR_c) with the adjusted OR (OR_a) obtained after stratification. If the change exceeds 20–30%, confounding is considered important.

Three Important Notes

Always use the unadjusted values as the baseline when computing % change
For ratio measures (OR), compute % change on the log scale (% change in lnOR). Ratios are multiplicative, so on the log scale a halving and a doubling count as the same size of change, and the percentage is not distorted by which estimate you start from
Apply the % change criterion only to statistically significant variables; non-significant variables with lnOR ≈ 0 can have very large % changes

Non-Collapsibility of Odds Ratios

The odds ratio is not always collapsible: even in the absence of confounding, the crude OR can differ from the stratum-specific ORs (Greenland, Robins, & Pearl, 1999). This typically occurs when outcome frequency is high. A >20–30% change in OR might look like confounding but could simply be non-collapsibility.

In plain terms: the odds ratio does not blend across subgroups the way a risk ratio does, so merging strata can nudge it even when nothing is being confounded. When the outcome is common, treat a modest odds-ratio change after stratifying with extra caution, and consider checking the risk ratio, which does not have this quirk.

12.5 Analytic Control: The Mantel-Haenszel Estimator

The Mantel-Haenszel (MH) procedure (Mantel & Haenszel, 1959) is the most widely used stratified analytic approach. It involves physically stratifying data by levels of the confounder(s), examining stratum-specific ORs, and computing a pooled ‘adjusted’ estimate.

Key Formulae

Eq 12.4: Stratum-specific OR

OR_j = (a_1j × b_0j) / (a_0j × b_1j)

Within each stratum, the odds ratio is the usual cross-product of that stratum’s two-by-two table.

Eq 12.7: Mantel-Haenszel adjusted OR

OR_MH = Σ(a_1j × b_0j / n_j) ⁄ Σ(a_0j × b_1j / n_j)

The Mantel-Haenszel estimator pools the stratum tables into one confounding-adjusted odds ratio, weighting each stratum by its size.

Eq 12.8: Wald test for homogeneity

χ²_homo = Σ [(lnOR_j − lnOR_MH)² / var(lnOR_j)]

The homogeneity test asks whether the stratum-specific odds ratios are similar enough to summarise with a single number; a large value signals effect modification rather than plain confounding.

Eq 12.9: Overall test (is OR_MH = 1?)

χ²_MH = (Σa_1j − ΣE_j)² / ΣV_j

The Mantel-Haenszel chi-squared tests whether the pooled, adjusted odds ratio differs from the null value of one.

📊 Interactive: Mantel-Haenszel Stratified Analysis

A study with 2,000 participants. Adjust how strongly the confounder C is linked to the exposure and to the outcome, then watch the crude OR diverge from the stratum-specific ORs and the pooled OR_MH. The change-in-estimate ("Δ") tells you whether stratification matters.

Crude (unstratified) 2×2

	Y+	Y−	Total
E+	281	536	817
E−	198	985	1183

Stratum 1: C+

	Y+	Y−
E+	239	278
E−	145	338

Stratum 2: C−

	Y+	Y−
E+	42	258
E−	53	647

Crude vs. stratum-specific vs. MH-adjusted OR

Crude OR

2.61

OR | C+

2.00

OR | C−

1.99

OR_MH

2.00

% change vs crude

-23%

Homogeneity?

homogeneous

True (within-stratum) OR 2.00

C → Exposure strength 2.50

C → Outcome strength (baseline risk in C+) 0.30

Effect modification: OR | C− multiplier 1.00

Prevalence of C+ 0.50

Presets:

Confounding present: crude OR moves -23% away from the stratum-specific truth. The MH estimator (2.00) is the unbiased estimate.

12.5.2 Interaction

Interaction occurs when the combined effect of 2 variables differs from the sum (or product) of their individual effects. There are 3 types of joint effects:

Additive ScaleClick to explore

Multiplicative ScaleClick to explore

Synergism & AntagonismClick to explore

Key Rule: When Interaction Is Present

When stratum-specific measures differ significantly (interaction is present), we should not compute a single summary OR_MH. Instead, we must report stratum-specific estimates because the effect of the exposure depends on the level of the other variable. This phenomenon is also called effect modification.

Reflection

Consider a study where the crude OR is 1.69 and the Mantel-Haenszel adjusted OR is 1.97 (a 17% change). Would you consider this sufficient evidence of confounding? What factors would influence your decision?

Model answerA 17% change between crude (1.69) and adjusted (1.97) ORs exceeds the conventional 10% threshold for confounding, so the answer is generally yes. But several factors complicate that conclusion: (a) direction of change matters: the adjusted OR is larger than the crude, suggesting negative confounding where the confounder was masking the true effect; this is informative but less common than positive confounding. (b) Precision: if the CIs widely overlap, the apparent 17% difference may be sampling noise; check the CI on the difference. (c) Mechanism plausibility: confirm that the adjustment variable is biologically a confounder (not a mediator or collider) using a DAG. (d) Sensitivity: vary the adjustment variable list and check stability. The 10% rule is a heuristic; the proper test is whether the DAG and substantive reasoning support the variable as a confounder, with the 17% change being supportive evidence rather than definitive proof.

Minimum 20 characters required.

✓ Reflection saved

Section 3 of 5

Alternative Methods & Propensity Scores

⏱ Estimated reading time: 25 minutes

Section 3 of 5

Alternative Methods & Propensity Scores

When stratification breaks down: regression, marginal structural models, instrumental variables, and propensity scores.

The workhorse

Multivariable regression

Logistic regression model

\[ \color{#0B7B6B}{\ln\!\ \frac{p}{1-p}} = \beta_0 + \color{#C2410C}{\beta_E} \color{#6D28D9}{E} + \color{#1D4ED8}{\beta_1 C_1 + \beta_2 C_2} + \cdots \]

log odds log odds of diseaseβ_E adjusted log odds ratio for exposureE exposureβ_kC_k confounder terms held fixed

The adjusted odds ratio for the exposure is \(e^{\hat{\beta}_E}\). If \(\hat{\beta}_E\) changes by more than 30% when a candidate confounder \(C_k\) is added, confounding by \(C_k\) is meaningful.

Marginal effects

Standardisation and marginal structural models

Standardised Risk Ratio

\[ \color{#0B7B6B}{\text{SRR}} = \frac{\color{#C2410C}{\text{Observed cases}}}{\color{#1D4ED8}{\text{Expected cases under standard rates}}} \]

SRR standardised risk ratioobserved cases actually seenexpected cases predicted from the standard population

IPTW weight for each subject

\[ \color{#0B7B6B}{W_T} = \frac{1}{\color{#C2410C}{p(E = e \mid C)}} \]

W_T inverse-probability-of-treatment weightp(E=e | C) probability of the observed exposure given confounders

Weighting by the inverse of the exposure probability creates a pseudo-population where the exposure-confounder association is broken. The weighted crude estimate then approximates the true causal effect.

Bypassing confounders

Instrumental variable analysis

True causal effect via IV

\[ \color{#0B7B6B}{\text{TCE}} = \frac{\color{#C2410C}{p(D^+ \mid Z=1) - p(D^+ \mid Z=0)}}{\color{#1D4ED8}{p(E^+ \mid Z=1) - p(E^+ \mid Z=0)}} \]

TCE true causal effectnumerator instrument’s effect on the outcomedenominator instrument’s effect on the exposure

The instrument Z must: (1) affect E causally, (2) have no direct effect on D, and (3) share no common causes with D. Valid instruments are rare but powerful. Mendelian randomisation is the leading application in epidemiology.

A single-number summary

Propensity scores

A propensity score \(p(E^+ \mid C)\) condenses many confounders into one number. Four ways to use it:

Matching

Match exposed to unexposed with similar scores. Most common; estimates average treatment effect in the treated.

Stratification

Divide into quintiles of the PS; estimate effect within each stratum and pool.

IPTW

Weight by \(1/p\) (exposed) and \(1/(1-p)\) (unexposed). Equivalent to direct standardisation.

Covariate

Include PS directly in the outcome regression. Simplest but least flexible.

Carry forward

Next: the unmeasured case

Every method in this section assumes conditional exchangeability: given the measured confounders, the exposed and unexposed groups are comparable. Unmeasured confounders violate this assumption.

Propensity scores: only as good as the covariates in the prediction model.
Marginal structural models: same no-unmeasured-confounders requirement as regression.
Instrumental variables: the only method that can handle unmeasured confounders, if a valid instrument exists.

Introduction and Overview

Stratification works beautifully for one or two confounders but breaks down quickly when you need to adjust for several at once. This section turns to the analytic methods that scale beyond a few strata: multivariable regression (the workhorse of a later course), and the alternatives of restriction-by-design, instrumental variables, and propensity scores. Each is a different strategy for the same goal of estimating the exposure–outcome effect while making the remaining differences between groups irrelevant.

Learning Objectives

Use multivariable regression to adjust for multiple confounders simultaneously and interpret the adjusted exposure coefficient.
Distinguish standardised risk ratios, marginal structural models, and instrumental-variable estimators as alternative confounding-control strategies.
Estimate a propensity score and apply it via matching, stratification, weighting, or covariate adjustment.
Articulate when each method (regression, MSM, IV, propensity score) is preferred and what each requires from the data.

12.6 Multivariable Modelling

The most commonly used analytical method for controlling confounding is to include confounders in a multivariable model (e.g., logistic regression). The effect of the exposure is estimated while holding other factors constant.

Rule of Thumb

If the coefficient for a predictor changes by >30% when a putative confounder is added to the model, then substantial confounding exists. Note that the ‘adjusted’ measures from multivariable models are direct causal effects only, not total causal effects.

12.7 Other Approaches to Control Confounding

12.7.1 Standardised Risk Ratios (SRR)

Standardisation uses stratum-specific risks applied to a standard population. The SRR compares observed vs. expected number of cases:

Standardised risk ratio

SRR = (observed cases) / (expected cases using standard rates)

The standardised risk ratio compares the cases actually observed with the number expected if the standard population had experienced the stratum-specific risks; it stays valid even when the strata differ.

Unlike the MH estimator, the SRR provides a valid summary even in the presence of interaction, because the population of interest is specified. The SRR is a non-parametric method based on physical stratification.

12.7.2 Marginal Structural Models

The marginal structural model (Robins, Hernán, & Brumback, 2000) uses weights to create an unconfounded pseudo-population from which the causal effect can be estimated using a crude (marginal) measure.

The weight assigned to each subject is the inverse probability of treatment weight (IPTW): W_T = 1/p_E, where p_E = p(E=e|C) is the conditional probability of the observed exposure given confounders.

The total pseudo-population is twice the size of the observed population and contains information on the counterfactual outcome. The IPTW estimate is equivalent to the SRR_tot estimate.

12.7.3 Instrumental Variables

An instrumental variable (IV) Z (Angrist, Imbens, & Rubin, 1996; Hernán & Robins, 2006) must meet 3 requirements:

It has a direct causal effect on the exposure E
It is unrelated to the outcome D except through E
It shares no common causes with the outcome

The true causal effect (TCE) is estimated as:

True causal effect via an instrumental variable

TCE = [p(D+|Z=1) − p(D+|Z=0)] / [p(E+|Z=1) − p(E+|Z=0)]

An instrumental variable estimates the causal effect by scaling its effect on the outcome by its effect on the exposure, which sidesteps the need to measure the confounders.

The key advantage: we do not need to condition on confounders C. The IV approach bypasses confounding entirely. However, finding a valid IV in observational studies is very difficult.

12.8 Propensity Scores

A propensity score (PS) is the conditional probability of being treated/exposed given measured covariates: p(E+|C). Propensity scores condense multiple confounders into a single scalar summary (Rosenbaum & Rubin, 1983; Austin, 2011).

12.8.1 Computing Propensity Scores

With 1–2 categorical confounders, PSs can be calculated manually. With more confounders, use a logit or probit model predicting treatment (exposure) allocation as the outcome. Include all potential confounders (known or suspected) and their interactions.

12.8.2 Balancing Exposure Groups

A study is balanced if: (1) the average PS value is the same in exposed and non-exposed within each PS stratum, and (2) the mean of all covariates making up the PS is equal across groups within each stratum.

Analysis is limited to the region of common support, the observations falling in the range of PSs that includes both exposed and non-exposed individuals.

12.8.3–6 Using Propensity Scores

PSs can be used in four ways:

Method	Description
Matching	Match exposed to non-exposed with similar PSs. Methods: nearest-neighbour, radius, kernel matching
Stratification	Divide into PS strata (blocks); compute att within each stratum and pool
Covariate in model	Include PS as a continuous or categorical variable in the regression model
Weighting (IPTW)	Weight observations by inverse of PS to create pseudo-population

The most common effect measure with PS methods is the average treatment effect in the treated (att): the difference in outcome between treated (exposed) and non-treated (non-exposed) groups.

R Activity: Mantel-Haenszel adjustment + inverse-probability-of-treatment weighting

The companion R script r-activities/HSCI_341_Lesson_12_Confounding_and_Causal_Inference.R walks through two confounding-control workflows: (A) a Mantel-Haenszel adjusted OR for smoking and lung cancer stratified by age (with stratum-specific ORs to check for effect modification), and (B) an inverse-probability-of-treatment weighted (IPTW) logistic regression with a known simulated treatment effect of log-OR = 0.5.

# PART A -- Mantel-Haenszel adjusted OR (2x2x2 array)
arr <- array(c( 22, 5,    10, 25,
                75, 15,   35, 85),
              dim = c(2, 2, 2),
              dimnames = list(Smoke = c("Yes", "No"),
                              Case  = c("Yes", "No"),
                              Age   = c("Young", "Old")))

mantelhaen.test(arr, exact = FALSE)             # MH OR + CI
apply(arr, 3, function(t) (t[1,1]*t[2,2]) / (t[1,2]*t[2,1])) # stratum-specific

crude <- margin.table(arr, c(1, 2))             # collapse strata
(crude[1,1]*crude[2,2]) / (crude[1,2]*crude[2,1])    # crude OR

# PART B -- inverse-probability-of-treatment weighting
set.seed(341)
n   <- 2000
age <- rnorm(n, 60, 10)
A   <- rbinom(n, 1, plogis(-3 + 0.05*age))         # treatment
Y   <- rbinom(n, 1, plogis(-2 + 0.04*age + 0.5*A))  # outcome (true log-OR=0.5)
df  <- data.frame(age, A, Y)

ps_mod <- glm(A ~ age, data = df, family = binomial)
ps     <- predict(ps_mod, type = "response")
w      <- ifelse(df$A == 1, 1/ps, 1/(1-ps))         # IPTW

coef(glm(Y ~ A, data = df, family = binomial))["A"]                # crude
coef(glm(Y ~ A, data = df, family = binomial, weights = w))["A"]    # IPTW

What you should be able to do after this activity: compute and compare crude, stratum-specific, and Mantel-Haenszel ORs; estimate a propensity score; build IPT weights; and check whether the weighted estimate recovers a known simulated effect.

R Reflect on what you just ran

Use the questions below to interpret the actual numbers from your Mantel-Haenszel and IPTW outputs. Look at the console before answering.

1. From mantelhaen.test(arr, exact = FALSE), report the common (adjusted) OR with its 95% CI. How does it compare to the crude OR you computed from margin.table()? What does the difference (or lack of difference) say about age as a confounder?

Model answermantelhaen.test(arr, exact = FALSE) returns a common (adjusted) OR of about 11.9, with a 95% CI comfortably above 1, so the smoking and lung cancer association is strong and clearly non-null. The crude OR from margin.table() is almost identical, also about 11.9, because (97×110)/(45×20) = 11.9. Since the adjusted and crude estimates barely differ, far below the usual change-in-estimate threshold, age is NOT confounding these data: stratifying on age leaves the odds ratio essentially unchanged. A variable earns the label confounder only when adjusting for it actually moves the estimate.

2. The apply() line printed two stratum-specific ORs (Young, Old). Report both. Are they similar enough to justify a single MH summary OR, or is there evidence of effect modification (i.e., the OR differs substantially by stratum)?

Model answerThe apply() line returns stratum-specific ORs of about 11.0 for the Young stratum, since (22×25)/(10×5) = 11.0, and about 12.1 for the Old stratum, since (75×85)/(35×15) = 12.1. They sit close to one another and to the pooled MH value of about 11.9, so the odds ratios are homogeneous and a single Mantel-Haenszel summary is justified. Effect modification would instead show strata that differ substantially, for example an OR near 3 in one stratum and near 15 in the other; a formal Breslow-Day or Tarone homogeneity test would not reject homogeneity here.

3. Compare the crude coef(...)["A"] and the IPTW-weighted coef(..., weights = w)["A"]. Which is closer to the true simulated log-OR of 0.5, and why would the IPTW estimate be biased if you had OMITTED age from ps_mod?

Model answerThe crude log-OR coefficient on A is typically around 0.66 (OR 1.95), while the IPTW-weighted estimate is closer to 0.50, matching the simulated true log-OR. IPTW is closer because the inverse-probability weights create a pseudo-population where treatment assignment is independent of measured confounders. If age were omitted from ps_mod, the propensity score would not adjust for the confounding age induces, so the IPTW estimate would inherit the same age-driven bias as the crude analysis, drifting back toward 0.66. The general lesson: propensity-score methods depend critically on the no-unmeasured-confounders assumption; omitting a confounder from the PS model defeats the purpose of using PS at all.

Saved.

Reflection

Compare the propensity score approach to traditional multivariable regression for controlling confounding. In what situations might propensity scores be preferable, and what are their limitations?

Model answerPropensity score methods are preferable when: (a) many confounders, few outcome events, where PS reduces the dimensionality of the adjustment problem from many covariates to a single scalar; (b) treatment effect heterogeneity, where PS matching produces a sample in which exposure is balanced on covariates, supporting clearer ATT/ATE estimation; (c) policy relevance, where PS weighting creates a target population for the treatment effect that can be specified (ATE, ATT, ATU); (d) doubly robust estimation, where pairing PS with outcome regression protects against misspecification of either model. Limitations: (1) PS only adjusts for measured confounders, with no protection against unmeasured ones, same as regression; (2) requires positivity (every unit must have a non-zero chance of being treated and untreated); (3) extreme weights inflate variance; (4) covariate balance must be checked rigorously; (5) when the outcome model is well-specified and confounders few, multivariable regression is more efficient. PS is not a magic bullet; it is an alternative parameterisation of the same identification problem.

Minimum 20 characters required.

✓ Reflection saved

Section 4 of 5

Unmeasured Confounders & Causal Relationships

⏱ Estimated reading time: 25 minutes

Section 4 of 5

Unmeasured Confounders & Causal Relationships

External adjustment, sensitivity analysis, E-values, and the eight types of extraneous variable relationships.

Using external data

External adjustment for unmeasured confounders

Eq 12.12: Estimated cell values

\[ \color{#C2410C}{a_1} = a \cdot \color{#6D28D9}{p_1}, \quad \color{#C2410C}{b_1} = b \cdot \color{#1D4ED8}{p_2}, \quad c_1 = a - \color{#C2410C}{a_1}, \quad d_1 = b - \color{#C2410C}{b_1} \]

a₁, b₁ reconstructed cells with the confounder presentp₁ confounder prevalence among exposed casesp₂ confounder prevalence among unexposed cases

where \(p_1\) is the prevalence of the unmeasured confounder among exposed cases, and \(p_2\) is its prevalence among unexposed cases. The Mantel-Haenszel OR is then computed from the estimated strata.

Bounding the threat

Sensitivity analysis

Systematically vary two parameters across a plausible range:

Parameter 1

Strength of the unmeasured confounder–disease association: \(\text{OR}_{ZD}\).

Parameter 2

Prevalence difference of the confounder between exposed and unexposed groups: \(p_1 - p_2\).

If the adjusted estimate remains far from the null across all plausible combinations, the finding is robust. If even a modest confounder drives it to null, the finding is fragile.

A single-number summary

The E-value (VanderWeele & Ding, 2017)

E-value for an observed RR

\[ \color{#0B7B6B}{E} = \color{#C2410C}{\text{RR}} + \sqrt{\color{#C2410C}{\text{RR}}(\color{#C2410C}{\text{RR}}-1)} \]

E E-value (minimum confounder strength to explain the result)RR observed risk ratio

The E-value is the minimum strength of association that an unmeasured confounder would need with both exposure and outcome to fully explain the observed effect. Larger E-values indicate more robust findings. The formula shown applies when RR > 1; analogous forms exist for the OR and hazard ratio.

Curve of the E-value against the observed risk ratio, rising faster than the diagonal: an observed RR of 1.4 needs a confounder of about 2.2, while an RR of 2.0 needs about 3.4. — Figure 12.8. The E-value grows faster than the observed risk ratio, so stronger associations demand much stronger unmeasured confounders to explain away. The marked points are the worked examples (RR = 1.4 and RR = 2.0).

Eight structural types

Extraneous variable relationships

True confounders

Explanatory antecedent (complete or incomplete), distorter, suppressor: F is associated with both E and D, not on the causal path.

Should NOT be controlled

Intervening variable (mediator): F lies on E → F → D. Controlling it blocks the causal effect.

Not confounders

Exposure-independent variable, simple antecedent: F is associated with D or E but not both in the confounding-relevant way.

Report by stratum

Moderator (effect modifier): the E-D effect varies across levels of F. Not bias. Report stratum-specific estimates.

Wrapping up the course

The logic running through every method

External adjustment uses published prevalence data to estimate what an adjusted OR would have been.
Sensitivity analysis and E-values bound how strong an unmeasured confounder must be to nullify the finding.
Eight structural types: only explanatory antecedents, distorters, and suppressors are true confounders.
Mediators must not be controlled. Effect modifiers must be reported by stratum.

Introduction and Overview

Earlier sections covered the methods that work when confounders are measured. This section tackles the harder case: what to do when key confounders are unmeasured or unknown. Sensitivity analyses, E-values, and structural reasoning about extraneous variables (mediators, colliders, effect modifiers) all give the working investigator tools for stating, transparently, how robust their conclusions are to the confounders they could not adjust for.

Learning Objectives

Apply external adjustment to estimate what an effect estimate would be if an unmeasured confounder had been measured.
Compute and interpret an E-value (VanderWeele & Ding, 2017) to bound the strength of unmeasured confounding required to nullify an observed effect.
Distinguish confounders, intervening variables (mediators), colliders, and effect modifiers, and explain the consequences of (mis)treating each.
Decide when to report direct, indirect, or total effects, and articulate the structural assumptions each entails.

12.9 Unmeasured / Unknown Confounders

The Hidden Threat: all the methods discussed so far, including restriction, matching, stratification, multivariable modelling, and propensity scores, require that confounders be measured. But what if a critical confounder was never collected, or is entirely unknown? Residual confounding from unmeasured variables is one of the most important limitations of observational research.

External Adjustment

When a confounder was not measured in the study but information about its distribution exists from external sources, we can estimate what the adjusted measure would have been using external adjustment. The method works by estimating the cell values that would be expected if the confounder had been measured.

Equation 12.12: External adjustment cell estimation

For a 2×2 table stratified by an unmeasured binary confounder Z:

a₁ = a × p₁ b₁ = b × p₂ c₁ = a − a₁ d₁ = b − b₁

where p₁ = prevalence of Z among exposed cases,
p₂ = prevalence of Z among unexposed cases

The Mantel-Haenszel OR is then calculated from these estimated strata.

When a confounder is known only externally, its strata are filled in by multiplying the observed cell counts by the assumed exposure prevalences in each level of that confounder, producing an externally adjusted table.

Example 12.13: External Adjustment for STREP-CRD

Click to explore how external data on RSV prevalence can be used to estimate adjusted OR when RSV was not directly measured in the study.

Example 12.13: External Adjustment

In our STREP-CRD study, suppose RSV status was not measured. From external data we know:

Among exposed (STREP+) cases: 40% have RSV → p₁ = 0.40
Among unexposed (STREP−) cases: 10% have RSV → p₂ = 0.10

Using the crude data (a = 70, b = 30, c = 90, d = 210):

Stratum	a	b	c	d
RSV+ (estimated)	28	3	42	27
RSV− (estimated)	42	27	48	183

The MH OR from these estimated strata approximates the adjusted OR, illustrating how external information can help address unmeasured confounding, though with important assumptions about the accuracy of the external prevalence data.

Sensitivity Analysis

When no external data are available, sensitivity analysis explores how strong an unmeasured confounder would need to be to explain away an observed association. This does not eliminate confounding but quantifies the threat it poses to the study’s conclusions.

Key Question: “Could an unmeasured confounder plausibly be strong enough to account for the observed association?” If the required confounder-disease association or confounder-exposure prevalence difference is implausibly large, the finding is more robust.

Example 12.14: Sensitivity Analysis for Unmeasured Confounding

Click to see how varying assumptions about an unmeasured confounder’s strength affects the adjusted estimate.

Example 12.14: Sensitivity Analysis

Suppose we observe a crude OR = 5.44 for the STREP-CRD association. We suspect an unmeasured confounder Z might exist.

We systematically vary two parameters:

Prevalence difference of Z between exposed and unexposed groups
Strength of Z-disease association (OR_ZD)

OR_ZD	p₁=0.4, p₂=0.1	p₁=0.6, p₂=0.1	p₁=0.8, p₂=0.1
2.0	4.68	4.07	3.44
5.0	3.51	2.50	1.64
10.0	2.67	1.63	0.91

Even with a moderately strong unmeasured confounder (OR_ZD = 5, prevalence difference of 30%), the adjusted OR remains above 2.5, suggesting the STREP-CRD association is reasonably robust to unmeasured confounding.

Reflect: Unmeasured Confounding

Think about a published observational study you have encountered (or one from class). What unmeasured confounders might threaten its conclusions? How could sensitivity analysis help evaluate the robustness of its findings?

12.10 Understanding Causal Relationships with Extraneous Variables

The relationship between exposure (E), disease (D), and an extraneous variable (F) can take many forms. Understanding these patterns is critical for correctly interpreting what happens when you “control for” a variable.

Three Statistical Indicators: For each type of E-F-D relationship, we can predict:
1. Whether E-D association changes after controlling for F
2. Whether there is an F-D association
3. Whether there is an E-F association

Eight Types of Extraneous Variable Relationships

1. Exposure-Independent Variable

Click to reveal

F → D (no E-F link)

F causes D independently of E. Controlling for F does not change the E-D measure. There is an F-D association but no E-F association. F is not a confounder.

2. Simple Antecedent

Click to reveal

F → E → D

F causes E, which causes D. Controlling for F does not change the E-D measure. There is an F-D association and an E-F association. F is not a confounder; it acts through E.

3. Explanatory Antecedent (Complete)

Click to reveal

F → E and F → D (no E→D)

F causes both E and D, but E does not cause D. Controlling for F eliminates the E-D association. This is complete confounding: the entire observed E-D link is spurious.

4. Explanatory Antecedent (Incomplete)

Click to reveal

F → E and F → D and E → D

F causes both E and D, but E also independently causes D. Controlling for F changes but does not eliminate the E-D association. This is partial confounding, the classic confounder scenario.

5. Intervening (Mediating) Variable

Click to reveal

E → F → D

E causes F, which causes D (F is on the causal pathway). Controlling for F reduces or eliminates the E-D association. F should generally not be controlled for, as it would mask E’s true effect.

6. Distorter

Click to reveal

F distorts a null E-D relationship

There is no true E-D association, but F creates a spurious one. Crude analysis shows E-D association; controlling for F reveals the null. Both F-D and E-F associations exist. A distorter is a confounder that creates a false positive.

7. Suppressor

Click to reveal

F suppresses a true E-D relationship

A true E-D association exists but is hidden in crude analysis because F masks it. Controlling for F reveals or strengthens the E-D association. A suppressor is a confounder that creates a false negative.

8. Moderator (Effect Modifier)

Click to reveal

F modifies the E → D effect

F changes the magnitude of the E-D association across its strata. Controlling for F reveals different stratum-specific measures. Effect modification is a biological phenomenon, not a bias, so stratum-specific results should be reported separately.

Decision Guide: What Happens When You Control for F?

Type	E-D changes?	F-D assoc?	E-F assoc?	Confounder?
Exposure-independent	No	Yes	No	No
Simple antecedent	No	Yes	Yes	No
Explanatory (complete)	Yes → null	Yes	Yes	Yes
Explanatory (incomplete)	Yes → attenuated	Yes	Yes	Yes
Intervening variable	Yes → reduced	Yes	Yes	No*
Distorter	Yes → null	Yes	Yes	Yes
Suppressor	Yes → stronger	Yes	Yes	Yes
Moderator	Varies by stratum	May vary	May vary	No**

*Controlling for an intervening variable is usually inappropriate.
**Effect modification is a biological phenomenon, not bias.

12.11 Chapter Summary

Confounding is a fundamental threat to causal inference in observational studies. Its control requires a combination of study design strategies (restriction, matching) and analytical approaches (stratification, multivariable modelling, propensity scores). The choice among methods depends on the research question, data structure, and assumptions the investigator is willing to make.

Table 12.7. Summary: Effect of Controlling RSV on STREP-CRD

Click to review how different methods yielded similar adjusted estimates.

Table 12.7: Comparison of Confounding Control Methods

Method	OR Estimate	Key Feature
Crude (unadjusted)	5.44	No control for RSV
Restriction (RSV− only)	3.21	Limits generalizability
MH Stratification	3.38	Transparent, stratum-specific
Mantel-Haenszel (pooled)	3.38	Weighted average across strata
Logistic Regression	3.40	Handles multiple confounders
Propensity Score	~3.4	Balances many covariates
External Adjustment	~3.4	Uses external prevalence data

All methods converge on a similar adjusted OR of approximately 3.4, down from the crude OR of 5.44. This consistency strengthens confidence that RSV confounds the STREP-CRD association and that the true effect of STREP on CRD is approximately 3-fold.

Key Takeaways

Unmeasured confounders cannot be controlled directly; external adjustment and sensitivity analysis help evaluate their impact.
Sensitivity analysis asks how strong an unmeasured confounder must be to explain away findings, which strengthens or weakens confidence in results.
Eight types of extraneous variable relationships exist, and only some represent true confounding.
Intervening variables should generally not be controlled for; doing so obscures the causal pathway.
Effect modification is a biological phenomenon to report, not a bias to remove.
Multiple methods for controlling confounding typically yield similar results when applied correctly.

Reflection

Consider a real-world observational study (e.g., the association between coffee consumption and heart disease). Identify at least one potential unmeasured confounder and describe how you would design a sensitivity analysis to evaluate its impact on the study conclusions.

Model answerFor coffee and heart disease: a plausible unmeasured confounder is type-A behaviour pattern or chronic psychological stress, both associated with coffee consumption (stressed people may drink more coffee for energy) and with cardiovascular outcomes through cortisol, blood pressure, and inflammation. Sensitivity analysis: compute the E-value for the observed effect, that is, how strong an unmeasured confounder would need to be to nullify the observed RR? If the published effect is RR = 1.1, the E-value is ~1.43, meaning a confounder with associations of ~1.4 with both exposure and outcome would suffice, well within plausibility for stress. If RR = 2.0, E-value is ~3.4, which is harder to imagine for an unmeasured confounder. Combine with a quantitative bias analysis (Monte Carlo) varying the assumed prevalence and effect of the confounder. Conclude: a small observed effect with a small E-value is fragile; a large observed effect with a large E-value is more robust to plausible unmeasured confounding.

Minimum 20 characters required.

✓ Reflection saved

HSCI 341, Lesson 12

Fundamental Epidemiological Concepts and Approaches

Confounding and Causal Inference

Learning objectives for this lesson:

Glossary: Key Terms, People & Concepts

Introduction & Pre-Analysis Control of Confounding

Confounding and Causal Inference

What makes a variable a confounder?

Restriction

Complete restriction

Partial restriction

Matching: cohort versus case-control

Cohort matching

Case-control matching

Discordant pairs and McNemar's test

Next: detecting and adjusting

Introduction and Overview

Learning Objectives

12.1 Introduction

What Is Confounding?

12.1.1 Which Extraneous Factors Are Confounders?

Important Distinction

12.2 Control of Confounding Prior to Data Analysis

12.3 Matching on Confounders

Overmatching

Frequency vs. Pair Matching

Analysing Matched Data

Reflection

Detection of Confounding & Stratified Analysis

Detection of Confounding & Stratified Analysis

Directed acyclic graphs

Change-in-estimate: 20–30% threshold

Mantel-Haenszel adjusted odds ratio

Interaction and effect modification

Additive interaction

Multiplicative interaction

Non-collapsibility of the odds ratio

When it matters

Risk-based measures

Next: methods that scale up

Introduction and Overview

Learning Objectives

12.4 Detection of Confounding

12.4.1 Using Causal Diagrams (DAGs)

12.4.2 Change in Measure of Association

Three Important Notes

Non-Collapsibility of Odds Ratios

12.5 Analytic Control: The Mantel-Haenszel Estimator

Key Formulae

📊 Interactive: Mantel-Haenszel Stratified Analysis

Crude (unstratified) 2×2

Stratum 1: C+

Stratum 2: C−

Crude vs. stratum-specific vs. MH-adjusted OR

12.5.2 Interaction

Key Rule: When Interaction Is Present

Reflection

Alternative Methods & Propensity Scores

Alternative Methods & Propensity Scores

Multivariable regression

Standardisation and marginal structural models

Instrumental variable analysis

Propensity scores

Matching

Stratification

IPTW

Covariate

Next: the unmeasured case

Introduction and Overview

Learning Objectives

12.6 Multivariable Modelling

Rule of Thumb

12.7 Other Approaches to Control Confounding

12.7.1 Standardised Risk Ratios (SRR)

12.7.2 Marginal Structural Models

12.7.3 Instrumental Variables

12.8 Propensity Scores

R Reflect on what you just ran

Reflection

Unmeasured Confounders & Causal Relationships